gpt4 book ai didi

bash - 计算文件中相似行的数量

转载 作者:行者123 更新时间:2023-11-29 08:49:52 25 4
gpt4 key购买 nike

我在一个论坛上有一个主题,人们可以在上面写下他们的前 10 首歌曲列表。我想计算一首歌被列出的次数。相似度必须不区分大小写进行比较。

文件结构示例:

Join Date: Apr 2005
Location: bama via new orleans
Age: 48
Posts: 2,369
Re: Top 10 Songs Jethro Tull
oh dearrrr. the only way for all kaths to keep their last shred of sanity: fly through this list as quickly as possible, without stopping to think for a microsecond...
velvet green
dun ringill
skating away on the thin ice of a new day
sossity yer a woman
fat man
life's a long song
jack-a-lynn
teacher
mother goose
elegy

03-10-2010, 02:29 AM #5 (permalink)
Sox
Avoiding The Swan Song



Join Date: Jan 2010
Location: Derbyshire, England
Age: 43
Posts: 5,991
Re: Top 10 Songs Jethro Tull
Wow !!!! Where do I start ?
Dun Ringill
Aqualung
With You There To Help Me
Jack Frost And The Hooded Crow
We Used To Know
Witch's Promise
Pussy Willow
Heavy Horses
My Sunday Feeling
Locomotive Breath

Join Date: Nov 2009
Posts: 1,418
Re: Top 10 Songs Jethro Tull
Too bad they all can't make the list, but here's ten I never get tired of listening to:

Christmas Song
Witches Promise
Life's A Long Song
Living In The Past
Rainbow Blues
Sweet Dream
Minstral In The Gallery
Cup of Wonder
Rover
Something's On the Move

示例输出:

life's a long song 3
aqualung 1
...

最佳答案

你的文件的“结构”在结构部门有点欠缺,所以你必须处理一些过程中的错误。

假设所有这些都在一个名为 input 的文件中,请尝试:

tr '[A-Z]' '[a-z]' < input | \
egrep -v "^ *(join date|age|posts|location|re):" | \
sort | \
uniq -c

第一行将所有内容小写,第二行删除样本中看起来像电子邮件标题的内容,然后对唯一项目进行排序和计数。

关于bash - 计算文件中相似行的数量,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/8627014/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com