Word Frequency Count

import operator

# Read the whole lyrics file; the with-block closes it automatically
with open('baby.txt','r') as file:
    lyrics=file.read()
# Alternative: define the lyrics inline instead of reading baby.txt
# lyrics='''Oh woah
# Oh woah
# Oh woahohhohh
#
# You know you love me, I know you care
# You shout whenever, and I'll be there
# You want my love, you want my heart
# And we will never ever ever be apart
# Are we an item? Girl quit playing
# We're just friends, what are you saying
# Said there's another and look right in my eyes
# My first love broke my heart for the first time
# And I was like
#
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
# Oh, Oh
#
# For you, I would have done whatever
# And I just can't believe we ain't together
# And I wanna play it cool, but I'm losing you
# I'll buy you anything, I'll buy you any ring
# And I'm in pieces, baby fix me
# And you'll shake me till you wake me from this bad dream
# I'm going down, down, down, down
# And I just can't believe my first love won't be around
# And I'm now like
#
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
#
# When I was 13 I had my first love
# There was nobody that compared to my baby
# And nobody came between us or could ever come above
# She had me going crazy
# Oh I was starstruck
# She woke me up daily don't need no starbucks
# She made my heart pound
# Asking for a beat when I see her in the street
# And at school on the playground
# But I really wanna see her on the weekend
# She knows she got me dazy
# Cause she was so amazing
# And now my heart is breaking
# But I'll just keep on saying
#
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
# Baby, baby, baby, oh like
# Baby, baby, baby, no like
# Baby, baby, baby, oh
# I thought you'd always been mine, mine
# Now I'm all gone
# Now I'm all gone
# Now I'm all gone
# Now I'm all gone
# I am gone'''
# Stop words to exclude from the count
non={'a','above','all','am','an','any','be','been','between'}

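# Replace punctuation with spaces so it does not stick to the words
lyrics=lyrics.replace("'",' ')
lyrics=lyrics.replace(',',' ')
lyrics=lyrics.replace('?',' ')

# Lowercase the text and split it into words on whitespace
wordlist=lyrics.lower().split()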

# Count each distinct word once, skipping the stop words
dic={}
word=set(wordlist)-non
for w in word:
    dic[w]=wordlist.count(w)

# Alternative single-pass count (note: this version does not skip the stop words)
# for w in wordlist:
#     dic[w]=dic.get(w,0)+1

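# Sort the (word, count) pairs by count, highest first
sorted_dic=sorted(dic.items(),key=operator.itemgetter(1),reverse=True)

# Print the 10 most frequent words (slicing avoids an IndexError if there are fewer than 10)
for item in sorted_dic[:10]:
    print(item)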
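
The same count can also be sketched with the standard library's collections.Counter, which tallies the words in one pass instead of calling wordlist.count() once per distinct word. This is only an illustrative variant, not the approach used in the script: the names STOP_WORDS, top_words and sample are hypothetical, and the regex tokenizer is an assumption that treats every non-alphanumeric character as a separator.

```python
import re
from collections import Counter

# Hypothetical name; the values are copied from the `non` set in the script.
STOP_WORDS = {'a', 'above', 'all', 'am', 'an', 'any', 'be', 'been', 'between'}

def top_words(text, stop_words=STOP_WORDS, n=10):
    """Return the n most common (word, count) pairs, skipping stop words."""
    # Lowercase, then extract runs of letters and digits; apostrophes,
    # commas, question marks and whitespace all act as separators.
    words = re.findall(r"[a-z0-9]+", text.lower())
    # Counter tallies every word in a single pass over the list.
    counts = Counter(w for w in words if w not in stop_words)
    return counts.most_common(n)

# Short sample taken from the chorus, used here instead of baby.txt.
sample = "Baby, baby, baby, oh like\nBaby, baby, baby, no like"
for word, count in top_words(sample, n=2):
    print(word, count)
```

To run it on the full lyrics, pass the text read from baby.txt to top_words in place of sample.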

Screenshot of the run

Original post: https://www.cnblogs.com/BOBOWZH/p/8652796.html