A complete English word-frequency counting example using file I/O

# Write the sample passage to test.txt.
fo=open('test.txt','w')
fo.write('''During my childhood, think lucky money and new clothes are necessary for New Year, but as the advance of the age, will be more and more found that those things are optional; Junior high school, thought to have a crush on just means that the real growth, but over the past three years later, his writing of alumni in peace, suddenly found that isn't really grow up, it seems is not so important; Then in high school, think don't want to give vent to out your inner voice can be in the high school children of the feelings in a period, but was eventually infarction when graduation party in the throat, later again stood on the pitch he has sweat profusely, looked at his thrown a basketball hoops, suddenly found himself has already can't remember his appearance.''')
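fo.close()  # close the write handle so the text is flushed to disk

# Read the passage back from the file.
fo=open('test.txt','r')
text=fo.read()  # named text rather than str to avoid shadowing the built-in
fo.close()

# Normalize: lowercase, then replace punctuation with spaces.
text=text.lower()
for i in ',.;!':
    text=text.replace(i,' ')
words=text.split()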
exp={'',',','the','to','a','and','is','are','it','in','of','but','that',
     'his','be','then'}

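# Count every word that is not in the stop-word set exp.
dic={}
keys=set(words)-exp
for i in keys:
    dic[i]=words.count(i)

# Sort by frequency, highest first, and print the top 20
# (slicing avoids an IndexError when there are fewer than 20 words).
d=list(dic.items())
d.sort(key=lambda x:x[1],reverse=True)
for item in d[:20]:
    print(item)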
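
The same steps (normalize, split, drop stop words, count, sort) can also be sketched with the standard library's `collections.Counter` and a `with` block for the file. This is only one possible variant, not the original script: the function names `top_words` and `top_words_in_file`, the `STOP_WORDS` constant, and the regular expression used for tokenizing are assumptions made for illustration.

```python
import re
from collections import Counter

# Stop words copied from the exp set in the script above
# (STOP_WORDS is a name chosen for this sketch).
STOP_WORDS = {'the', 'to', 'a', 'and', 'is', 'are', 'it', 'in', 'of',
              'but', 'that', 'his', 'be', 'then'}


def top_words(text, n=20, stop_words=STOP_WORDS):
    """Return the n most frequent words in text, skipping stop words."""
    # Assumption: a word is a run of letters and apostrophes, so
    # contractions such as "isn't" stay together as one token.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t not in stop_words)
    # most_common sorts by count, highest first.
    return counts.most_common(n)


def top_words_in_file(path, n=20):
    """Read a text file and return its n most frequent words."""
    # The with block closes the file even if reading fails.
    with open(path, 'r') as f:
        return top_words(f.read(), n)
```

Calling `top_words_in_file('test.txt')` after the passage has been written would return a list of `(word, count)` pairs in the same shape as `d` above. Counting with `Counter` makes a single pass over the words, whereas `words.count(i)` rescans the whole list for every distinct word.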

From the results, the passage appears to be a story about the author's childhood and lucky money.

Original post: https://www.cnblogs.com/hxl316/p/7602332.html