Finding the 5 most frequent words in a file (Python)

船公司的投放的好人

Article link: https://blog.csdn.net/qq_39283195/article/details/89176984

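def getText():
    # Read the whole file; 'with' closes it automatically afterwards.
    with open(r'C:\Users\jxiong\Desktop\xu\1.txt', 'r', encoding='utf-8') as f:
        txt = f.read()
    txt = txt.lower()  # normalize case so "The" and "the" count as one word
    # Replace punctuation with spaces so that split() separates words cleanly.
    for ch in '~@#$%^&*()_-+=<>?/,.:;{}[]|\'"':
        txt = txt.replace(ch, ' ')
    return txt

hamletTxt = getText()
words = hamletTxt.split()
counts = {}
sumcount = 0  # total number of words (not printed below)
for word in words:
    counts[word] = counts.get(word, 0) + 1
    sumcount = sumcount + 1
# Sort the (word, count) pairs by count, highest first.
sorted_word_freq = sorted(counts.items(), key=lambda v: v[1], reverse=True)
for item in sorted_word_freq[:5]:  # print the top 5 words
    print(item[0], item[1])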

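The key step is sorting the dictionary: counts.items() yields (word, count) pairs, and sorted() with key=lambda v: v[1] and reverse=True orders them from the most frequent word to the least frequent.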
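
To make the sorting step easier to try in isolation, here is a minimal sketch that runs on an in-memory word list instead of a file. The helper name top_n and the sample sentence are made up for illustration and are not part of the original code. The last line uses collections.Counter.most_common from the standard library, which gives the same kind of result and is a common alternative to sorting the dictionary by hand.

```python
from collections import Counter


def top_n(words, n=5):
    """Return the n most frequent words as (word, count) pairs."""
    counts = {}
    for word in words:
        counts[word] = counts.get(word, 0) + 1
    # sorted() is stable, so words with equal counts keep first-seen order.
    return sorted(counts.items(), key=lambda kv: kv[1], reverse=True)[:n]


# Hypothetical sample text, used only for this illustration.
sample = "to be or not to be that is the question to be".split()

print(top_n(sample, 3))  # [('to', 3), ('be', 3), ('or', 1)]

# Standard-library alternative: Counter does the counting and the ranking.
print(Counter(sample).most_common(3))
```

Sorting every item costs more than strictly needed when only the top 5 are wanted, but for a single text file the difference is unlikely to matter, and the sorted() version keeps the logic visible.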