python词频统计之《哈姆雷特》_哈姆雷特词频统计,Hamlet

最新推荐文章于 2023-09-25 20:47:03 发布

lover bacon

最新推荐文章于 2023-09-25 20:47:03 发布

阅读量7k

点赞数 3

文章标签： python词频统计之《哈姆雷特》

本文链接：https://blog.csdn.net/weixin_34234442/article/details/114463139

版权

关键词由CSDN通过智能技术生成

def getText():

txt=open(“hamlet.txt”,“r”).read()

txt=txt.lower() #将所有字母转换成小写

for ch in ‘|"#$%&()*+,-./:;<=>?@[\]^{|}~’:

txt=txt.replace(ch," “) #完成对txt的遍历，用空格代替特殊符号

return txthamlet

Txt=getText()

words=hamletTxt.split() #分隔字符，默认分割字符为空格

counts = {}#定义一个空字典

for word in words:

counts[word] = counts.get(word,0) + 1 #若在遍历中找到单词，则返回该单词+1后的个数，若该单词不在，则将这个单词加到字典中，count+1。

items=list(counts.items()) #将字典转换成列表

items.sort(key=lambda x:x[1],reverse=True) #按列表元素中的第二个项从大到小排序

for i in range(10):

word,count=items[i]

print(”{0:<10}{1:>5}".format(word,count))

源代码出自MOOC Python程序语言设计6.6

添加链接描述

关于sort()的详细解释在https://blog.csdn.net/weixin_38198369/article/details/105021708

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注