利用python jieba库统计政府工作报告词频

最新推荐文章于 2024-06-26 00:00:00 发布

weixin_34072637

最新推荐文章于 2024-06-26 00:00:00 发布

阅读量1.4k

点赞数 2

文章标签： python 开发工具爬虫

原文链接：http://www.cnblogs.com/hzxxxb/p/10652504.html

版权

1.安装jieba库

舍友帮装的，我也不会( ╯□╰ )

2.上网寻找政府工作报告

3.参照课本三国演义词频统计代码编写

import jieba
txt = open("D:\政府工作报告.txt","r",encoding='utf-8').read()
words  = jieba.lcut(txt)
counts = {}
for word in words:
    if len(word) == 1:
        continue
    else:
        counts[word] = counts.get(word,0) + 1
items = list(counts.items())
items.sort(key=lambda x:x[1], reverse=True) 
for i in range(10):
    word, count = items[i]
    print ("{0:<10}{1:>5}".format(word, count))

　　结果显示如下

可见改革和发展出现的次数还是很高的，高频词体现了政府工作的重点在于改革方面。

转载于:https://www.cnblogs.com/hzxxxb/p/10652504.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_34072637

关注关注

2
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
利用python jieba库统计政府工作报告词频

1.安装jieba库舍友帮装的，我也不会( ╯□╰ )2.上网寻找政府工作报告3.参照课本三国演义词频统计代码编写import jiebatxt = open("D:\政府工作报告.txt","r",encoding='utf-8').read()words = jieba.lcut(txt)counts = {}for word in words: ...
复制链接

扫一扫