Chinese Word Frequency Counting and Word Cloud Generation

weixin_34293911

Original link: http://www.cnblogs.com/lqy-36/p/7591174.html

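import jieba
from wordcloud import WordCloud
import matplotlib.pyplot as plt

# Read the whole text file, then segment it into a list of words with jieba
with open('t.txt', 'r', encoding='utf-8') as f:
    fr = f.read()
words = jieba.lcut(fr)
excludes = {'.....'}  # words to drop from the counts (placeholder kept from the original)
counts = {}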

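# Count every word longer than one character
for word in words:
    if len(word) == 1:
        continue
    counts[word] = counts.get(word, 0) + 1

# Remove excluded words; pop() avoids a KeyError when a word never occurred
for word in excludes:
    counts.pop(word, None)
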
items=list(counts.items())
items.sort(key=lambda x:x[1],reverse=True)

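# Print the 20 most frequent words (fewer if the text has fewer distinct words)
for word, count in items[:20]:
    print("{0:<10}{1:>5}".format(word, count))

# Build the cloud from the frequency table. The original passed a
# (word, count) tuple to generate(), which expects a text string.
# font_path is an assumption: point it at any font file with Chinese glyphs.
mywc = WordCloud(font_path='simhei.ttf').generate_from_frequencies(counts)
plt.imshow(mywc)
plt.axis('off')
plt.show()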
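
The counting step in the script above can also be written as a small reusable function. The following is only a minimal sketch, not the original author's method: it swaps the hand-written dictionary loop for `collections.Counter` from the standard library. The function name `top_words` and the sample token list are hypothetical. The list stands in for the output of `jieba.lcut`, so the example runs without any third-party package.

```python
from collections import Counter


def top_words(tokens, excludes=(), n=20):
    """Return the n most frequent tokens, skipping single-character
    tokens and anything listed in excludes."""
    counts = Counter(t for t in tokens if len(t) > 1 and t not in excludes)
    return counts.most_common(n)


# Hypothetical pre-segmented input; in the script this would be jieba.lcut(text)
sample = ["词云", "统计", "的", "词云", "中文", "统计", "词云"]
result = top_words(sample, excludes={"中文"}, n=2)
print(result)
```

Filtering excluded words while counting, rather than deleting them afterwards, means a word that never appears in the text cannot raise a `KeyError`. To get a frequency table for `generate_from_frequencies`, the list of pairs can be turned back into a dictionary with `dict(result)`.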

Reposted from: https://www.cnblogs.com/lqy-36/p/7591174.html
