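Problems encountered:
1. Importing the scipy library fails: module not found
The first time I installed it, I downloaded a .whl file and installed it with:
pip install <absolute path>\.... .whl
- Solution: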
pip install scipy
Orz...
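One possible cause of a "module not found" error after a manual install is that the package ended up in a different Python than the one running the script. This is only a guess about what happened here. The sketch below checks whether the current interpreter can locate a module; `is_importable` is a hypothetical helper name, not part of any library.

```python
import importlib.util
import sys

def is_importable(module_name):
    """Return True if the interpreter running this script can locate the module."""
    return importlib.util.find_spec(module_name) is not None

# The interpreter that is actually running this script.
print(sys.executable)
# False here means scipy is not installed for this particular interpreter.
print(is_importable("scipy"))
```

If the check prints False, running `python -m pip install scipy` with that same interpreter installs the package where the script will look for it.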
2. English word cloud (the teacher's in-class demo)
import re
import numpy as np
import matplotlib.pyplot as plt
from wordcloud import WordCloud
from PIL import Image
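with open("wcs.txt", 'r') as file1:
    txt = file1.read()
txt = txt.lower()
# Replace punctuation with spaces. The character class [...] matches any one of
# these characters, and re.sub returns a new string, so the result is assigned back.
txt = re.sub(r'[,."/%?:]', ' ', txt)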
words=txt.split()
words_index=set(words)
dic={i:words.count(i) for i in words_index}
print(dic)
image1=Image.open("hreat.jpg")
graph=np.array(image1)
wc= WordCloud(background_color='white',mask=graph)
wc.generate_from_frequencies(dic)
plt.imshow(wc)# create picture
plt.axis("off")
plt.show() # show pic
Open questions:
Filter out meaningless words: and, the ...
What stopwords do
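Stopwords are very common words ("and", "the", ...) that carry little meaning and would otherwise dominate the cloud. `WordCloud` has a `stopwords` parameter, but it is ignored by `generate_from_frequencies`, so when the frequencies are counted by hand the filtering has to happen before counting. A minimal sketch, assuming a tiny hand-written stopword set (`STOPWORDS_DEMO` and `count_words` are hypothetical names used only for illustration):

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword set for illustration only.
STOPWORDS_DEMO = {"a", "an", "and", "the", "of", "to", "in", "is"}

def count_words(text, stopwords=STOPWORDS_DEMO):
    """Lowercase the text, replace punctuation with spaces, count non-stopwords."""
    cleaned = re.sub(r'[,."/%?:]', ' ', text.lower())
    return Counter(w for w in cleaned.split() if w not in stopwords)

freq = count_words("The cat and the dog: a cat is in the house.")
print(freq.most_common(3))
```

The resulting `Counter` is a dict subclass, so it can be passed to `wc.generate_from_frequencies(...)` in place of `dic`. The wordcloud package also ships a larger built-in set, `wordcloud.STOPWORDS`, which could replace the demo set.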
3. Chinese word cloud
from os import path
import jieba
import numpy as np
from wordcloud import WordCloud,ImageColorGenerator
import matplotlib.pyplot as plt
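from PIL import Image
# scipy.misc.imread was removed in SciPy 1.2, so the mask is loaded with PIL instead
with open('quanzhi.txt', encoding='gbk') as f:
    txt1 = f.read()
txt1_list = txt1.split(',')  # list of comma-separated chunks
d = path.dirname(__file__)
# Join the jieba tokens with spaces so WordCloud can tell the words apart
word_list = [" ".join(jieba.cut(sentences)) for sentences in txt1_list]
new_text = ' '.join(word_list)
# Load the mask (background) image
alice_coloring = np.array(Image.open(path.join(d, "yexiu.jpg")))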
fontname=path.join(d,"simfang.ttf")
wordcloud = WordCloud(
    background_color='white',
    mask=alice_coloring,  # mask (background) image
    font_path=fontname,   # Chinese text requires an explicit font path
    max_words=2000
).generate(new_text)
plt.imshow(wordcloud)  # word cloud with random colors
plt.axis("off")
#plt.show()
image = Image.open('yexiu.jpg')  # word cloud colored from the image
graph = np.array(image)
image_color = ImageColorGenerator(graph)
plt.imshow(wordcloud.recolor(color_func=image_color))
plt.axis("off")
plt.show()
wordcloud.to_file('wc_c.jpg')  # save the word cloud to an image file
Open questions:
- Word frequency
- How to present the image better ... presumably it gets clearer as the number of words grows ...
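For the word-frequency question, one option is to count the segmented tokens directly and hand the counts to `generate_from_frequencies`, as in the English demo. A minimal sketch, assuming the tokens have already been segmented: the list below is made-up sample data standing in for the output of `jieba.cut(txt1)`, and `word_frequencies` and its `min_len` parameter are hypothetical names.

```python
from collections import Counter

def word_frequencies(tokens, min_len=2):
    """Count tokens, skipping blanks and tokens shorter than min_len characters."""
    stripped = (tok.strip() for tok in tokens)
    return Counter(t for t in stripped if len(t) >= min_len)

# Hypothetical pre-segmented tokens, standing in for list(jieba.cut(txt1)).
tokens = ["荣耀", "的", "职业", "选手", ",", "荣耀", "联盟", "的", "选手"]
freq = word_frequencies(tokens)
print(freq.most_common(2))
```

Dropping single-character tokens is a crude way to remove particles and punctuation; a proper stopword list would be more precise. The counts could then be drawn with `WordCloud(font_path=fontname, ...).generate_from_frequencies(freq)`.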