Blog 1106
Required libraries:
jieba
wordcloud
matplotlib
scipy
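Installation: on the command line, run pip install jieba / pip install wordcloud (matplotlib and scipy can be installed with pip in the same way).
jieba word segmentation
First, use jieba to segment a simple sentence: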
import jieba
sentence = "我来到了异世界,转生成一只史莱姆。萌王万岁!"
print("Default Mode: " + "/".join(jieba.cut(sentence, cut_all=False, HMM=True)))
print("Full Mode: " + "/".join(jieba.cut(sentence, cut_all=True)))
print("HMM OFF: " + "/".join(jieba.cut(sentence, cut_all=False, HMM=False)))
# cut_for_search() takes only the sentence and HMM; it has no cut_all parameter
print("Search Engine Mode: " + "/".join(jieba.cut_for_search(sentence, HMM=False)))
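The output is as follows:
In the output above, the word "异世界" ("another world") is split into separate tokens. You can adjust the frequency of an individual word so that it is (or is not) segmented as a single token. Alternatively, you can adjust the dictionary.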