![](https://img-blog.csdnimg.cn/20201014180756925.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
自然语言处理
waiting&fighting
我喜欢高效率和一劳永逸
展开
-
统计词频+生成词云+生成graph
代码# -*- coding: utf-8 -*-"""Created on Tue Jun 22 23:00:19 2021@author: K43"""import xlrdimport xlwtimport refrom wordcloud import WordCloudimport matplotlib.pyplot as pltimport matplotlib.patches as mpatchesimport networkx as nxdef ReadExce原创 2021-06-23 17:04:05 · 276 阅读 · 0 评论 -
TF-IDF
复现# -*- coding: utf-8 -*-"""Created on Mon May 31 14:27:13 2021@author: K43"""#import jieba.analyse#text='''关键词是能够表达文档中心内容的词语,常用于计算机系统标引论文内容特征、#信息检索、系统汇集以供读者检阅。关键词提取是文本挖掘领域的一个分支,是文本检索、#文档比较、摘要生成、文档分类和聚类等文本挖掘研究的基础性工作''' #keywords=jieba.analyse原创 2021-05-31 15:30:16 · 82 阅读 · 0 评论 -
自动分词+热词统计
代码# -*- coding: utf-8 -*-"""Spyder EditorThis is a temporary script file."""import xlrdimport numpy as npimport pandas as pdimport jiebaimport collectionsimport xlwtdef readexcel(rPath): workbook = xlrd.open_workbook(rPath) #pr原创 2021-05-25 14:02:43 · 281 阅读 · 0 评论