![](https://img-blog.csdnimg.cn/20201014180756754.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
jieba
文章平均质量分 62
神创
这个作者很懒,什么都没留下…
展开
-
【实例】python 将jieba分词 展示在html
--------------------------------------------------------参考:http://blog.csdn.net/reallocing1/article/details/51694967--------------------------------------------------------配置:windows +python 3.6.3 + j...原创 2018-02-08 16:13:35 · 597 阅读 · 0 评论 -
【实例】词频统计及其可视化python+jieba+wordcloud
文本提供最后案例的文档下载:https://download.csdn.net/download/qq_19741181/10278764python 根据文本生成标签云 -----------------------------------------------------------------------------------------------效果>>> impo...原创 2018-03-10 10:26:31 · 7844 阅读 · 0 评论 -
【python jieba excel】用结巴分词,将文章分句,一行一行分词,并导入excel
第一步:将文章以句号形式分开,并标号第二步:使用结巴遍历每一句,并分词第三步:使用txt导入excel------------------------------------------------------------------参考自己的文章:第一篇:python(给每行开头添加序号)&(每行末尾添加序号)第二篇:python【jieba】如何换行 (分词同时)| pythonjie...原创 2018-04-06 21:43:37 · 13080 阅读 · 4 评论 -
【python】正则表达式,处理文章,获得首尾大意
参考:https://blog.csdn.net/u011089523/article/details/61914968 分句参考:https://zhidao.baidu.com/question/401008771.html 标点分句>>> f.close()>>> f = open('E:/序言.txt','r')>>> line =...原创 2018-04-15 20:15:40 · 315 阅读 · 0 评论 -
【python分词】镜像分词
>>> import re>>> text = "目前已经有不少部哲学史了">>> from bs4 import BeautifulSoup>>> import jieba>>> seg = jieba.cut(text.strip(),cut_all = False)原创 2018-04-15 21:57:41 · 442 阅读 · 0 评论 -
python【jieba】如何换行 (分词同时)
参考:https://blog.csdn.net/sinat_35376396/article/details/52415328------------------------------------------------------------------代码实现:>>> with open('E:/99999.txt','r')as f:... for line in...原创 2018-04-05 09:08:45 · 3139 阅读 · 0 评论 -
pythonjieba 分词 结束后用txt打开()
>>> with open('E:/99999.txt','r')as f:... for line in f:... seg = jieba.cut(line.strip(),cut_all = False)... output = '/'.join(seg)... with open('E:/13212.txt','a+')as s:... ...原创 2018-04-05 00:17:25 · 1272 阅读 · 0 评论 -
python【】词性标注横排
>>> import re>>> import jieba.posseg as pseg>>> f = open('E:/序言.txt','r').read()>>> words = pseg.cut(f)>>> l = []>>> m = []原创 2018-04-17 14:27:01 · 531 阅读 · 0 评论 -
【python镜像分词】运用到文章
>>> import re>>> t = open('E:/序言.txt','r')>>> text = t.read()>>> import jieba>>> b = ','or '。'>>> textCut = text.split(b)>&原创 2018-04-16 11:30:49 · 203 阅读 · 0 评论 -
[python jieba]词性标注 2018年4月16日10:40:07
>>> import jieba>>> import jieba.posseg>>> string = '陈晨和林迪是好朋友'>>> seg = jieba.posseg.cut(string)>>> print(seg)<generator object cut at 0x000001原创 2018-04-16 10:40:19 · 340 阅读 · 0 评论 -
[python]灵感-镜像
原创 2018-04-15 20:48:27 · 200 阅读 · 0 评论 -
【python】正则表达式处理文章,结构化和提炼大意方法1
>>> import re>>> end = re.compile(r'[u4e00-\u9fa5].$')>>> start = re.compile(r'[u4e00-\u9fa5].')>>> with open('E:/切图.txt','r')as f:... for line in f:... ...原创 2018-04-15 19:58:47 · 252 阅读 · 0 评论 -
【实例】python jieba词性标注 并导出txt
>>> import jieba.posseg as pseg>>> f = open('E:/西方哲学史.txt','r') f = f.read()>>> words = pseg.cut(f)>>> for w in words:... print (w.word,w.flag)...Building pre...原创 2018-02-24 15:15:02 · 2852 阅读 · 0 评论 -
【实例】Python 用jieba分词 导出txt(干货)
--------------------------------------------------------------------------------------完全的菜鸟,琢磨了好久 = =,终于两天时间成功捣鼓出来了, 参考了很多页面,翻来倒去所有的试过都没成功 = =----------------------------------我是分割线-------------------...原创 2018-02-08 11:47:58 · 12326 阅读 · 5 评论 -
【实例】python中jieba 添加 自定义词语?
参考:http://blog.sina.com.cn/s/blog_7d8326290102vzpb.html分词词典:jieba.load_userdict(file_name) # file _name 为路径【例如:jieba.load_userdict("C:\\Users\\Luo Chen\\Desktop\\lixiaofu.txt")seg_list = jieba.cut("李小...原创 2018-02-24 00:06:34 · 19331 阅读 · 4 评论 -
【实例】通过jieba 提取 关键词 (python)
>>> import jieba>>> import os>>> f = open("E:/西方哲学史.txt",'r')>>> f = f.read()>>> seg_list = jieba.cut(f)>>> print("原创 2018-02-23 23:43:40 · 3835 阅读 · 0 评论 -
【实例】python中文词频排序 + html提取文本工具下载链接
>>> with open("E:/cipin.txt") as wf,open("E:/asd.txt",'w') as wf2:... for word in wf:... word_lst.append(word.split(','))... for item in word_lst:... for item2 in item:... ...原创 2018-03-10 13:58:14 · 585 阅读 · 0 评论 -
【记录】python中,两种读取txt的方式;并结合jieba找出词频位置分布
>>> f = open('E:/西方哲学史.txt','r')>>> print(f)<_io.TextIOWrapper name='E:/西方哲学史.txt' mode='r' encoding='cp936'>>>> f = open('E:/西方哲学史.txt').read()>>> print(f)西方原创 2018-03-01 23:22:41 · 1067 阅读 · 0 评论 -
【python jieba】词频统计并标出数量
参考:https://blog.csdn.net/u014070086/article/details/73201590----------------------------------------------------------------------------------------------------------------------代码:import jiebatext =...原创 2018-04-07 11:56:19 · 20823 阅读 · 1 评论