![](https://img-blog.csdnimg.cn/20201014180756923.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
nltk分词
qq_42052864
这个作者很懒,什么都没留下…
展开
-
亚马逊标题分词
导包import pandas as pdfrom nltk.tokenize import word_tokenizefrom nltk.corpus import stopwordsfrom nltk.stem.porter import PorterStemmerfrom nltk.text import Textfrom nltk import ngrams,FreqDist读数据data = pd.read_csv(r'D:\数据\亚马逊搜索词排名\asin.csv',.原创 2021-04-21 15:38:17 · 243 阅读 · 0 评论 -
nltk分词
先读入数据import pandas as pddata = pd.read_excel(r'D:\python\zxzy\amazon_asin\review.xlsx')title = data['review_revs']data.head(1)对每条review进行分句#分句import nltkfrom nltk.tokenize import sent_tokenizesent = []for i in title: sent.append(sent_.原创 2021-04-21 15:15:56 · 487 阅读 · 0 评论