nltk分词
qq_42052864
这个作者很懒,什么都没留下…
展开
-
亚马逊标题分词
导包 import pandas as pd from nltk.tokenize import word_tokenize from nltk.corpus import stopwords from nltk.stem.porter import PorterStemmer from nltk.text import Text from nltk import ngrams,FreqDist 读数据 data = pd.read_csv(r'D:\数据\亚马逊搜索词排名\asin.csv',.原创 2021-04-21 15:38:17 · 253 阅读 · 0 评论 -
nltk分词
先读入数据 import pandas as pd data = pd.read_excel(r'D:\python\zxzy\amazon_asin\review.xlsx') title = data['review_revs'] data.head(1) 对每条review进行分句 #分句 import nltk from nltk.tokenize import sent_tokenize sent = [] for i in title: sent.append(sent_.原创 2021-04-21 15:15:56 · 502 阅读 · 0 评论