I. Splitting Sentences and Words (Example 1)
nltk: the Natural Language Toolkit (tokenization, stemming, synonyms and antonyms)
Install NLTK: conda install nltk
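(1) Import the package
import nltk
nltk.download('punkt')  # download the Punkt tokenizer data; newer NLTK releases may ask for 'punkt_tab' instead
print('I. Splitting sentences and words:')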
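Tokenizers such as sent_tokenize depend on the Punkt data fetched by nltk.download, so it can help to confirm the data is actually present before going further. The following is a minimal sketch, not part of the original example: the helper name punkt_available is hypothetical, and the two resource paths checked are an assumption about where the data lives in older and newer NLTK releases.

```python
def punkt_available():
    """Return True if NLTK can locate Punkt tokenizer data locally."""
    try:
        import nltk
    except ImportError:
        # nltk itself is not installed
        return False
    # Assumption: older releases ship 'punkt', newer ones 'punkt_tab'
    for resource in ("tokenizers/punkt", "tokenizers/punkt_tab"):
        try:
            nltk.data.find(resource)
            return True
        except LookupError:
            continue
    return False


print("Punkt data found:", punkt_available())
```

If this prints False, running nltk.download('punkt') again (or 'punkt_tab' on newer releases) is the first thing to try.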
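(2) Define the sample text
mytext1 = 'Hello Adam, how are you? I hope everything is going well. Today is a good day, see you dude.'
mytext2 = 'Hello Mr Adam, how are you? I hope everything is going well. Today is a good day, see you dude.'
(3) Split into sentences
sent_tokenize splits text into sentences at sentence-ending punctuation.
In the original run this call did not work, although splitting into words did. NLTK is fully open source and has no restricted free edition, so a failure here most likely means the Punkt data was not found (NLTK raises a LookupError in that case), not that the feature is missing.
from nltk.tokenize import sent_tokenize
print('Sentences after splitting:')
print(sent_tokenize(mytext1))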
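Since the sentence splitter can fail when the tokenizer data is missing, one option is to wrap it with a simple fallback. The sketch below is an illustration only: split_sentences is a hypothetical helper, and the regular-expression fallback is a naive stand-in that splits after '.', '!' or '?', which is not equivalent to Punkt (it will mis-split abbreviations such as 'Mr.').

```python
import re


def split_sentences(text):
    """Split text into sentences, preferring NLTK's sent_tokenize."""
    try:
        from nltk.tokenize import sent_tokenize
        return sent_tokenize(text)
    except (ImportError, LookupError):
        # nltk is not installed, or the Punkt data was not found:
        # fall back to splitting on whitespace that follows . ! or ?
        return re.split(r'(?<=[.!?])\s+', text.strip())


sample = ('Hello Adam, how are you? I hope everything is going well. '
          'Today is a good day, see you dude.')
for sentence in split_sentences(sample):
    print(sentence)
```

For this sample both paths should yield the same three sentences; on text with abbreviations or decimals the fallback will be noticeably worse than Punkt.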
(4) Split sentences using punctuation
fr