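1. Installation
Download the installation package that matches the Python version on your machine, either from the NLTK home page or from the link below.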
http://code.google.com/p/nltk/downloads/list
If you downloaded an egg file, install it with the following command:
easy_install nltk-?.?b4-py?.?.egg
If you downloaded a zip file, unpack it and run the following command from inside the unpacked directory:
sudo python setup.py install
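Before moving on to the test, it can help to confirm that the package is actually visible to the interpreter. The following is a minimal sketch, assuming Python 3 (`importlib.util.find_spec` is not available on the Python 2 interpreters this note was written for); the helper name `is_installed` is hypothetical and not part of NLTK.

```python
import importlib.util


def is_installed(package_name):
    """Return True if the named top-level package can be found on the import path."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # "nltk" is the package name used by the imports in the test section.
    if is_installed("nltk"):
        print("nltk is installed")
    else:
        print("nltk was not found; re-run the installation step")
```

The check only looks the package up on the import path; it does not import NLTK or verify its version.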
2. Testing
Run the following in the Python interactive interpreter:
>>> from nltk.stem.porter import PorterStemmer
>>> from nltk.tokenize.regexp import WordTokenizer
>>> text = WordTokenizer().tokenize("And now for something . completely different")
>>> for i in text:
...     print PorterStemmer().stem_word(i)
...
And
now
for
someth
complet
differ
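Note that the "." in the input sentence does not appear in the output, which suggests the tokenizer keeps only runs of word characters. The tokenization step can be approximated with the standard library alone. This is a sketch under the assumption that `WordTokenizer` behaves like the regular expression `\w+`; the function name `word_tokens` is made up for illustration and is not an NLTK API.

```python
import re


def word_tokens(text):
    """Split text into runs of word characters, dropping punctuation and whitespace."""
    # Assumption: this pattern mirrors what WordTokenizer does in the session above.
    return re.findall(r"\w+", text)


tokens = word_tokens("And now for something . completely different")
print(tokens)
# ['And', 'now', 'for', 'something', 'completely', 'different']
```

Each of these tokens is what the loop above passes to the stemmer, which is why six lines are printed for a seven-item sentence.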
Resources
http://nltk.googlecode.com/svn/trunk/doc/api/index.html
http://www.ibm.com/developerworks/library/l-cpnltk.html?S_TACT=105AGX52&S_CMP=cn-a-l