#!/usr/bin/env python
"""
NLTK source code:
./versions/anaconda3-5.0.1/lib/python3.6/site-packages/nltk
book.py is fairly simple: about 90 lines of code that import the texts from various places.
text1 has type nltk.text.Text. The Text class is defined in text.py.
moby = Text(nltk.corpus.gutenberg.words('melville-moby_dick.txt'))
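melville-moby_dick.txt is a human-readable ASCII text file.
Word sense disambiguation: a word can have several meanings, and the appropriate one is chosen from context. For example, 'by' has several meanings:
a. The lost children were found by the searchers (agentive)
b. The lost children were found by the mountain (locative)
c. The lost children were found by the afternoon (temporal)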
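The three readings above can be told apart from the noun that follows 'by'. The block below is only a toy sketch of that idea, not how NLTK or any real disambiguator works: ROLE_BY_NOUN and sense_of_by are hypothetical names, and the lexicon is hand-written for these three sentences.

```python
# Toy sketch: pick a reading of 'by' from the noun that follows it.
# ROLE_BY_NOUN is a hypothetical hand-written lexicon, not NLTK data.
ROLE_BY_NOUN = {
    'searchers': 'agentive',   # the one who did the finding
    'mountain': 'locative',    # where the finding happened
    'afternoon': 'temporal',   # when the finding happened
}


def sense_of_by(sentence):
    # Split on whitespace, strip surrounding punctuation, lowercase.
    tokens = [tok.strip('.,;:!?').lower() for tok in sentence.split()]
    if 'by' not in tokens:
        return None
    # Scan the words after the first 'by' and return the role of the
    # first one that appears in the lexicon.
    start = tokens.index('by') + 1
    for tok in tokens[start:]:
        if tok in ROLE_BY_NOUN:
            return ROLE_BY_NOUN[tok]
    return 'unknown'


if __name__ == '__main__':
    for noun in ('searchers', 'mountain', 'afternoon'):
        sentence = 'The lost children were found by the ' + noun
        print(sentence, '->', sense_of_by(sentence))
```

A real system would need far more context than one noun (for example, 'found by the mountain rescuers' is agentive), which is why the lookup here returns 'unknown' instead of guessing.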
Anaphora resolution: a pronoun can refer to more than one thing:
a. The thieves stole the paintings. They were subsequently sold.
b. The thieves stole the paintings. They were