When I ran NLTK's word tokenizer:
from nltk.tokenize import word_tokenize
text = "God is Great! I won a lottery."
print(word_tokenize(text))
NLTK reported that the punkt package was missing, so I tried to download it with the following code:
import nltk
nltk.download()
The download failed with the error: [Error:11004] getaddrinfo failed
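On Windows, a getaddrinfo failure with code 11004 typically means a host name could not be resolved to an address. The NLTK downloader fetches its package index from raw.githubusercontent.com, so a quick way to confirm the cause is to test that lookup directly. This is a minimal sketch using only the standard library; `can_resolve` is a hypothetical helper name, not part of NLTK:

```python
import socket

def can_resolve(host, port=443):
    """Return True if the local resolver maps host to at least one address."""
    try:
        return len(socket.getaddrinfo(host, port)) > 0
    except OSError:
        # socket.gaierror (a subclass of OSError) is the failure that
        # surfaces as "getaddrinfo failed"
        return False

# False here suggests the problem is name resolution, not NLTK itself
print(can_resolve("raw.githubusercontent.com"))
```

If this prints False while other sites load normally, the hosts-file workaround is a reasonable thing to try.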
Solution:
1. Open the IP lookup site https://www.ipaddress.com/ and search for raw.githubusercontent.com
2. Copy the four IP addresses listed in the lookup results
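3. Open C:\Windows\System32\drivers\etc\hosts (editing it requires administrator rights) and append the addresses at the end, each on its own line and followed by raw.githubusercontent.com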
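The lines for the hosts file can be generated rather than typed by hand. A minimal sketch, assuming the usual "IP address, then host name" entry format; `hosts_lines` is a hypothetical helper, and the addresses below are placeholders from the range reserved for documentation, not real GitHub addresses:

```python
def hosts_lines(ips, host="raw.githubusercontent.com"):
    """Build one 'IP hostname' hosts-file line per address."""
    return [f"{ip} {host}" for ip in ips]

# Placeholder addresses (192.0.2.0/24 is reserved for documentation);
# substitute the four addresses copied from the lookup site.
placeholder_ips = ["192.0.2.1", "192.0.2.2", "192.0.2.3", "192.0.2.4"]
print("\n".join(hosts_lines(placeholder_ips)))
```

Paste the printed lines at the end of the hosts file, save it, and run the download again.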