Python 手动安装nltk数据包

最新推荐文章于 2024-08-03 20:56:21 发布

vincent_hahaha

最新推荐文章于 2024-08-03 20:56:21 发布

阅读量2.5k

点赞数 4

分类专栏： python

本文链接：https://blog.csdn.net/vincent_duan/article/details/127369657

版权

python 开发语言深度学习

python 专栏收录该内容

52 篇文章 4 订阅

订阅专栏

自然语言处理工具包,是NLP研究领域常用的一个Python库。安装过程较为麻烦，这里记录一下。

首先安装nltk，pip install nltk。
查看数据存放路径：

import nltk
from nltk.book import *

运行程序，将出现报错，同时也可以知道代码会从哪些文件夹中寻找：

Searched in:
    - 'C:\\Users\\1/nltk_data'
    - 'C:\\Users\\1\\anaconda3\\envs\\htgn\\nltk_data'
    - 'C:\\Users\\1\\anaconda3\\envs\\htgn\\share\\nltk_data'
    - 'C:\\Users\\1\\anaconda3\\envs\\htgn\\lib\\nltk_data'
    - 'C:\\Users\\1\\AppData\\Roaming\\nltk_data'
    - 'C:\\nltk_data'
    - 'D:\\nltk_data'
    - 'E:\\nltk_data'

我们从https://github.com/nltk/nltk_data下载nltk数据包，解压后，重命名为nltk_data，然后移动到上面任意的一个文件夹。

也就是将packages重命名为nltk_data
在这里插入图片描述
里面的内容：

然后移动到C盘：

测试一下：

import nltk
from nltk.book import *

输出：

*** Introductory Examples for the NLTK Book ***
Loading text1, ..., text9 and sent1, ..., sent9
Type the name of the text or sentence to view it.
Type: 'texts()' or 'sents()' to list the materials.
text1: Moby Dick by Herman Melville 1851
text2: Sense and Sensibility by Jane Austen 1811
text3: The Book of Genesis
text4: Inaugural Address Corpus
text5: Chat Corpus
text6: Monty Python and the Holy Grail
text7: Wall Street Journal
text8: Personals Corpus
text9: The Man Who Was Thursday by G . K . Chesterton 1908

>>> from nltk.corpus import brown
>>> print(brown.words()[0:10])
['The', 'Fulton', 'County', 'Grand', 'Jury', 'said', 'Friday', 'an', 'investigation', 'of']