python nltk库[nltk_data] Error loading words: <urlopen error [Errno 61] Connection refused>解决

在测试场景中往往会需要生成一段随机的段落,每个段落的单词是实际的英文单词,不是随机的字母,这时就用到了python的random模块和nltk库。

如果在代码中使用如下语句下载资源库时会报错:

nltk.download('words')
nltk.download('brown')
[nltk_data] Error loading words: <urlopen error [Errno 61] Connection
[nltk_data]     refused>
[nltk_data] Error loading brown: <urlopen error [Errno 61] Connection
[nltk_data]     refused>
Traceback (most recent call last):
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 84, in __load
    root = nltk.data.find(f"{self.subdir}/{zip_name}")
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/data.py", line 579, in find
    raise LookupError(resource_not_found)
LookupError: 
**********************************************************************
  Resource words not found.
  Please use the NLTK Downloader to obtain the resource:

  >>> import nltk
  >>> nltk.download('words')
  
  For more information see: https://www.nltk.org/data.html

  Attempted to load corpora/words.zip/words/

  Searched in:
    - '/Users/testmanzhang/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/share/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/nltk_data'
    - '/usr/share/nltk_data'
    - '/usr/local/share/nltk_data'
    - '/usr/lib/nltk_data'
    - '/usr/local/lib/nltk_data'
**********************************************************************


During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/unittest_aosu.py", line 17, in <module>
    word_list = words.words()
                ^^^^^^^^^^^
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 120, in __getattr__
    self.__load()
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 86, in __load
    raise e
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/corpus/util.py", line 81, in __load
    root = nltk.data.find(f"{self.subdir}/{self.__name}")
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/python3.12/site-packages/nltk/data.py", line 579, in find
    raise LookupError(resource_not_found)
LookupError: 
**********************************************************************
  Resource words not found.
  Please use the NLTK Downloader to obtain the resource:

  >>> import nltk
  >>> nltk.download('words')
  
  For more information see: https://www.nltk.org/data.html

  Attempted to load corpora/words

  Searched in:
    - '/Users/testmanzhang/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/share/nltk_data'
    - '/Users/testmanzhang/PycharmProjects/practiceUICatalog/.venv/lib/nltk_data'
    - '/usr/share/nltk_data'
    - '/usr/local/share/nltk_data'
    - '/usr/lib/nltk_data'
    - '/usr/local/lib/nltk_data'
**********************************************************************

使用提示信息中的方法也是不行:

>>> import nltk
>>> nltk.download('words')
[nltk_data] Error loading words: <urlopen error [Errno 61] Connection
[nltk_data]     refused>
False

后来访问了nltk data的地址:https://www.nltk.org/nltk_data/,手动下载了资源:

下载完成后时一个zip文件:words.zip

解压后放到错误提示信息中的目录中,例如,'/Users/testmanzhang/nltk_data'

在我的本地并没有nltk_data这个目录,于是就手动创建了一个在这个目录下还要创建corpora,将解压后的words放到这个目录中:

/Users/testmanzhang/nltk_data/corpora

这时候再调试就OK了~

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值