OSError: We couldn‘t connect to ‘https://huggingface.co‘ to load this file, couldn‘t find it(亲测有效)

l8947943

已于 2024-11-27 14:41:52 修改

阅读量6.8k

点赞数 70

分类专栏： python问题文章标签： huggingface os 人工智能

于 2024-10-20 21:52:42 首次发布

本文链接：https://blog.csdn.net/l8947943/article/details/143099409

版权

python问题专栏收录该内容

52 篇文章

订阅专栏

实验背景，尝试下载离线模型，代码报错提示：

OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like IDEA-CCNL/Erlangshen-SimCSE-110M-Chinese is not the path to a directory containing a file named config.json.
Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.

意思是无法访问这个网址，主要是代码会从huggingface上下载模型，但是国内又存在墙的问题，因此，我们有两种解决方式。

1. 科学上网，访问该网址

通过全局代理的方式，实现模型的下载。

2. 使用镜像网址

国内huggingface镜像地址：https://hf-mirror.com/
往下翻，直接可看到使用教程。主要有四种解决方式。最直接的方式就是一个个下载使用。

3. 在代码中增加设置

import os
os.environ['HF_ENDPOINT'] = 'https://hf-mirror.com'

截取部分代码如下所示：

import os
os.environ['HF_ENDPOINT'] = 'https://hf-mirror.com'

from transformers import AutoModelForCausalLM, AutoTokenizer

# access_token="Your_huggingface_access_tokens" # 有些需要token
model_name = "Qwen/Qwen2.5-1.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

方法三是最简单的方式，也是最直接的方式，但是记得添加代码内容放在最前面，否则无效。需要什么模型，替换名字即可实现。（ps：看到评论区有些朋友还是不能解决，建议你用jupyternotebook尝试运行一下，先运行前两行代码，再执行后续代码）

4. 尝试huggingface mirror配置方式

wget https://hf-mirror.com/hfd/hfd.sh
chmod a+x hfd.sh

接着在自己目录下找到 ~/.bashrc，在文件末尾插入一行：
然后关闭 ~/.bashrc 文件，在命令行中运行如下命令使得上述配置生效：

export HF_ENDPOINT=https://hf-mirror.com

然后关闭 ~/.bashrc 文件，在命令行中运行如下命令使得上述配置生效：

source ~/.bashrc

接着，重启一下jupyter或者编辑器，重新运行代码即可生效。

5. 尝试本地化运行

去huggingface官网直接进行下载或者上镜像网站，下载对应所需的文件。至于需要哪些文件，参考这篇博客传送门：
其次修改代码

# 模型放在./models下
model_name = "./models/Qwen/Qwen2.5-1.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)