在Windows上用Llama Factory微调Llama 3的基本操作

最新推荐文章于 2025-01-07 16:26:17 发布

蛐蛐蛐

最新推荐文章于 2025-01-07 16:26:17 发布

阅读量3.6k

点赞数 20

分类专栏：深度学习 Python技巧科研工具文章标签： llama

本文链接：https://blog.csdn.net/qysh123/article/details/139529041

版权

科研工具同时被 3 个专栏收录

135 篇文章 13 订阅

订阅专栏

Python技巧

97 篇文章 2 订阅

订阅专栏

深度学习

61 篇文章 4 订阅

订阅专栏

这篇博客参考了一些文章，例如：教程：利用LLaMA_Factory微调llama3:8b大模型_llama3模型微调保存-CSDN博客

也可以参考Llama Factory的Readme：GitHub - hiyouga/LLaMA-Factory: Unify Efficient Fine-Tuning of 100+ LLMsUnify Efficient Fine-Tuning of 100+ LLMs. Contribute to hiyouga/LLaMA-Factory development by creating an account on GitHub.https://github.com/hiyouga/LLaMA-Factory?tab=readme-ov-file#installation首先将Llama Factory clone到本地：GitHub - hiyouga/LLaMA-Factory: Unify Efficient Fine-Tuning of 100+ LLMs

其次创建一个conda环境：

conda create -n llama_factory python=3.10

激活环境后首先安装pytorch，具体参考这个页面：Start Locally | PyTorch，例如：

conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

而后进入到LLaMA-Factory文件夹，参考其Readme，运行：

pip install -e .[torch,metrics]

同时，按照其Readme，在Windows系统上还需要运行：

pip install https://github.com/jllllll/bitsandbytes-windows-webui/releases/download/wheels/bitsandbytes-0.41.2.post2-py3-none-win_amd64.whl

具体原因我就不展开讲了。然后依次运行：

Set CUDA_VISIBLE_DEVICES=0
Set GRADIO_SHARE=1
llamafactory-cli webui

就可以看到其webui了。不过这时候还没有模型参数文件，对于国内用户而言，可以在这里https://modelscope.cn/organization/LLM-Researchhttps://modelscope.cn/organization/LLM-Research

进行下载，例如可以下载Llama3中文版本（如果没有git lfs可以用前两个命令安装）：

conda install git-lfs
git-lfs install
git lfs clone https://www.modelscope.cn/LLM-Research/Llama3-8B-Chinese-Chat.git

下载好之后，可以构造自己的微调数据集，具体而言，按照这里的介绍：

https://github.com/hiyouga/LLaMA-Factory/tree/main/data

Llama Factory支持alpaca and sharegpt的格式，前者类似于这种格式：

[
  {
    "instruction": "human instruction (required)",
    "input": "human input (optional)",
    "output": "model response (required)",
    "system": "system prompt (optional)",
    "history": [
      ["human instruction in the first round (optional)", "model response in the first round (optional)"],
      ["human instruction in the second round (optional)", "model response in the second round (optional)"]
    ]
  }
]

我们构造数据集的时候，最简单的方法就是只构造instruction和output。把生成的json文件放到LLaMA-Factory\data目录下，然后打开dataset_info.json文件，增加这个文件名记录即可，例如我这里增加：