LLM-结合三元组SPO和提示工程来试用Baichuan2-7B-Chat-4bits模型_基于大模型提示工程的三元组方法-CSDN博客

本文链接：https://blog.csdn.net/alexgaoyihang/article/details/138784204

概述

《LLM-结合三元组SPO和提示工程来试用Baichuan2-7B-Chat-4bits模型》近期对LLM进行了一些应用场景的思考，其中很简单的一个场景是客服，假设目前所有的知识信息都在一个Excel文档中，首先将其转换为三元组关系，然后结合提示工程技术向LLM进行提问，期望得到反馈。

效果

最左侧是一个Excel表格，包含商品信息，中间的文字部分是将Excel中的数据转换为三元组SPO信息，并且添加上如图所示的提示工程，右侧是模型返回的结果，可以看到能够按照要求返回数据。

调用

在安装Baichuan2-7B-Chat-4bits后，使用如下代码进行调用，得到返回结果。

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers.generation.utils import GenerationConfig
import os

# 获取当前文件所在的目录路径
current_dir = os.path.dirname(os.path.abspath(__file__))
# 将当前目录和'model'连接起来，获得'model'文件夹的完整路径
model_path = os.path.join(current_dir, 'model/Baichuan2-7B-Chat-4bits')
print('model_path=', model_path)

if __name__ == '__main__':
tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_path, device_map="auto", torch_dtype=torch.bfloat16, trust_remote_code=True)
model.generation_config = GenerationConfig.from_pretrained(model_path)
messages = []
messages.append({"role": "user", "content": "解释一下“温故而知新”"})
response = model.chat(tokenizer, messages)
print(response)

部署

git clone https://github.com/baichuan-inc/Baichuan2.git

cd Baichuan2

pip install -r requirements.txt

## 处理安装包的兼容问题，Baichuan2-7B-Chat-4bits
pip install bitsandbytes==0.41.1
pip install accelerate-0.25.0

## 同样的可以将如上的python脚本放到 ${Baichuan2} 文件夹下。
## 匹配如上的python脚本，需要下载模型文件到 ${Baichuan2}/model/Baichuan2-7B-Chat-4bits 文件夹下。