大模型LLM与agent

shlhhy

已于 2024-07-11 15:56:36 修改

阅读量355

点赞数 6

文章标签：人工智能

于 2024-03-14 14:47:54 首次发布

本文链接：https://blog.csdn.net/shlhhy/article/details/136710440

版权

本文探讨了利用LLM的强大意图识别能力，构建以LLM为核心的控制中心，调用不同垂直领域的小模型的新型应用趋势。重点介绍了开源框架langchain和魔搭社区在这一领域的实践资源。

摘要由CSDN通过智能技术生成

最近，基于LLM强大的意图识别能力，采用LLM作为控制中心，调用各种垂直领域的小模型，这一研究方向比较热门，即大模型的agent应用。常用的开源框架：langchain。
魔搭社区：https://modelscope.cn/home。

1. 模型下载

在modelscope上下载需要的模型文件，例如https://modelscope.cn/models上搜索某一模型。
如果是联网环境，可以通过脚本中指定ModelType参数，这样命令执行时会自动下载模型

from swift.llm import (
    get_model_tokenizer, get_template, inference, ModelType,
    get_default_template_type, inference_stream
)
model_type = ModelType.qwen1half_7b_chat

本次下载离线文件：chatglm2-6b，下载地址：https://modelscope.cn/models/ZhipuAI/chatglm2-6b/files

2. 环境配置

申请一台带GPU和cuda环境的Ubuntu服务器，安装swift。

pip install 'ms-swift[llm]' -U
pip install transformers==4.30.2

将下载好的llm模型文件放到某一路径以备调用，需要注意某些依赖包的版本要符合模型要求，例如chatglm2-6b的transformers版本如果为4.41.2时模型加载会报错。

3. 运行模型推理

from transformers import AutoTokenizer, AutoModel

path = "/mnt/glm2"
tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
model = AutoModel.from_pretrained(path, trust_remote_code=True, device='cuda')
model = model.eval()
while True:
	print("input your question:")
	input_text = input()
	response, history = model.chat(tokenizer, input_text, history=[])
	print(response)