千问Qwen7B chat：简单代码使用

最新推荐文章于 2024-07-04 10:19:14 发布

duhaining1976

最新推荐文章于 2024-07-04 10:19:14 发布

阅读量601

点赞数 6

分类专栏： LLM研究及应用系列文章标签： LLM

本文链接：https://blog.csdn.net/duhaining1976/article/details/139538930

版权

LLM研究及应用系列专栏收录该内容

2 篇文章 0 订阅

订阅专栏

我们先用一个简单的例子看一下千问的代码逻辑及效果。

from modelscope import AutoModelForCausalLM, AutoTokenizer
from modelscope import GenerationConfig

# 加装分词
tokenizer = AutoTokenizer.from_pretrained("qwen/Qwen-7B-Chat", revision='v1.0.5', trust_remote_code=True)
# 加载大模型
model = AutoModelForCausalLM.from_pretrained("qwen/Qwen-7B-Chat", revision='v1.0.5', device_map="auto",
                                             trust_remote_code=True, fp16=True).eval()
# 加载配置
model.generation_config = GenerationConfig.from_pretrained("Qwen/Qwen-7B-Chat", revision='v1.0.5',
                                                           trust_remote_code=True)  # 可指定不同的生成长度、top_p等相关超参

response, history = model.chat(tokenizer, "你好", history=None)

# 显示千问的回答

print(response)
response, history = model.chat(tokenizer, "浙江的省会在哪里？", history=history)

# 增加这行查看对话历史

print(history)

print(response)
response, history = model.chat(tokenizer, "它有什么好玩的景点", history=history)
print(history)

print(response)

1. 第一次回答：

你好！很高兴为你提供帮助。

2. 第二次回答（提问：浙江的省会在哪里？）：

浙江的省会是杭州。

3. 第二次对答保留的历史，可以看出包括了第一次对答的历史数据。