[LLM] Mind-blowing!! Alibaba open-sources Qwen/QwQ-32B, matching DeepSeek-R1 and o1-mini in performance
Qwen/QwQ-32B Model Introduction
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ can think and reason, achieving markedly better performance on downstream tasks, especially hard problems. QwQ-32B is a medium-sized reasoning model that delivers performance competitive with state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.
Model Features
Type: Causal Language Models
Training Stage: Pretraining & Post-training (Supervised Finetuning and Reinforcement Learning)
Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
Number of Parameters: 32.5B
Number of Parameters (Non-Embedding): 31.0B
Number of Layers: 64
Number of Attention Heads (GQA): 40 for Q and 8 for KV
Context Length: Full 131,072 tokens
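The head counts above reflect grouped-query attention (GQA): 40 query heads share 8 key/value heads, which shrinks the KV cache at long context lengths. As a quick sanity check, these numbers can be read straight from the published config; a minimal sketch, assuming the field names of the Qwen2 config class that transformers loads for this model:

from transformers import AutoConfig

# Fetches only config.json from the Hub; no weights are downloaded.
config = AutoConfig.from_pretrained("Qwen/QwQ-32B")

print(config.num_hidden_layers)    # expect 64 layers
print(config.num_attention_heads)  # expect 40 query heads
print(config.num_key_value_heads)  # expect 8 key/value heads (GQA)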
Release Date
March 5, 2025
Model Performance
According to the official release, QwQ-32B scores on par with DeepSeek-R1 and o1-mini on reasoning benchmarks such as AIME24 (math) and LiveCodeBench (coding), despite being far smaller than DeepSeek-R1's 671B parameters.
Environment Setup
The example below uses device_map="auto", which also requires the accelerate package:
pip install transformers==4.47 accelerate -i https://mirrors.aliyun.com/pypi/simple/
Running the Model
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/QwQ-32B"

# torch_dtype="auto" uses the dtype stored in the checkpoint (bfloat16);
# device_map="auto" spreads the weights across available GPUs via accelerate.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "How many r's are in the word \"strawberry\""
messages = [
    {"role": "user", "content": prompt}
]

# Wrap the conversation in the chat template and append the generation
# prompt so the model begins its (reasoning) reply.
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Reasoning traces can be very long, so allow a generous token budget.
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=32768
)

# Strip the prompt tokens so only the newly generated text is decoded.
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
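One caveat on the call above: the QwQ model card advises against greedy decoding, which can fall into endless repetition on long reasoning chains, and recommends sampling with Temperature=0.6, TopP=0.95, and TopK between 20 and 40. A minimal variant of the generate call with those settings:

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=32768,
    do_sample=True,   # sample instead of greedy decoding
    temperature=0.6,
    top_p=0.95,
    top_k=40,
)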
Download
model_id: Qwen/QwQ-32B
Download link: https://hf-mirror.com/Qwen/QwQ-32B (a mirror reachable from mainland China, no VPN needed)
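The mirror can also be used programmatically. A minimal sketch with huggingface_hub, assuming HF_ENDPOINT is set before the library is imported (the mirror reads this variable) and an illustrative local target directory:

import os

# Must be set before huggingface_hub is imported, or the default endpoint is used.
os.environ["HF_ENDPOINT"] = "https://hf-mirror.com"

from huggingface_hub import snapshot_download

# Downloads every file in the repo into ./QwQ-32B (path is illustrative).
snapshot_download(repo_id="Qwen/QwQ-32B", local_dir="./QwQ-32B")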
GGUF Quantized Models
- Running the 32B model on 16 GB of VRAM
Download the model from https://hf-mirror.com/bartowski/Qwen_QwQ-32B-GGUF/tree/main and pick Qwen_QwQ-32B-IQ2_S.gguf (10.4 GB).
Test with llama.cpp, as shown in the sketch after this list.
- Running the 32B model on 24 GB of VRAM
Download the model from https://hf-mirror.com/bartowski/Qwen_QwQ-32B-GGUF/tree/main and pick Qwen_QwQ-32B-Q4_0.gguf (18.7 GB).
Test with llama.cpp the same way; the sketch below works for either file.
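Both files load identically; only the path differs. A minimal sketch using the llama-cpp-python bindings (one common way to drive llama.cpp from Python; the file path, context size, and prompt are illustrative, and a GPU-enabled build is assumed for offloading):

from llama_cpp import Llama  # pip install llama-cpp-python

# n_gpu_layers=-1 offloads every layer to the GPU; lower it if VRAM runs out.
llm = Llama(
    model_path="./Qwen_QwQ-32B-IQ2_S.gguf",  # or ./Qwen_QwQ-32B-Q4_0.gguf
    n_gpu_layers=-1,
    n_ctx=8192,  # context window for this test
)

result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "How many r's are in the word \"strawberry\"?"}],
    max_tokens=2048,
)
print(result["choices"][0]["message"]["content"])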
Open-Source License
License: apache-2.0