Deploying the Alibaba Cloud team's Qwen2-VL large model fails with the error shown below:
The model is the GPTQ-Int8 quantized build, and the compute card is a P40.
Model page: Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 · Hugging Face, https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
The Python packages installed in the virtual environment are as follows:
(qwen2vl) PS C:\Users\Administrator> pip list
Package Version
----------------------------- ------------
accelerate 0.34.2
aiofiles 23.2.1
aiohappyeyeballs 2.4.0
aiohttp 3.10.5
aiosignal 1.3.1
altair 5.4.1
annotated-types 0.7.0
anyio 4.4.0
async-timeout 4.0.3
attrs 24.2.0
auto_gptq 0.7.1+cu118
autoawq 0.2.6
autoawq_kernels 0.0.7
av 13.0.0
certifi 2024.8.30
charset-normalizer 3.3.2
click 8.1.7
colorama 0.4.6
coloredlogs 15.0.1
contourpy 1.3.0
cycler 0.12.1
datasets 2.21.0
dill 0.3.8
einops 0.8.0
exceptiongroup 1.2.2
fastapi 0.114.1
ffmpy 0.4.0
filelock 3.13.1
fonttools 4.53.1
frozenlist 1.4.1
fsspec 2024.2.0
gekko 1.2.1
gradio 4.29.0
gradio_client 0.16.1
h11 0.14.0
httpcore 1.0.5
httpx 0.27.2
huggingface-hub 0.24.6
humanfriendly 10.0
idna 3.8
importlib_resources 6.4.5
intel-openmp 2021.4.0
Jinja2 3.1.3
jsonschema 4.23.0
jsonschema-specifications 2023.12.1
kiwisolver 1.4.7
markdown-it-py 3.0.0
MarkupSafe 2.1.5
matplotlib 3.9.2
mdurl 0.1.2
mkl 2021.4.0
mpmath 1.3.0
multidict 6.1.0
multiprocess 0.70.16
narwhals 1.6.4
networkx 3.2.1
ninja 1.11.1.1
numpy 1.26.3
optimum 1.22.0
orjson 3.10.7
packaging 24.1
pandas 2.2.2
peft 0.12.0
pillow 10.2.0
pip 24.2
protobuf 5.28.0
psutil 6.0.0
pyarrow 17.0.0
pydantic 2.9.1
pydantic_core 2.23.3
pydub 0.25.1
Pygments 2.18.0
pyparsing 3.1.4
pyreadline3 3.4.3
python-dateutil 2.9.0.post0
python-multipart 0.0.9
pytz 2024.2
PyYAML 6.0.2
qwen-vl-utils 0.0.2
referencing 0.35.1
regex 2024.7.24
requests 2.32.3
rich 13.8.1
rouge 1.0.1
rpds-py 0.20.0
ruff 0.6.4
safetensors 0.4.5
semantic-version 2.10.0
sentencepiece 0.2.0
setuptools 72.1.0
shellingham 1.5.4
six 1.16.0
sniffio 1.3.1
starlette 0.38.5
sympy 1.12
tbb 2021.13.1
tokenizers 0.19.1
tomlkit