Preface
When vLLM is run with multi-GPU tensor parallelism enabled (-tp 2 or --tensor-parallel-size 2), it fails with the following error:
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/usr/lib/python3.8/runpy.py", line 87, in