vllm_server.sh脚本问题 #1

QingChengLineOne · 2024-05-28T06:52:39Z

vllm_server.sh脚本配置：
model_path="/public/zzy/model/Mistral-7B-Instruct-v0.2"
model_name="Mistral-7B-Instruct-v0.2"
tensor_parallel_size=2

cd $model_path
cd ..
python -m vllm.entrypoints.openai.api_server --model $model_name --dtype=half --tensor-parallel-size $tensor_parallel_size

报错：
Traceback (most recent call last):
File "/root/anaconda/envs/moss/lib/python3.8/runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/root/anaconda/envs/moss/lib/python3.8/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/root/anaconda/envs/moss/lib/python3.8/site-packages/vllm/entrypoints/openai/api_server.py", line 37, in
_running_tasks: Set[asyncio.Task[Any]] = set()
TypeError: 'type' object is not subscriptable

wlyh514 · 2024-05-28T16:25:43Z

Seems like a known vllm bug introduced in v0.4.2.
Issue
The PR that introduced this bug
Diff v0.4.1...v0.4.2
The fix is not published yet, you could try rolling back to v0.4.1.

QingChengLineOne · 2024-05-29T01:37:24Z

thank you for your answer. I try to use python==3.10 to slove this problem.

QingChengLineOne closed this as completed May 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vllm_server.sh脚本问题 #1

vllm_server.sh脚本问题 #1

QingChengLineOne commented May 28, 2024

wlyh514 commented May 28, 2024

QingChengLineOne commented May 29, 2024

vllm_server.sh脚本问题 #1

vllm_server.sh脚本问题 #1

Comments

QingChengLineOne commented May 28, 2024

wlyh514 commented May 28, 2024

QingChengLineOne commented May 29, 2024