[Bug]: RuntimeError: shape mismatch: value tensor of shape [3328, 7168] cannot be broadcast to indexing result of shape [3328] for OpenGVLab/InternVL2-40B #8275
Labels
bug
Something isn't working
Your current environment
The output of `python collect_env.py`
🐛 Describe the bug
While serving the OpenGVLab/InternVL2-40B using Multi-Node Multi-GPU (tensor parallel plus pipeline parallel inference) facing these issue
RuntimeError: shape mismatch: value tensor of shape [3328, 7168] cannot be broadcast to indexing result of shape [3328]
But I don't face these issue while serving the OpenGVLab/InternVL2-8B and OpenGVLab/InternVL2-26B
Command: vllm serve OpenGVLab/InternVL2-40B --tensor-parallel-size 1 --pipeline-parallel-size 4 --dtype bfloat16 --gpu-memory-utilization 0.9 --max-model-len 6000 --enforce-eager --trust-remo
te-code --tokenizer-mode "auto"
Log:
Debug Info that I found:
Debug: input_ids shape: torch.Size([6000])
Debug: inputs_embeds shape: torch.Size([6000, 7168])
Debug: mask shape: torch.Size([6000]), num True values: 3328
Debug: flattened shape: torch.Size([3328, 7168])
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: