[Bug]: InternVL2-26B tensor_parallel_size=4, AssertionError: 25 is not divisible by 4 #8097
Closed
1 task done
Labels
bug
Something isn't working
Your current environment
Python3.8
8*A10 GPU
Model:InternVL2-26B
vllm branch:main
torch 2.4.0
torchvision 0.19.0
🐛 Describe the bug
ref:
#8055 (comment)
#7996
This issue solves some of the inference problems of Intern VL2, but there are still problems in multi-card parallel situations.
my code
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: