Your current environment
VLLM version: 0.7.3
Model: nvidia/Llama-3_3-Nemotron-Super-49B-v1
🐛 Describe the bug
I am trying to run the new Nvidia model Nemotron 49B-v1 using the VLLM 0.7.3 version but I got this error
DeciLMConfig object has no attribute ‘num_key_value_heads_per_layer’
I have two questions: I know there are PR such as
#15068
#15008
I am wondering about if the error would be resolved after adding the support for the model. Or the error is unrelated and something is wrong in my end?