[Question]: From which version does vLLM support inference and serving of the Qwen2.5-14B-Instruct model? #962
Answered by jklj077
zengqingfu1442 asked this question in Q&A
- Has this been raised before?
- Description: Which version of vLLM supports inference and serving of the Qwen2.5-14B-Instruct model?
Answered by jklj077 on Sep 26, 2024
Replies: 2 comments
-
The latest version is recommended; tested with v0.6.1. The Qwen2 architecture (Qwen1.5, Qwen2, Qwen2.5) has been supported by vLLM since v0.3.0.
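For reference, a minimal offline-inference sketch in Python, assuming a recent vLLM (the answer above reports testing with v0.6.1); the prompt and sampling settings are illustrative choices, not from this thread. `LLM.chat` applies the model's chat template and is available in recent vLLM releases.

```python
# Minimal offline-inference sketch for Qwen2.5-14B-Instruct with vLLM.
# Assumes a recent vLLM (v0.6.x, per the answer above); the sampling
# settings and prompt below are illustrative, not from the discussion.
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-14B-Instruct")
sampling_params = SamplingParams(temperature=0.7, top_p=0.8, max_tokens=512)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Briefly introduce Qwen2.5."},
]

# LLM.chat applies the model's chat template before generation.
outputs = llm.chat(messages, sampling_params)
print(outputs[0].outputs[0].text)
```

For an OpenAI-compatible endpoint instead of offline inference, recent versions can start the API server with `vllm serve Qwen/Qwen2.5-14B-Instruct`.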
Answer selected by zengqingfu1442
-
If you want to use the tool-calling feature of Qwen2.5, you need to use version …
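The minimum version is cut off in the comment above and stays unspecified here. As a hedged sketch of what tool calling looks like once the requirement is met: recent vLLM releases support auto tool choice in the OpenAI-compatible server when launched with `--enable-auto-tool-choice --tool-call-parser hermes` (the parser Qwen2.5 uses). The `get_current_weather` tool below is hypothetical, for illustration only.

```python
# Hedged tool-calling sketch against vLLM's OpenAI-compatible server.
# Assumes the server was launched with auto tool choice enabled, e.g.:
#   vllm serve Qwen/Qwen2.5-14B-Instruct \
#       --enable-auto-tool-choice --tool-call-parser hermes
# The get_current_weather tool is hypothetical, for illustration only.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

tools = [{
    "type": "function",
    "function": {
        "name": "get_current_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-14B-Instruct",
    messages=[{"role": "user", "content": "What's the weather in Beijing?"}],
    tools=tools,
)
# If the model decides to call the tool, tool_calls carries the arguments.
print(response.choices[0].message.tool_calls)
```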