Replies: 1 comment
-
Hi, Qwen2.5 is supported by Xinference.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Has this been raised before?
Description
今天,用Xinference 在英伟达 4090 Gpu加载QWen2.5 14B,显存用了17G,好慢,不知道是不是Xinference没适配Qwen 2.5。
Beta Was this translation helpful? Give feedback.
All reactions