[New Model]: Qwen3 support #642

@Yikun

Description

The model to consider.

  • Qwen/Qwen3-8B
  • Qwen/Qwen3-MoE-15B-A2B

The closest model vLLM already supports.

No response

What's your difficulty of supporting the model you want?

First priority:

Second priority:

  • Fix MOE error: [Model] Support common fused moe ops for moe model #709
  • Add CI for Qwen/Qwen3-MoE-15B-A2B
  • Download models and run tests (functional / accuracy / perf) for Qwen/Qwen3-MoE-15B-A2B
  • Announce in a WeChat post on Open Source Now: "Deploying Qwen/Qwen3-MoE with vLLM Ascend"
