Skip to content

vLLM Ascend Roadmap Q1 2025 #71

@Yikun

Description

@Yikun

This is a living document!

Note that: vLLM Ascend 0.7.3 (match vLLM v0.7.3) is main release for 2025 Q1, see more in link.

Supported models track: #260

Hardware Plugin

Basic support

Initial vLLM Ascend support will start to support with basic hardware compatibility support.

Feature support

Model support

Performance

  • add vllm-ascend perf website like vLLM does https://perf.vllm.ai/
  • focus on llama3, qwen2.5, qwen2-vl, deepseek v3/R1, improve the performance

Quality

  • Full UT coverage
  • Model e2e test
  • Multi card/node e2e test

Docs

  • README
  • vllm-ascend website: https://vllm-ascend.readthedocs.org/
  • Quick start / Installation / Turtorial
  • User guide: supported feature / models
  • Developer guide: Contributing / Versioning policy

CI and Developer Productivity

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions