-
Notifications
You must be signed in to change notification settings - Fork 544
Description
Release Checklist
Release Version: v0.9.1
Release Branch: v0.9.1-dev
Release Date: 2025/08/30
Release Manager: @wangxiyuan
Prepare Release Note
-
Create a new issue for release feedback [v0.9.1] FAQ / Feedback | 问题/反馈 #2643
-
Write the release note PR. [Doc] add v0.9.1 release note #2646 (should backport to 0.9.1-dev branch then)
-
Update the feedback issue link in docs/source/faqs.md
-
Add release note to docs/source/user_guide/release_notes.md
-
Update version info in docs/source/community/versioning_policy.md
-
Update contributor info in docs/source/community/contributors.md
-
Update package version in docs/conf.py
-
PR need Merge
we should make sure all 0.9.1-dev related PR is done. https://github.com/vllm-project/vllm-ascend/pulls?page=1&q=is%3Apr+is%3Aopen+base%3Av0.9.1-dev
Merge/Close/Keep as it is(should leave the reason here)
TO be merge:
- [v0.9.1][Doc]Large Scale Ep doc update #2595
- [0.9.1][doc]Update doc for 0.9.1 #2648
- [0.9.1][Bugfix][Aclgraph] Fix qwen3-moe + aclgraph + tp #2647
- [v0.9.1][Bugfix] Fix guided decoding invalid backend #2645
- [v0.9.1][bugfix] disable the chunked prefill feature in Non-MLA LLMs #2659
- rm ptach run_engine_core in dense case #2665
- [v0.9.1] Add CN docs #2657
Fix in post1 release later:
- [0.9.1][Fixbug] Fix num_hidden_layers on Qwen2-Audio and Qwen2.5-Omni #2656
- [0.9.1][PromptLogprobs][V1] Support prompt logprobs to fix ceval accuracy in V1 #2654
Functional Test
we should make sure 0.9.1 is well tested. The core function is:
- PD disaggreagate with LLMDatadist @Potabk
- aclgraph @MengqingCao
- spec decode @wxsIcey Ngram, MTP works. Eagle doesn't work.
- guided decode @shen-shanshan
- quantization @22dimensions
- vlm @zhangxinyuehfad
- rlhf @leo-pony
- loar @wxsIcey
- ascend scheduler @wxsIcey
Doc Test
- Tutorial is updated.
- User Guide is updated.
- Developer Guide is updated.
- Check V1 env in 0.9.1-dev branch
Prepare Artifacts
- Docker image is ready.
- Wheel package is ready.
Release Step
- Release note PR is merged.
- Post the release on GitHub release page.
- Generate official doc page on https://app.readthedocs.org/dashboard/
- Wait for the wheel package to be available on https://pypi.org/project/vllm-ascend
- Wait for the docker image to be available on https://quay.io/ascend/vllm-ascend
- Upload 310p wheel to Github release page
- Broadcast the release news (By message, blog , etc)
- Close this issue