-
Couldn't load subscription status.
- Fork 521
Closed
Labels
guideguide noteguide note
Description
Overview
In our roadmap, we plan to support guided decoding in 2025 Q1 as shown here (#71).
Feel free to feedback your issues when using guided decoding with vllm-ascend, and we will try to fix them if we can.
Roadmap
Latest Community News
- [Structured Outputs] Refactor bitmask construction into get_grammar_bitmask vllm#23361
- [PERF] Async structured outputs vllm#23224
- [Perf] Parallelize fill_bitmask to accelerate high-throughput guided decoding vllm#21862
- Frontend: Adding LM Format Enforcer support to V1 engine vllm#22564
- [RFC][core][V1] generalize structured output manager and backends vllm#17503
- [V1] Add
structural_tagsupport using xgrammar vllm#17085 - [V0][V1][Core] Add outlines integration for V1, and update V0 integration. vllm#15975
- [V1][Experimental] Jump-forward decoding vllm#15490
- [V1][Feature] Enable Speculative Decoding with Structured Outputs vllm#14702
- Add support for xgrammar backend on aarch64:
- Add support for reasoning model (DeepSeek-R1):
Adaptation for vllm-ascend
- [7/N] Refactor for structured output module:
- [V1][Structured Output] Minor modification to
_validate_structured_output()vllm#16748 - [V1][Structured Output] Move xgrammar related utils to
backend_xgrammar.pyvllm#16578 - [V1][Platform] Remove
supports_structured_output()in platform #531 - [V1][Platform] Add
supports_structured_output()method to Platform #475 - [V1][Structured Output] Add
supports_structured_output()method to Platform vllm#16148 - [Structured Output][Refactor] Move
apply_grammar_bitmask()method fromModelRunnerto structured output utils vllm#21999 - [Structured Output] Replace
apply_grammar_bitmask()method with that in vllm to avoid maintenance #2524
- [V1][Structured Output] Minor modification to
- [3/N] Bugfix for xgrammar backend:
- [2/N] Bugfix for guidance backend:
- [2/N] CI & Test:
- [3/N] Doc:
- [3/N] Others:
Learning Materials
I have written some posts for better understanding this feature:
MengqingCao, wangxiyuan, gaocegege, Yikun and shen-shanshan
Metadata
Metadata
Assignees
Labels
guideguide noteguide note