[Bugfix] Ensure multistep lookahead allocation is compatible with cuda graph max capture #8340

Merged
youkaichao merged 4 commits into vllm-project:main from neuralmagic:bug_ms Sep 10, 2024

Commits

Commits on Sep 10, 2024