@yzh119 yzh119 commented Feb 17, 2025

The scheduling algorithm in #863 does not account for requests whose kv-cache length is 0; this PR fixes the issue.
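
As a rough illustration of the kind of edge case described above (not the actual code from #863 or this PR), consider a hypothetical planner that splits each request's kv-cache into fixed-size chunks. With a plain ceiling division, a request with kv-cache length 0 would get zero chunks and would never be scheduled. The sketch below clamps the chunk count to at least one. The names `plan_kv_chunks` and `chunk_size` are assumptions made for illustration only.

```python
def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division for non-negative a and positive b."""
    return (a + b - 1) // b


def plan_kv_chunks(kv_lens: list[int], chunk_size: int) -> list[list[tuple[int, int]]]:
    """Hypothetical planner: return, per request, a list of (start, end) kv ranges.

    Illustrative only; this is not the scheduler from the PR.
    """
    plan = []
    for kv_len in kv_lens:
        # Without the max(..., 1), kv_len == 0 would yield zero chunks and the
        # request would be dropped from the plan entirely.
        num_chunks = max(ceil_div(kv_len, chunk_size), 1)
        chunks = [
            (i * chunk_size, min((i + 1) * chunk_size, kv_len))
            for i in range(num_chunks)
        ]
        plan.append(chunks)
    return plan
```

Under these assumptions, a request with an empty kv-cache still receives a single empty range `(0, 0)`, so downstream code can handle it explicitly rather than skipping it.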

@yzh119 yzh119 merged commit 6ec3bae into main Feb 17, 2025
@yzh119 yzh119 deleted the fix-mla-kv-0 branch February 18, 2025 22:51

2 participants