Skip to content

Conversation

@zzzzwwjj
Copy link
Collaborator

What this PR does / why we need it?

fix ascend_scheduler for v0.9.0

Does this PR introduce any user-facing change?

How was this patch tested?

@zzzzwwjj zzzzwwjj force-pushed the main branch 2 times, most recently from 0f2bdde to 5270507 Compare May 23, 2025 11:49
@wangxiyuan
Copy link
Collaborator

need add a test to make sure ascend scheduler always works

@wangxiyuan wangxiyuan mentioned this pull request May 27, 2025
@whx-sjtu
Copy link
Collaborator

assert num_new_tokens == 1
looks like this fix still doesn't support speculative decoding like MTP, I will just test my next PR pr943 which tries to support both MTP and disaggregated-prefill.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants