-
-
Couldn't load subscription status.
- Fork 10.9k
Revert "Add batch invariant kernel override for FlashInfer backend [2/n]" #26220
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request reverts a previous change that added a batch invariant kernel override for the FlashInfer backend, which reportedly caused test failures. The revert appears to be clean and complete, correctly removing the feature and its associated logic from the FlashInfer backend implementation and the batch invariance layer. The corresponding test cases have also been reverted to their previous state. I have reviewed the changes and found no issues with the revert itself.
|
Let's confirm that the test in question now passes before merging |
…/n]" (vllm-project#26220) Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>
…/n]" (vllm-project#26220) Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>
…/n]" (vllm-project#26220) Signed-off-by: xuebwang-amd <xuebwang@amd.com>
…/n]" (vllm-project#26220) Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Reverts #25769 because it failed PyTorch Fullgraph Smoke Test