-
-
Notifications
You must be signed in to change notification settings - Fork 10.8k
[core] MLA performance boost for AMD GPUs and tuned MoE config for MI… #13439
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
7e481ee to
7a76f70
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM with a small fix needed, cc @LucasWilkinson
70a9795 to
4f2422a
Compare
…300X Signed-off-by: qli88 <qiang.li2@amd.com>
Signed-off-by: qli88 <qiang.li2@amd.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Signed-off-by: qli88 <qiang.li2@amd.com>
c131305 to
148e877
Compare
|
cc @houseroad |
| BLOCK = 64 | ||
| if is_hip_: | ||
| BLOCK = 8 | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| BLOCK = 64 | |
| if is_hip_: | |
| BLOCK = 8 | |
| BLOCK = 64 if not is_hip_ else 8 |
|
why do we close the PR? |
@houseroad I created a new PR to adapt the commit landed yesterday (#12639). Please take a look at that one (#13718) |
Uh oh!
There was an error while loading. Please reload this page.