[Kernel] Fix conflicting macro names for gguf kernels #15456

SzymonOzog · 2025-03-25T09:59:42Z

Just realized that the macro names for the MoE kernels conflict with the MMQ ones defined at

vllm/csrc/quantization/gguf/mmq.cuh

Line 112 in 4f044b1

#define MMQ_Y_Q4_0 32

This is not an issue at the moment because both definitions have the same value, but it could cause bugs in the future if they diverge.

Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>
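
To illustrate the conflict described above, here is a minimal host-side sketch, not the actual vLLM code. It relies on the fact that the preprocessor accepts a second `#define` of the same name when the replacement list is identical, which is why the clash produces no diagnostic today. The name `MOE_Y_Q4_0` and the helpers `mmq_tile_y`, `moe_tile_y`, and `check_tile_sizes` are hypothetical and chosen only for illustration; the PR's real macro names may differ.

```cpp
#include <cassert>

// Stand-in for the MMQ header: the name and value come from the line quoted above.
#define MMQ_Y_Q4_0 32

// Stand-in for a MoE header that reuses the same name with the same value.
// The preprocessor accepts an identical redefinition, so the clash goes
// unnoticed until one of the two values changes.
#define MMQ_Y_Q4_0 32

// Hypothetical MoE-specific name (an assumption, not necessarily the PR's choice).
// With its own prefix, the MoE tile size can be tuned independently.
#define MOE_Y_Q4_0 32

// Each kernel family reads only its own macro.
constexpr int mmq_tile_y() { return MMQ_Y_Q4_0; }
constexpr int moe_tile_y() { return MOE_Y_Q4_0; }

// Today the two sizes agree, matching the situation described in the PR.
static_assert(mmq_tile_y() == moe_tile_y(), "tile sizes currently match");

inline void check_tile_sizes() {
  assert(mmq_tile_y() == 32);
  assert(moe_tile_y() == 32);
}
```

If the second `#define MMQ_Y_Q4_0` were later changed to a different value, compilers would report a redefinition, and depending on include order one kernel family could silently pick up the other's tile size. Distinct prefixes avoid both outcomes.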

github-actions · 2025-03-25T09:59:51Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Isotr0py

LGTM!

I think we should separate the MoE kernel out of gguf_kernel.cu into a gguf_moe_kernel.cu and tie it to the moe extension, so that the GGUF kernel is much easier to maintain. (In fact, I'm planning an update to the GGUF kernel to catch up with the MMA support in llama.cpp, but haven't had enough bandwidth to do it yet.)

Anyway, we can leave that to be done in a following PR.

SzymonOzog · 2025-03-25T11:16:42Z

Great! I've also started reading the MMA kernels from llama.cpp and will try to adapt them for MoE.

Fix conflicting macro names for gguf kernels

ae43e58

Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>

Isotr0py approved these changes Mar 25, 2025

Isotr0py enabled auto-merge (squash) March 25, 2025 10:28

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Mar 25, 2025

Isotr0py merged commit a608160 into vllm-project:main Mar 25, 2025
69 checks passed

erictang000 pushed a commit to erictang000/vllm that referenced this pull request Mar 25, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

3dc68bf

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>

wrmedford pushed a commit to wrmedford/vllm that referenced this pull request Mar 26, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

c4189e7

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com> Signed-off-by: Wes Medford <wryanmedford@gmail.com>

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

b5f8e48

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

5523455

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>

shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

9f68530

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Kernel] Fix conflicting macro names for gguf kernels (vllm-project#1…

cf8be0d

…5456) Signed-off-by: SzymonOzog <szymon.ozog@gmail.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
