
[fix] disable gluon pa for llama #113

Open

gbyu-amd wants to merge 1 commit into main from guanbao/fix_llama

Conversation

gbyu-amd (Contributor) commented Jan 6, 2026

Motivation

Gluon PA is not fully verified with Llama, so disable this path for Llama for now.

Submission Checklist

else:
    # Qwen only uses gluon pa decode when bs=64
    return self.paged_attention_triton if ctx.batch_size == 64 else self.paged_attention_asm

if ATOM_ENABLE_QK_NORM_ROPE_CACHE_QUANT_FUSION:
Contributor commented:

Should we also make a change for paged_attention_triton on line 405?

gbyu-amd (Contributor, Author) replied:

Llama and Qwen will not trigger the if condition on line 404; check line 132. One problem is that some of the if conditions here may not be orthogonal enough across different models (see the sketch below).
The accuracy of Llama is back to normal with this PR:
[image: accuracy results screenshot]
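For illustration, here is a minimal sketch of a model-gated dispatch along those lines. It assumes a hypothetical `model_arch` field on the context and an illustrative method name; the actual ATOM attribute and method names may differ.

```python
# Hedged sketch only: `model_arch` and `select_paged_attention` are
# illustrative names, not the actual ATOM code or API.
def select_paged_attention(self, ctx):
    # Gate the gluon (Triton) decode path explicitly on the model
    # architecture, so one model's conditions cannot leak into another's.
    if ctx.model_arch == "qwen" and ctx.batch_size == 64:
        # Qwen only uses gluon pa decode when bs=64
        return self.paged_attention_triton
    # Llama and all other models fall back to the verified ASM path.
    return self.paged_attention_asm
```

Keying the branch on the architecture makes the gluon path opt-in per model rather than reachable through overlapping batch-size conditions.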

scxiao (Contributor) commented Jan 8, 2026:

But the Qwen3 model failed with the gluon attention here: https://github.com/ROCm/ATOM/actions/runs/20798013041/job/59736412001?pr=56. Do you know why it failed?

gbyu-amd (Contributor, Author) replied:

[image: CI failure screenshot] The gluon PA API was changed on the aiter side, but the integration in ATOM remained unchanged. bernard_ps_pa_upstream provides a fix, which is waiting to be merged.
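As a general defensive pattern against this kind of upstream API drift, an integration can assert the expected call signature at import time. A hedged sketch follows; `gluon_paged_attention` and the parameter names are assumptions, not the actual aiter symbols.

```python
import inspect

def signature_matches(fn, expected_params):
    # Compare fn's leading parameter names against the names the
    # integration was written for; a mismatch signals upstream API drift.
    actual = list(inspect.signature(fn).parameters)
    return actual[: len(expected_params)] == list(expected_params)

# Usage idea (names hypothetical): fail fast at import time instead of
# crashing later inside the decode path.
# if not signature_matches(gluon_paged_attention, ["q", "k_cache", "v_cache"]):
#     raise RuntimeError("aiter gluon PA signature changed; update ATOM integration")
```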
