Fix INT4 QR TP8 boundary condition#1834

Merged: valarLip merged 1 commit into main from alizaidy/qr-fix on Jan 14, 2026
@azaidy azaidy commented Jan 13, 2026

Motivation

#1673 added a broad condition that selects QR whenever the input size exceeds 4 MB. This PR narrows that condition for the fp16/TP8/INT4 QR case, where QR outperforms AR only above 8 MB. At 4 MB, QR takes 32 us versus 27 us for AR.
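
The boundary described above can be sketched as a small predicate. This is only an illustration, not the code changed in this PR: the function name, argument names, string encodings, and the strict `>` comparison are all assumptions, and the 8 MB constant is taken from the measurement quoted in the description.

```python
MB = 1024 * 1024

# Assumed crossover, taken from the PR description:
# for fp16 / TP8 / INT4, QR beats AR only above 8 MB.
QR_MIN_BYTES_FP16_TP8_INT4 = 8 * MB


def should_use_qr(dtype: str, tp_size: int, quant_level: str, nbytes: int) -> bool:
    """Hypothetical helper: decide whether to pick QR over AR.

    Only the fp16 / TP8 / INT4 configuration is modeled here. Every other
    configuration is outside the scope of this sketch and returns False
    as a placeholder.
    """
    if (dtype, tp_size, quant_level) == ("fp16", 8, "INT4"):
        # Strict comparison is an assumption; the source only says "above 8 MB".
        return nbytes > QR_MIN_BYTES_FP16_TP8_INT4
    return False


if __name__ == "__main__":
    for size_mb in (4, 8, 16):
        choice = "QR" if should_use_qr("fp16", 8, "INT4", size_mb * MB) else "AR"
        print(f"{size_mb} MB -> {choice}")
```

With these assumptions, a 4 MB input stays on AR, matching the 32 us versus 27 us figures quoted above, and only inputs larger than 8 MB switch to QR.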

Technical Details

Test Plan

Test Result

Submission Checklist

@azaidy azaidy requested review from a team, amirumoAMD and vgokhale January 13, 2026 22:36

@valarLip valarLip left a comment

LGTM

@valarLip valarLip merged commit 1ec04f7 into main Jan 14, 2026
17 checks passed
@valarLip valarLip deleted the alizaidy/qr-fix branch January 14, 2026 02:12
zhuyuhua-v pushed a commit that referenced this pull request Jan 14, 2026
yzhou103 pushed a commit that referenced this pull request Jan 28, 2026