Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[LinearLayouts] Faster pext algorithm #5621

Merged
merged 4 commits into from
Jan 15, 2025
Merged

[LinearLayouts] Faster pext algorithm #5621

merged 4 commits into from
Jan 15, 2025

Conversation

lezcano
Copy link
Contributor

@lezcano lezcano commented Jan 15, 2025

We also skip the LinearLayout test for HIP as it's currently failing.

Regarding the use of getWarpSize and getNumWarpsPerCTA, which are not correct for LinearLayouts with broadcasting as noted in #5617, we found almost all the uses are in AMD land. Changing these into calling the functions that act on the module is tricky, as the module is not currently accessible at the caller site in most of them. As such, we leave this refactor up to AMD folks.

We also skip the LinearLayout test for HIP as it's currently failing
Comment on lines +2796 to +2797
if is_hip() and isinstance(src_layout, LinearLayout):
pytest.skip("FIXME: LinearLayout not supported on HIP")
Copy link
Contributor Author

@lezcano lezcano Jan 15, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cc @antiagainst regarding the HIP skip. See also the warpSize / numWarp comment in the OP.

@lezcano lezcano enabled auto-merge (squash) January 15, 2025 17:48
@lezcano lezcano merged commit 9895a1f into main Jan 15, 2025
7 checks passed
@lezcano lezcano deleted the reviews_reduce_linear branch January 15, 2025 17:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants