Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AssignTiles][CI] Fix assign-tiles for L3 buffers with multiple tile candidate + add e2e test targeting 4x8 AIE array on Strix #1031

Merged
merged 7 commits into from
Jan 22, 2025

Conversation

Abhishek-Varma
Copy link
Contributor

@Abhishek-Varma Abhishek-Varma commented Jan 15, 2025

This PR logically contains two commits :-

Commit 1
-- Required to enable 4x8 AIE array on Strix.
-- Before this commit iree-amdaie-assign-tiles pass ends up allocating/distributing LHS L3 buffers on (0,0) -> (7,0).
-- Since this leads to consumer DMA channel exhaustion later on, this commit aims to address the same and tries to assign same tile set to L3 buffers based on the L3 LOF count for L3<->L2 Copy.

Commit 2
-- Updates mm_npu4.cc for targeting 4x8 intrinsic for ukernel.
-- Adds two e2e tests for CI : a normal execution and a ukernel based execution.

Signed-off-by: Abhishek Varma abhvarma@amd.com

@Abhishek-Varma Abhishek-Varma force-pushed the avarma_same_tile_assg_multi_col branch from 49d1660 to defa226 Compare January 16, 2025 12:23
@Abhishek-Varma Abhishek-Varma force-pushed the avarma_same_tile_assg_multi_col branch from defa226 to f1c2085 Compare January 17, 2025 10:31
@Abhishek-Varma Abhishek-Varma changed the base branch from main to avarma_use_gcd_for_split_factor January 17, 2025 10:31
Base automatically changed from avarma_use_gcd_for_split_factor to main January 17, 2025 12:28
…iff block

-- Required to enable 4x8 AIE array on Strix.
-- Before this commit `iree-amdaie-assign-tiles` pass will end up
   allocating/distributing LHS L3 buffers on (0,0) -> (7,0).
-- Since this leads to consumer DMA channel exhaustion later on, this commit
   aims to address the same and tries to assign same tile set to L3 buffers on
   different block.
-- As a result, we will get LHS L3 buffers on (0,0) -> (4,0).

Signed-off-by: Abhishek Varma <abhvarma@amd.com>
-- Adds e2e Matmul 1024x1204x1024 i32 test for 4x8 AIE array on Strix.

Signed-off-by: Abhishek Varma <abhvarma@amd.com>
@Abhishek-Varma Abhishek-Varma force-pushed the avarma_same_tile_assg_multi_col branch 2 times, most recently from 9629558 to 6e9d1f8 Compare January 20, 2025 12:28
@Abhishek-Varma Abhishek-Varma force-pushed the avarma_same_tile_assg_multi_col branch from 6e9d1f8 to c86ba72 Compare January 20, 2025 12:30
@Abhishek-Varma Abhishek-Varma marked this pull request as ready for review January 20, 2025 18:01
@Abhishek-Varma Abhishek-Varma requested a review from jtuyls January 22, 2025 09:21
Copy link
Collaborator

@jtuyls jtuyls left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, just one nit.

Thanks for updating to analyzing based on L2 buffers now.

@Abhishek-Varma Abhishek-Varma enabled auto-merge (squash) January 22, 2025 11:33
@Abhishek-Varma Abhishek-Varma merged commit d2f880a into main Jan 22, 2025
7 checks passed
@Abhishek-Varma Abhishek-Varma deleted the avarma_same_tile_assg_multi_col branch January 22, 2025 11:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants