Skip to content

Conversation

@ProExpertProg
Copy link
Collaborator

@ProExpertProg ProExpertProg commented Oct 17, 2025

This is on top of 2.9 + #24604 as I didn't want any surprises so I just developed with 2.9. We can check 2.8 with CI :P.

TODOs:

  • I think it would be nice to merge Middle and Last patterns - see allreduce fusion in [torch.compile] Enable attention and allreduce fusion without custom ops enabled #24604 (we can wrap the pattern and replacement to just return the first output)
  • I think the sequence parallelism unit test is kinda complicated with fusion and fix_finctionalization happening. rmsnorm+quant fusion stopped working for me, if you can get it working that's good, if not no worries.

Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
angelayi and others added 11 commits October 17, 2025 14:35
…om ops enabled

Signed-off-by: angelayi <yiangela7@gmail.com>

Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
@mergify
Copy link

mergify bot commented Oct 23, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @ProExpertProg.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build documentation Improvements or additions to documentation needs-rebase ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants