[Fix] fix the mha fwd_v3 segment fault in torch.compile(mode="reduce-overhead", fullgraph=True) by minmengdie · Pull Request #1794 · ROCm/aiter

minmengdie · 2026-01-08T08:20:08Z

Motivation

fix the mha fwd_v3 segment fault in torch.compile(mode="reduce-overhead", fullgraph=True)

Technical Details

changed the reference capture to the value capture to prevent the args from being destroyed early.
Explicit assignment forces evaluation order and prevents compiler from reordering operations that could lead to accessing uninitialized args.

Test Plan

cd /root/rtfm-amd
source ./.venv/bin/activate
python -m wlt.models.rtfm.demo.server.server --model-config rtfm_10_06_512_chonky_meanflow --port 8083

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Copilot

Pull request overview

This PR adds thread safety improvements via a mutex and temporary debug logging to the MHA (Multi-Head Attention) forward pass implementation, while also adjusting test configurations.

Key Changes:

Added mutex protection for thread-safe access to the kernel implementation pointer map in C++ code
Added extensive debug logging statements for troubleshooting purposes
Modified test parameters (GQA head count and test configuration dimensions)

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 10 comments.

File	Description
csrc/cpp_itfs/mha_fwd.cpp	Added mutex for thread-safe kernel pointer map access and debug logging statements throughout
op_tests/test_mha.py	Commented out parameter, reduced test dimensions, changed GQA head count, and disabled seq_padding test

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

csrc/cpp_itfs/mha_fwd.cpp

op_tests/test_mha.py

csrc/cpp_itfs/mha_fwd.cpp

op_tests/test_mha.py

csrc/cpp_itfs/mha_fwd.cpp

…iler from reordering operations

…overhead", fullgraph=True) (#1794) * add log and mutex for test * add thread_local * value capture args * fix the Explicit assignment forces evaluation order and prevents compiler from reordering operations * delete some logs

add log and mutex for test

3aca41f

minmengdie requested review from a team and Copilot January 8, 2026 08:20

Copilot started reviewing on behalf of minmengdie January 8, 2026 08:21 View session

Copilot AI reviewed Jan 8, 2026

View reviewed changes

minmengdie added 4 commits January 9, 2026 08:32

add thread_local

8b30964

value capture args

3a9fce3

fix the Explicit assignment forces evaluation order and prevents comp…

2ca218c

…iler from reordering operations

delete some logs

a52fef0

minmengdie changed the title ~~add log and mutex for test~~ [Fix] fix the mha fwd_v3 segment fault in torch.compile(mode="reduce-overhead", fullgraph=True) Jan 12, 2026

valarLip approved these changes Jan 13, 2026

View reviewed changes

valarLip merged commit 2985cb6 into main Jan 13, 2026
17 checks passed

valarLip deleted the mmd/fix/torchcompile branch January 13, 2026 04:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] fix the mha fwd_v3 segment fault in torch.compile(mode="reduce-overhead", fullgraph=True)#1794

[Fix] fix the mha fwd_v3 segment fault in torch.compile(mode="reduce-overhead", fullgraph=True)#1794
valarLip merged 5 commits intomainfrom
mmd/fix/torchcompile

minmengdie commented Jan 8, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

minmengdie commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

minmengdie commented Jan 8, 2026 •

edited

Loading