Seg faults in BTL atomics #1209
Comments
Forgot to mention: only the last rank crashes (rank 3 with np=4 or rank 5 with np=6).
Looking at it now. If I can't reproduce it on one of my IB machines, I will try on psg.
I see the problem. I should have the fix ready later today.
hjelmn added a commit to hjelmn/ompi-release that referenced this issue Dec 15, 2015
A previous commit updated the one-sided code to register the state region only once. This created an issue when using the scratch lock with fetching atomics: in this case, on any rank that isn't local rank 0, the module->state_handle is NULL. This commit fixes the issue by removing the scratch lock and using a fragment pointer instead.
Fixes open-mpi/ompi#1209
(cherry picked from open-mpi/ompi@0de9445)
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
hjelmn referenced this issue Dec 15, 2015
A previous commit updated the one-sided code to register the state region only once. This created an issue when using the scratch lock with fetching atomics: in this case, on any rank that isn't local rank 0, the module->state_handle is NULL. This commit fixes the issue by removing the scratch lock and using a fragment pointer instead.
Fixes #1290
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
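For readers following along, here is a toy model of the failure mode the commit message describes. It is only a sketch under stated assumptions, not the actual Open MPI code or patch: `module_t`, `frag_t`, `reg_handle_t`, `fetch_add`, `lock_via_scratch`, and `lock_via_fragment` are all hypothetical names, and only the `state_handle` field name comes from the commit message. The simulated atomic returns -1 where the real code would presumably dereference a NULL handle and crash.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a registration handle. */
typedef struct { int key; } reg_handle_t;

/* Hypothetical stand-in for the one-sided module. */
typedef struct {
    int local_rank;
    reg_handle_t *state_handle; /* registered once; NULL unless local rank 0 */
    int64_t scratch_lock;       /* result slot inside the state region */
} module_t;

/* Hypothetical fragment that carries its own registered result slot. */
typedef struct {
    reg_handle_t handle;
    int64_t result;
} frag_t;

/* Simulated fetching atomic: the local result buffer needs a valid handle.
 * Returns 0 on success, -1 when the handle is missing. */
int fetch_add(int64_t *remote, int64_t operand,
              int64_t *result, const reg_handle_t *result_handle)
{
    assert(remote != NULL && result != NULL);
    if (result_handle == NULL) {
        return -1; /* the real code would likely segfault here */
    }
    *result = *remote;   /* fetch the old value */
    *remote += operand;  /* then apply the add */
    return 0;
}

/* Buggy pattern: the result lands in the scratch lock, which depends on
 * state_handle and therefore fails on every rank except local rank 0. */
int lock_via_scratch(module_t *m, int64_t *remote)
{
    return fetch_add(remote, 1, &m->scratch_lock, m->state_handle);
}

/* Pattern in the spirit of the fix: the result lands in a fragment that
 * has its own handle, so the local rank no longer matters. */
int lock_via_fragment(frag_t *frag, int64_t *remote)
{
    return fetch_add(remote, 1, &frag->result, &frag->handle);
}
```

In this model, any module whose `state_handle` is NULL fails in `lock_via_scratch` regardless of how many ranks are involved, while `lock_via_fragment` succeeds everywhere because it never touches `state_handle`.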
👍
@sjeaugey can this issue be closed?
Yes. I thought it already was, but only 1241 was closed. Sorry about that.
jsquyres pushed a commit to jsquyres/ompi that referenced this issue Sep 19, 2016
While we weren't really hanging, it was taking a very, very long time…
Many tests seem to segfault with the new atomics on IB. I can reproduce it with 4 ranks (2x2) but not with only 2.
Here is the backtrace from the c_accumulate test: