v4.0.x: Thread fixes #9311

Merged
merged 2 commits into open-mpi:v4.0.x from v4.0.x_threads
Aug 27, 2021
awlauria
Contributor

  • ompi/request: Add a read memory barrier to sync the receive buffer after wait completes.

We found an issue where, when using multiple threads, the data may not yet be
visible in the receive buffer when MPI_Wait() returns. Without the rmb(), checking
the buffer some time after MPI_Wait() has returned shows that the data does arrive eventually.

We have seen this issue intermittently on Power9 using PAMI, but in theory it could
happen with any transport.

Signed-off-by: Austen Lauria <awlauria@us.ibm.com>
(cherry picked from commit 12192f1)

  • Allow mpi_init_thread to override the MPI_THREAD_LEVEL

Signed-off-by: Austen Lauria <awlauria@us.ibm.com>
(cherry picked from commit babfd93)
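
For readers unfamiliar with the pattern behind the first commit, here is a minimal sketch of a read barrier issued after a wait loop completes. It is not the actual patch: it uses standard C11 atomics and pthreads instead of Open MPI's internal barrier, and `recv_buf`, `req_complete`, `deliver`, `wait_for_completion`, and `run_once` are hypothetical names chosen for illustration only.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stddef.h>
#include <string.h>

#define BUF_LEN 64

/* Hypothetical stand-ins for a receive buffer and a request's completion flag. */
static char recv_buf[BUF_LEN];
static atomic_int req_complete;

/* "Progress" thread: write the payload, then mark the request complete. */
static void *deliver(void *arg)
{
    const char *payload = arg;
    strncpy(recv_buf, payload, BUF_LEN - 1);
    /* Release ordering: the buffer write is ordered before the flag store. */
    atomic_store_explicit(&req_complete, 1, memory_order_release);
    return NULL;
}

/* Waiting side: spin on the flag, then issue a read (acquire) barrier. */
static void wait_for_completion(void)
{
    while (!atomic_load_explicit(&req_complete, memory_order_relaxed)) {
        /* spin until the request is marked complete */
    }
    /* Without this fence, a weakly ordered CPU may satisfy later reads of
     * recv_buf with data loaded before the flag was observed as set. */
    atomic_thread_fence(memory_order_acquire);
}

/* Run one delivery and copy the received data into out. Returns 0 on success. */
int run_once(const char *payload, char *out, size_t out_len)
{
    pthread_t tid;

    if (out_len == 0) {
        return -1;
    }
    memset(recv_buf, 0, sizeof recv_buf);
    atomic_store(&req_complete, 0);

    if (pthread_create(&tid, NULL, deliver, (void *)payload) != 0) {
        return -1;
    }
    wait_for_completion();

    /* Safe to read the buffer only after the barrier above. */
    strncpy(out, recv_buf, out_len - 1);
    out[out_len - 1] = '\0';

    return pthread_join(tid, NULL) == 0 ? 0 : -1;
}
```

On strongly ordered hardware the missing fence may never show up as a bug, which is consistent with the report above that the problem was intermittent and seen on Power9.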

@awlauria awlauria added this to the v4.0.7 milestone Aug 25, 2021
@awlauria
Contributor Author

bot:aws:retest

@gpaulsen gpaulsen merged commit 785d110 into open-mpi:v4.0.x Aug 27, 2021
@jsquyres
Member

This PR contains a commit effecting a behavior change to MPI_INIT_THREAD that really should not be under the name "Thread fixes". See #9331.

@awlauria awlauria deleted the v4.0.x_threads branch March 17, 2022 17:28
6 participants