Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
btl/smcuda: Add atomic_wmb() before sm_fifo_write
This change fixes open-mpi#12270 Testing on c7g instance type (arm64) confirms this change elminates hangs and crashes that were previously observed in 1 in 30 runs of IMB alltoall benchmark. Tested with over 300 runs and no failures. The write memory barrier prevents other CPUs from observing the fifo get updated before they observe the updated contents of the header itself. Without the barrier, uninitialized header contents caused the crashes and invalid data. Signed-off-by: Luke Robison <lrbison@amazon.com> (cherry picked from commit 71f378d)
- Loading branch information