-
Notifications
You must be signed in to change notification settings - Fork 395
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Move open interface call to base fi_ops #29
Comments
Address in commit 759c605 |
sungeunchoi
added a commit
to sungeunchoi/libfabric
that referenced
this issue
Feb 27, 2015
Fix build warnings. Merging to get build stable. Others please review. @bturrubiates @hppritcha
tstruk
added a commit
to tstruk/libfabric
that referenced
this issue
Oct 7, 2024
…wg#29) LEX-3960: Add back-off mechanism on -EAGAIN * LEX-3960: Add back-off mechanism on -EAGAIN The 4-node hang mostly happens after -EAGAIN error is returned from send from one context, e.g. MPI init, and is folowed by another send from a new context, e.g. NCCL test. Add a back-off mechanism on flush_unsent to help the system recover after -EAGAIN error. Signed-off-by: Tadeusz Struk <tstruk@gigaio.com> Approved-by: Benjamin Kitor Approved-by: Eric Pilmore
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Allow an app to open interfaces that are associated with a specific object. Move open_if to struct fi_ops.
The text was updated successfully, but these errors were encountered: