-
Notifications
You must be signed in to change notification settings - Fork 894
Topic/pr7201 to v40x #7356
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Topic/pr7201 to v40x #7356
Conversation
Both opal_hwloc_base_get_relative_locality() and _get_locality_string() iterate over hwloc levels to build the proc locality information. Unfortunately, NUMA nodes are not in those normal levels anymore since 2.0. We have to explicitly look a the special NUMA level to get that locality info. I am factorizing the core of the iterations inside dedicated "_by_depth" functions and calling them again for the NUMA level at the end of the loops. Thanks to Hatem Elshazly for reporting the NUMA communicator split failure at https://www.mail-archive.com/users@lists.open-mpi.org/msg33589.html It looks like only the opal_hwloc_base_get_locality_string() part is needed to fix that split, but there's no reason not to fix get_relative_locality() as well. Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr> (cherry picked from commit ea80a20)
not being defined. related to open-mpi#7201 Signed-off-by: Howard Pritchard <howardp@lanl.gov>
I'll give it a build / test tomorrow and then merge this after a review. |
@gpaulsen don't merge yet. this will break backward compatibility with hwloc 1 series. |
I assume the issues you're referring to are resolved by #7366 ? |
@gpaulsen Yes, they should. |
Refs open-mpi#7362 Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr> (cherry picked from commit 329d445)
Okay, I pushed the commit from #7366 to this branch as well. |
oops. Thanks for double checking. I'll cherry-pick again in the next hour. |
Build was broken by mistake in commit d40662edc41a5a4d09ae690b640cfdeeb24e15a1 Fixes open-mpi#7362 Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr> (cherry picked from commit 907ad85)
Ok, please re-review. Note that I also changed f136804#diff-f4ec0cf4c6659e1c4af03b7a807378d8R2262 to asprintf. I think that was correct, but wanted to point it out. |
@gpaulsen looks good and ready to merge |
No description provided.