Issue with overlapping vector datatype #5540
Comments
George has a patch that's coming.
Discussed on the 2018-09-11 Webex; @bosilca mentioned that he has a patch but hasn't yet tested it properly against all the other datatype tests. Stay tuned.
Discussed at the RM meeting today, and will bring up again on tomorrow's Webex.
Per discussion on today's Webex, we decided that this issue should be fixed in the v4.0.x series, but probably isn't worth bringing back to the v3.0.x and v3.1.x series (i.e., it's a pretty esoteric issue for highly complex datatypes).
@bosilca Do you have a PR? If it's partial, we might be able to get help working on this...
@bosilca ?
@bosilca said on the Webex today that he would separate out the fix for this vector issue from a branch where he has other optimizations, and then push a PR with just the fix.
@bosilca Any update on this?
#6695 addresses this and much more.
Added the blocker label, as this is affecting a real app, and the fix PR #6695 can be picked to v4.0.2.
@bosilca notes that this does not easily cherry-pick to the v3.1.x or v3.0.x branches. As such, @hppritcha and I agree that this will be a Known Issue on the v3.0.x / v3.1.x series: the solution will be to upgrade to >= v4.0.2.
Re-opening this issue until it's been cherry-picked to v4.0.x (based on the Target:v4.0.x label, which is to fix issue 5540).
v4.0.x backport: #6863
The fix has been merged to both master and v4.0.x. Closing this issue.
As reported by the DKRZ folks, creating a datatype with overlapping entries (legal for all send communications) leads to data corruption, as the packing and unpacking functions fail to compute the correct memory layout.
A simple example of such a datatype is:
All Open MPI versions are affected. However, it was decided during the weekly call to fix it only in releases after 3.0 and, of course, master.
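For illustration only (this is not the reproducer from the original report, and not Open MPI's pack code): one common way a vector datatype ends up overlapping is a stride smaller than the block length. The sketch below is a pure-Python model of the layout that `MPI_Type_vector(count, blocklength, stride, oldtype)` describes, with offsets in units of the base type. The helper names `vector_offsets`, `pack`, and `overlaps` are hypothetical and exist only for this example.

```python
def vector_offsets(count, blocklength, stride):
    """Element offsets (in units of the base type) described by a vector type.

    Block i starts at i * stride and covers blocklength consecutive elements,
    following the MPI_Type_vector definition. Hypothetical helper, not an MPI API.
    """
    return [i * stride + j for i in range(count) for j in range(blocklength)]


def pack(buffer, offsets):
    """Reference packing: copy elements in type-map order, repeats included."""
    return [buffer[k] for k in offsets]


def overlaps(offsets):
    """True when at least one element is referenced more than once."""
    return len(set(offsets)) < len(offsets)


if __name__ == "__main__":
    # stride (1) < blocklength (2): consecutive blocks share one element
    offs = vector_offsets(count=3, blocklength=2, stride=1)
    data = [10, 20, 30, 40]
    print(offs)              # [0, 1, 1, 2, 2, 3]
    print(pack(data, offs))  # [10, 20, 20, 30, 30, 40]
    print(overlaps(offs))    # True
```

On the send side, a correct pack of such a type should produce every element in type-map order, duplicates included, as the reference `pack` above does. The corruption described in this issue is presumably a packed stream that deviates from that layout.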