Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v4.0.x: osc/rdma: modify attach to check for region overlap #7415

Conversation

hjelmn
Copy link
Member

@hjelmn hjelmn commented Feb 17, 2020

This commit addresses two issues in osc/rdma:

  1. It is erroneous to attach regions that overlap. This was being
    allowed but the standard does not allow overlapping attachments.

  2. Overlapping registration regions (4k alignment of attachments)
    appear to be allowed. Add attachment bases to the bookeeping
    structure so we can keep better track of what can be detached.

It is possible that the standard did not intend to allow 2). If that
is the case then 2) should fail in the same way as 1). There should
be no technical reason to disallow 2) at this time.

References #7384

This commit addresses two issues in osc/rdma:

 1) It is erroneous to attach regions that overlap. This was being
    allowed but the standard does not allow overlapping attachments.

 2) Overlapping registration regions (4k alignment of attachments)
    appear to be allowed. Add attachment bases to the bookeeping
    structure so we can keep better track of what can be detached.

It is possible that the standard did not intend to allow #2. If that
is the case then #2 should fail in the same way as #1. There should
be no technical reason to disallow #2 at this time.

References open-mpi#7384

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
(cherry picked from commit 6649aef)
Signed-off-by: Nathan Hjelm <hjelmn@google.com>
This commit increaes the osc_rdma_max_attach variable from 32
to 64. The new default is kept low due to the small number
of registration resources on some systems (Cray Aries). A
larger max attachement value can be set by the user on other
systems.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
(cherry picked from commit 54c8233)
Signed-off-by: Nathan Hjelm <hjelmn@google.com>
@hjelmn hjelmn requested a review from bwbarrett February 17, 2020 21:52
@jjhursey
Copy link
Member

bot:ibm:gnu:retest

1 similar comment
@jjhursey
Copy link
Member

bot:ibm:gnu:retest

@open-mpi open-mpi deleted a comment from ibm-ompi Feb 18, 2020
@hppritcha hppritcha added the NEWS label Feb 18, 2020
@hppritcha hppritcha requested review from hppritcha and removed request for bwbarrett February 24, 2020 16:13
@gpaulsen
Copy link
Member

@hppritcha This seems like a lot of changes for this late in the v4.0.3 cycle. Do we need this for v4.0.3?

@hppritcha
Copy link
Member

@gpaulsen I agree.

@awlauria awlauria changed the title osc/rdma: modify attach to check for region overlap v4.0.x: osc/rdma: modify attach to check for region overlap Mar 4, 2020
@gpaulsen
Copy link
Member

gpaulsen commented Mar 5, 2020

@hppritcha I think this is ready to review and merge when you're ready.

@hppritcha hppritcha merged commit 65219eb into open-mpi:v4.0.x Mar 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants