Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v4.0.x OSC: Reset external request to NULL #6810

Merged
merged 1 commit into from
Jul 23, 2019
Merged

Conversation

janjust
Copy link
Contributor

@janjust janjust commented Jul 12, 2019

Addresses issue #6552 (Not reproduced in master).
Reset external request to NULL to avoid double request completion.
Co-authored with Artem Polyakov artemp@mellanox.com

Signed-off-by: Tomislav Janjusic tomislavj@mellanox.com

completion
Co-authored with Artem Polyakov <artemp@mellanox.com>
Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>
@ibm-ompi
Copy link

The IBM CI (XL Compiler) build failed! Please review the log, linked below.

Gist: https://gist.github.com/e3f7a4c1da95c1a8ec079e09da7de012

@artpol84
Copy link
Contributor

@yosefe
I think 2 things contributed to the issue

  1. Rget/Rput were less tested than their non-request counterparts
  2. There is some confusion regarding whether or not UCX will call the request cleanup function at ucp_request_release. It will not, but I used to think it will myself.

@jjhursey
Copy link
Member

bot:ibm:xl:retest

@hppritcha
Copy link
Member

hppritcha commented Jul 13, 2019

I don't think the reference to #6455 is correct.

@hppritcha hppritcha added this to the v4.0.2 milestone Jul 13, 2019
@hppritcha
Copy link
Member

@janjust which issue is this PR suppose to fix?

@janjust
Copy link
Contributor Author

janjust commented Jul 13, 2019

@hppritcha my mistake, I corrected the reference. It fixes #6552

@hppritcha
Copy link
Member

@yosefe could you review this?

@gpaulsen
Copy link
Member

:bot:retest

@hppritcha
Copy link
Member

ompi:bot:retest

@artpol84
Copy link
Contributor

@jsquyres @hppritcha

conftest.c:343:1: warning: function declaration isn't a prototype [-Wstrict-prototypes]
 char __malloc_initialize_hook ();
 ^~~~
conftest.c:352:1: warning: function declaration isn't a prototype [-Wstrict-prototypes]
 main ()
 ^~~~
/usr/lib64/gcc/x86_64-suse-linux/7/../../../../x86_64-suse-linux/bin/ld: /tmp/ccUlyi2d.o: in function `main':
conftest.c:(.text.startup+0x7): undefined reference to `__malloc_initialize_hook'
collect2: error: ld returned 1 exit status

This doesn't look like an issue on this PRs side.

@hppritcha
Copy link
Member

try again
ompi:bot:retest

@hppritcha
Copy link
Member

this PR is also hitting the flex problem:

configure:37017: WARNING: *** Could not find Flex on your system.
configure:37019: WARNING: *** Flex is required for developer builds of Open MPI.
configure:37021: WARNING: *** Other versions of Lex are not supported.
configure:37023: WARNING: *** YOU DO NOT NEED FLEX WHEN BUILDING DISTRIBUTION TARBALLS!
configure:37025: error: Cannot continue

@hppritcha
Copy link
Member

bot:lanl:retest

@hppritcha hppritcha merged commit 667aba9 into open-mpi:v4.0.x Jul 23, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants