init/finalize: extensions #1007
Conversation
```c
    opal_mutex_unlock(&ompi_mpi_bootstrap_mutex);
    usleep(1);
    opal_mutex_lock(&ompi_mpi_bootstrap_mutex);
}
```
The result of this loop is basically a continuous "lock + unlock" cycle every 1us, in all cases. As you are using a mutex, why don't you rely on the mutex being released by the thread that succeeded to ensure the sequentiality of this code?
I specifically didn't hold the lock for the duration of ompi_mpi_init() / ompi_mpi_finalize(). Hence, I can't have the thread here just block waiting for the lock (which would naturally make it wait until the thread in ompi_mpi_init() completed the function).
Instead, I'm only using the lock to access the variables. Hence, I have to loop around checking them, unlocking, delaying, and locking again.
Are you advocating that I should hold the lock through ompi_mpi_init() / ompi_mpi_finalize()? I could probably do that (and then get rid of this loop, and the similar one in INITIALIZED).
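For illustration, here is a minimal sketch of the poll-under-lock pattern being discussed, using plain pthreads rather than Open MPI's opal_mutex API; the names (state_lock, init_started, init_done, wait_for_init) are invented for this example and are not Open MPI internals:

```c
#include <pthread.h>
#include <stdbool.h>
#include <unistd.h>

/* Illustrative sketch only: invented names, plain pthreads. */
static pthread_mutex_t state_lock = PTHREAD_MUTEX_INITIALIZER;
static bool init_started = false;   /* set when initialization begins   */
static bool init_done    = false;   /* set when initialization finishes */

/* The pattern in the snippet above: the lock protects only the flags,
 * so a waiting thread must repeatedly release it, sleep briefly, and
 * re-acquire it until the initializing thread finishes. */
void wait_for_init(void)
{
    pthread_mutex_lock(&state_lock);
    while (init_started && !init_done) {
        pthread_mutex_unlock(&state_lock);
        usleep(1);                       /* let the initializing thread run */
        pthread_mutex_lock(&state_lock);
    }
    pthread_mutex_unlock(&state_lock);
}
```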
Force-pushed from de8f493 to 58d6f86
@bosilca Code changed to hold the lock for the bulk of the duration of ompi_mpi_init() and ompi_mpi_finalize(), thereby removing the need to loop checking the values of variables.
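A minimal sketch of the revised approach, again with invented names and plain pthreads rather than the actual Open MPI code: because the initializing thread holds the lock for the bulk of the work, a querying thread simply blocks on the mutex instead of polling.

```c
#include <pthread.h>
#include <stdbool.h>

/* Illustrative sketch only: invented names, plain pthreads. */
static pthread_mutex_t state_lock = PTHREAD_MUTEX_INITIALIZER;
static bool initialized = false;

int do_init(void)
{
    /* Hold the lock for the bulk of the initialization work. */
    pthread_mutex_lock(&state_lock);
    if (!initialized) {
        /* ... perform the expensive initialization here ... */
        initialized = true;
    }
    pthread_mutex_unlock(&state_lock);
    return 0;
}

bool is_initialized(void)
{
    pthread_mutex_lock(&state_lock);   /* blocks while do_init() is running */
    bool ret = initialized;
    pthread_mutex_unlock(&state_lock);
    return ret;                        /* no unlock/usleep/lock loop needed */
}
```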
Force-pushed from 58d6f86 to e0f49fa
Thanks Jeff. Also, adding the blurb of text to the man pages is a great idea. The code looks good 👍
@hppritcha Does this look good to you?
Force-pushed from e0f49fa to b2a5f31
The more I think about this, the less I think it makes sense to allow MPI_INIT[_THREAD] to be invoked multiple times safely, but only allow MPI_FINALIZE to be invoked once. Should we give up on the idea of invoking MPI_INIT[_THREAD] multiple times, and just have this PR essentially be the bootstrap_mutex stuff?
```c
bool ompi_mpi_finalized = false;
bool ompi_rte_initialized = false;
int32_t ompi_mpi_finalize_started = false;
opal_mutex_t ompi_mpi_bootstrap_mutex;
```
Looks like the Right way is #1026.
Looks excellent.
Force-pushed from 0b6b809 to d91b717
Test FAILed.
Force-pushed from d91b717 to 76ad8f2
Test FAILed.
Force-pushed from 76ad8f2 to b262382
Test FAILed.
Proposed extensions for Open MPI:
- If MPI_INITIALIZED is invoked and MPI is only partially initialized, wait until MPI is fully initialized before returning.
- If MPI_FINALIZED is invoked and MPI is only partially finalized, wait until MPI is fully finalized before returning.
- If the ompi_mpix_allow_multi_init MCA param is true, allow MPI_INIT and MPI_INIT_THREAD to be invoked multiple times without error (MPI will be safely initialized only the first time it is invoked).
Update language surrounding initialization and finalization in MPI_Init[_thread], MPI_Initialized, MPI_Finalize, and MPI_Finalized.
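As a rough illustration of the proposed MPI_INITIALIZED behavior (this program is not part of the PR, just a sketch of the user-visible semantics described in the commit message above):

```c
/* Illustrative only. One thread queries MPI_Initialized() while the
 * main thread may still be inside MPI_Init_thread(). Under the
 * semantics proposed above, a query that lands mid-initialization
 * waits until MPI is fully initialized and then reports flag == 1;
 * a query that runs before initialization has started reports 0. */
#include <mpi.h>
#include <pthread.h>
#include <stdio.h>

static void *query_thread(void *arg)
{
    int flag = 0;
    (void)arg;
    MPI_Initialized(&flag);   /* may wait if initialization is in progress */
    printf("MPI initialized? %d\n", flag);
    return NULL;
}

int main(int argc, char **argv)
{
    pthread_t tid;
    int provided;

    pthread_create(&tid, NULL, query_thread, NULL);
    MPI_Init_thread(&argc, &argv, MPI_THREAD_MULTIPLE, &provided);
    pthread_join(tid, NULL);

    MPI_Finalize();
    return 0;
}
```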
Force-pushed from b262382 to 40b4d5d
I removed the MPIX MCA param and the ability to call MPI_INIT[_THREAD] multiple times. It's a can of worms. If someone else wants to tackle that issue, feel free to do so. This PR is now just the wait-until-fully-initialized/finalized behavior for MPI_INITIALIZED and MPI_FINALIZED, plus the corresponding man page updates.
@hppritcha @bosilca This is a bit more than we talked about on the phone last week. I ended up using a mutex instead of atomics because I have to check multiple values, so it made more sense to put all the checks within a mutex lock.
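To illustrate the "multiple values under one mutex" point, here is a sketch with invented names (bootstrap_lock and the four flags below are not Open MPI's actual symbols): because several related flags must be examined together, a single lock yields a consistent snapshot that independent atomics would not.

```c
#include <pthread.h>
#include <stdbool.h>

/* Illustrative names only. */
static pthread_mutex_t bootstrap_lock = PTHREAD_MUTEX_INITIALIZER;
static bool init_started     = false;
static bool init_done        = false;
static bool finalize_started = false;
static bool finalize_done    = false;

/* Returns true (and claims the right to initialize) only if it is
 * currently legal to start initialization. All four flags are checked
 * under the same lock, so no other thread can change any of them
 * between the individual tests. */
bool may_start_init(void)
{
    pthread_mutex_lock(&bootstrap_lock);
    bool ok = !init_started && !init_done &&
              !finalize_started && !finalize_done;
    if (ok) {
        init_started = true;
    }
    pthread_mutex_unlock(&bootstrap_lock);
    return ok;
}
```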