GH-109978: Allow multiprocessing finalizers to run on a separate thread #110510

pitrou · 2023-10-07T21:18:03Z

Issue: Allow multiprocessing finalizers to run on a separate thread #109978

…e thread

pitrou · 2023-10-07T21:27:37Z

This is an attempt at solving the aforementioned issue (let non-reentrant multiprocessing finalizers run on a separate thread)... but there are complications due to the fact that multiprocessing obviously relies on fork, which strongly prefers a single-thread setup.

@vstinner @gpshead

pitrou · 2023-10-07T21:28:06Z

Lib/multiprocessing/util.py

+        if thread_was_running:
+            # HACK: os.fork() queries the system for the number of running threads.
+            # However, Thread.join() only ensures that the Python thread state
+            # was destroyed, while the system thread could still be running.
+            # Give it time to exit.
+            time.sleep(0.001)


This is a bit unfortunate. @gpshead

Agreed, this is effectively because our join() has never truely been a join... I filed #110829 to track that, it is probably time to fix that.

vstinner · 2023-10-12T23:12:17Z

I think that it will skip my turn for this one. I touched too many multiprocessing code recently, and I was bitten.

Maybe @serhiy-storchaka wants to have a look, he did something like that recently (I recall vaguely).

gpshead

I haven't tried to understand the whole thing yet, but i've given it a once over to look at the issues this is running into.

gpshead · 2023-10-13T13:31:59Z

Lib/multiprocessing/util.py

+        if thread_was_running:
+            # HACK: os.fork() queries the system for the number of running threads.
+            # However, Thread.join() only ensures that the Python thread state
+            # was destroyed, while the system thread could still be running.
+            # Give it time to exit.
+            time.sleep(0.001)


Agreed, this is effectively because our join() has never truely been a join... I filed #110829 to track that, it is probably time to fix that.

gpshead · 2023-10-13T13:32:56Z

Modules/posixmodule.c

@@ -7625,6 +7625,9 @@ static void warn_about_fork_with_threads(const char* name) {
            num_python_threads = atoi(field);  // 0 on error
        }
    }
+    // XXX This counts the number of system threads, but Python code can only
+    // call threading.Thread.join(), which can return before the system thread ended.
+    // This function could therefore print a spurious warning in unlucky cases.


It's not spurious though, the existence of a thread doing anything is accurately identified as a potential problem. The annoyance that led to this though is that Python didn't provide a concrete way to guarantee a thread has exited.

serhiy-storchaka · 2023-10-13T12:56:28Z

Lib/multiprocessing/pool.py

+        while self._pool:
+            self._pool.pop().join()


What is the difference?

serhiy-storchaka · 2023-10-13T13:58:44Z

Lib/multiprocessing/util.py

+        with self._lock():
+            return self._stopped


Why is the lock needed for reading an attribute?

serhiy-storchaka · 2023-10-13T14:08:33Z

Lib/multiprocessing/util.py

+
+    def wait_until_idle(self):
+        with self._queue_drained_cond:
+            self._queue_drained_cond.wait_for(lambda: self._queue.empty())


Suggested change

self._queue_drained_cond.wait_for(lambda: self._queue.empty())

self._queue_drained_cond.wait_for(self._queue.empty)

serhiy-storchaka · 2023-10-13T14:11:03Z

Lib/multiprocessing/util.py

+        # from the work loop.
+        assert callable(cb)
+        with self._lock:
+            if not self._stopped and self._thread is not None:


Isn't self._thread always None when self._stopped is True?

serhiy-storchaka · 2023-10-13T14:16:24Z

Lib/multiprocessing/util.py

+        finally:
+            cb = None


Maybe add a reference to _work_loop() or copy the comment? I was confused when I saw this.

Hmm, _work_queue.enqueue_task() is always called from the global enqueue_task()which keeps a reference tocb`, so it perhaps does not help.

serhiy-storchaka · 2023-10-13T14:28:44Z

Lib/multiprocessing/util.py

-                sub_debug('finalizer ignored because different process')
-                res = None
-            else:
+            try:


There is already one try a level above. The second try may be not needed, you can simply add finally at the same level after else. The difference is that the finally block will be exececuted in the except case, but I do not see what can be wrong with this.

pitrou · 2023-10-15T15:10:20Z

FWIW, I'm prioritizing #110848 now, as it should make this PR slightly more robust.

bedevere-app bot mentioned this pull request Oct 7, 2023

Allow multiprocessing finalizers to run on a separate thread #109978

Open

pythonGH-109978: Allow multiprocessing finalizers to run on a separat…

3664a79

…e thread

pitrou force-pushed the gh109978-mp-finalizer-thread branch from a7d3581 to 3664a79 Compare October 7, 2023 21:25

pitrou commented Oct 7, 2023

View reviewed changes

serhiy-storchaka self-requested a review October 13, 2023 12:48

gpshead mentioned this pull request Oct 13, 2023

threading Thread.join should call the OS join API #110829

Closed

gpshead reviewed Oct 13, 2023

View reviewed changes

serhiy-storchaka reviewed Oct 13, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

GH-109978: Allow multiprocessing finalizers to run on a separate thread #110510

GH-109978: Allow multiprocessing finalizers to run on a separate thread #110510

Uh oh!

pitrou commented Oct 7, 2023 •

edited by bedevere-app bot

Loading

Uh oh!

pitrou commented Oct 7, 2023

Uh oh!

pitrou Oct 7, 2023

Uh oh!

gpshead Oct 13, 2023

Uh oh!

vstinner commented Oct 12, 2023

Uh oh!

gpshead left a comment

Uh oh!

gpshead Oct 13, 2023

Uh oh!

gpshead Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

serhiy-storchaka Oct 13, 2023

Uh oh!

pitrou commented Oct 15, 2023

Uh oh!

Uh oh!

	self._queue_drained_cond.wait_for(lambda: self._queue.empty())
	self._queue_drained_cond.wait_for(self._queue.empty)

Uh oh!

GH-109978: Allow multiprocessing finalizers to run on a separate thread #110510

Are you sure you want to change the base?

GH-109978: Allow multiprocessing finalizers to run on a separate thread #110510

Uh oh!

Conversation

pitrou commented Oct 7, 2023 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pitrou commented Oct 7, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vstinner commented Oct 12, 2023

Uh oh!

gpshead left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pitrou commented Oct 15, 2023

Uh oh!

Uh oh!

pitrou commented Oct 7, 2023 •

edited by bedevere-app bot

Loading