Added support for subinterpreter workers #850

Merged
merged 17 commits into master from subinterpreters on Jan 5, 2025

Conversation

agronholm
Owner

Changes

Adds experimental support for subinterpreter workers

Checklist

If this is a user-facing code change, like a bugfix or a new feature, please ensure that
you've fulfilled the following conditions (where applicable):

  • You've added tests (in tests/) which would fail without your patch
  • You've updated the documentation (in docs/, in case of behavior changes or new
    features)
  • You've added a new changelog entry (in docs/versionhistory.rst).

If this is a trivial change, like a typo fix or a code reformatting, then you can ignore
these instructions.

Updating the changelog

If there are no entries after the last release, use **UNRELEASED** as the version.
If, say, your patch fixes issue #123, the entry should look like this:

- Fix big bad boo-boo in task groups
  (`#123 <https://github.com/agronholm/anyio/issues/123>`_; PR by @yourgithubaccount)

If there's no issue linked, just link to your pull request instead by updating the
changelog after you've created the PR.

else:
    return dedent(
        f"""
        {super().__str__()}
Collaborator

formatting

Owner Author

Better now?

try:
    func, args, kwargs = loads(item)
    retval = func(*args, **kwargs)
except Exception as exc:
Collaborator

BaseException?

Owner Author

Changed.
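
A minimal sketch of the pattern under discussion (the loads/dumps calls and the recv/send plumbing are assumptions, not anyio's actual internals): catching BaseException means that even exceptions deriving directly from it are serialized back to the host interpreter instead of silently killing the worker.

from pickle import dumps, loads


def work_loop(recv, send):
    # Hypothetical worker loop: receives pickled (func, args, kwargs)
    # items and reports back (success, payload) tuples.
    while True:
        item = recv()
        try:
            func, args, kwargs = loads(item)
            retval = func(*args, **kwargs)
        except BaseException as exc:
            # BaseException, not Exception, so that e.g. a KeyboardInterrupt
            # raised inside the called function is reported too.
            send(dumps((False, exc)))
        else:
            send(dumps((True, retval)))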

    _interpreter_id: int
    _queue_id: int

    async def initialize(self) -> None:
Collaborator

Suggested change:

-    async def initialize(self) -> None:
+    def initialize(self) -> None:

Does this need to be async?

Owner Author

No, and in fact it should not set up the interpreter in the event loop thread either. I'll fix both issues at once.
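
A rough sketch of the fix being described (hypothetical names; the low-level interpreters module is spelled differently across Python versions): make initialize() synchronous and call it from the worker thread, so that creating the subinterpreter never blocks the event loop thread.

import _interpreters  # private CPython module as of 3.13; name varies by version


class Worker:
    _interpreter_id: int
    _queue_id: int

    def initialize(self) -> None:
        # Runs in the worker thread, not the event loop thread, so the
        # potentially slow interpreter creation cannot stall the loop.
        self._interpreter_id = _interpreters.create()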

@agronholm
Owner Author

Do we need pruning of interpreters that have been unused for too long?

@richardsheridan
Contributor

Actually, I was going to comment on that. Following the Trio thread cache model has been standard. Unlike threads or processes, though, it seems like your implementation would not be able to free up any resources of a timed-out worker until someone calls run_sync. There might be a better way to free up resources after something caches a bunch of interpreters? Otherwise you need something like a background pruning task.

Also, are there any implications for unexpected behavior as the subinterpreters jump between different threads?

@agronholm
Owner Author

> Actually, I was going to comment on that. Following the Trio thread cache model has been standard. Unlike threads or processes, though, it seems like your implementation would not be able to free up any resources of a timed-out worker until someone calls run_sync. There might be a better way to free up resources after something caches a bunch of interpreters? Otherwise you need something like a background pruning task.

I was thinking of pruning unused workers before or after a call to run_sync(). IIRC we do the same with unused worker threads.

> Also, are there any implications for unexpected behavior as the subinterpreters jump between different threads?

None that I'm aware of.
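
A minimal sketch of the pruning approach being floated here (all names hypothetical): on each call to run_sync(), destroy idle workers whose last use is older than a cutoff.

import time
from collections import deque

MAX_IDLE_TIME = 30  # seconds; hypothetical cutoff

# (last_used_timestamp, worker) pairs, oldest on the left.
idle_workers: deque = deque()


def prune_idle_workers() -> None:
    # Called before or after each run_sync(): interpreters idle past
    # the cutoff are destroyed so they stop holding onto resources.
    cutoff = time.monotonic() - MAX_IDLE_TIME
    while idle_workers and idle_workers[0][0] < cutoff:
        _, worker = idle_workers.popleft()
        worker.destroy()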

@richardsheridan
Contributor

Separately, the cancellation story for a busy worker seems poor here. If you abandon the thread, the destroy in the atexit handler will fail. Not sure what the consequences will be!

@agronholm
Owner Author

> Separately, the cancellation story for a busy worker seems poor here. If you abandon the thread, the destroy in the atexit handler will fail. Not sure what the consequences will be!

But the worker threads should all be gone by the time the atexit hooks are run?

@agronholm
Owner Author

> Separately, the cancellation story for a busy worker seems poor here. If you abandon the thread, the destroy in the atexit handler will fail. Not sure what the consequences will be!

> But the worker threads should all be gone by the time the atexit hooks are run?

I tested with this:

import time

import anyio
from anyio import to_interpreter


async def main():
    await to_interpreter.run_sync(time.sleep, 6, abandon_on_cancel=True)


anyio.run(main)

It won't exit the process until the worker thread has run its course.

@richardsheridan
Contributor

> Actually, I was going to comment on that. Following the Trio thread cache model has been standard. Unlike threads or processes, though, it seems like your implementation would not be able to free up any resources of a timed-out worker until someone calls run_sync. There might be a better way to free up resources after something caches a bunch of interpreters? Otherwise you need something like a background pruning task.

> I was thinking of pruning unused workers before or after a call to run_sync(). IIRC we do the same with unused worker threads.

I think you mean processes? Anyway, the issue is that processes can automatically free up most resources, except for the OS process table bits that need waiting on. An interpreter might sit on gigabytes of memory or thousands of sockets until the next call, rather than only for the prescribed timeout.

> Separately, the cancellation story for a busy worker seems poor here. If you abandon the thread, the destroy in the atexit handler will fail. Not sure what the consequences will be!

> But the worker threads should all be gone by the time the atexit hooks are run?

Not if the subinterpreter is deadlocked or something like that!

Also, just realized that a cancelled subinterpreter worker should definitely not be returned to the idle worker queue.

@agronholm
Owner Author

> Actually, I was going to comment on that. Following the Trio thread cache model has been standard. Unlike threads or processes, though, it seems like your implementation would not be able to free up any resources of a timed-out worker until someone calls run_sync. There might be a better way to free up resources after something caches a bunch of interpreters? Otherwise you need something like a background pruning task.

> I was thinking of pruning unused workers before or after a call to run_sync(). IIRC we do the same with unused worker threads.

> I think you mean processes? Anyway, the issue is that processes can automatically free up most resources, except for the OS process table bits that need waiting on. An interpreter might sit on gigabytes of memory or thousands of sockets until the next call, rather than only for the prescribed timeout.

> Separately, the cancellation story for a busy worker seems poor here. If you abandon the thread, the destroy in the atexit handler will fail. Not sure what the consequences will be!

> But the worker threads should all be gone by the time the atexit hooks are run?

> Not if the subinterpreter is deadlocked or something like that!

Explain please. The task might abandon the worker thread, but the worker thread won't abandon the subinterpreter; it will continue to run the given code until it completes.

> Also, just realized that a cancelled subinterpreter worker should definitely not be returned to the idle worker queue.

Yes, if abandon_on_cancel=True, we should not add the subinterpreter to the idle queue when cancelled. I will deal with this somehow.
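
A sketch of that guard (hypothetical names and pool plumbing, not the actual implementation): the worker is only returned to the idle queue when the call was not abandoned.

import time

import anyio


async def run_sync(func, *args, abandon_on_cancel: bool = False):
    # Hypothetical host-side wrapper around a pooled subinterpreter worker.
    worker = get_or_create_worker()
    abandoned = False
    try:
        return await call_in_worker(worker, func, args)
    except anyio.get_cancelled_exc_class():
        if abandon_on_cancel:
            # The worker may still be running the abandoned call, so it
            # must never be handed out to the next caller.
            abandoned = True
        raise
    finally:
        if not abandoned:
            idle_workers.append((time.monotonic(), worker))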

@richardsheridan
Contributor

> It won't exit the process until the worker thread has run its course.

Is this all happening in a non-daemon thread, or is Python's interpreter shutdown logic doing this? Either way it seems problematic. Maybe an initial release could ignore "abandon_on_cancel"?

@agronholm
Owner Author

> It won't exit the process until the worker thread has run its course.

> Is this all happening in a non-daemon thread, or is Python's interpreter shutdown logic doing this? Either way it seems problematic. Maybe an initial release could ignore "abandon_on_cancel"?

The worker threads are daemonic, but there is a hook added to the root task that will ensure the threads have finished before the event loop exits. It's not 100% foolproof (what is?) but good enough for the vast majority of cases.

But I'm okay with leaving out abandon_on_cancel in the initial release.
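
A rough sketch of the kind of hook being described (not anyio's actual mechanism; all names hypothetical): the worker threads are daemonic, but joining them when the root task finishes means abandoned calls still run to completion before the event loop exits.

import threading

# Hypothetical registry of the pool's daemonic worker threads.
worker_threads: list[threading.Thread] = []


def join_workers_on_exit() -> None:
    # Run as the root task finishes, before the event loop closes. Even
    # though the threads are daemonic, joining here keeps the process
    # alive until every abandoned call has finished.
    for thread in worker_threads:
        thread.join()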

@richardsheridan
Contributor

> But the worker threads should all be gone by the time the atexit hooks are run?

> Not if the subinterpreter is deadlocked or something like that!

> Explain please. The task might abandon the worker thread, but the worker thread won't abandon the subinterpreter; it will continue to run the given code until it completes.

Maybe I don't understand the order of operations on exit. I thought the main script ends, then atexits are run, then daemon threads are held up acquiring the GIL, then interpreters are destroyed, then the Python runtime ends.

Reading your previous message, it sounds like anyio interjects another step in there.

I think cancellation should wait until subinterpreters have a better interruption story. I tried and failed to make workers that run on channels and thread/interpreter pairs.

Review comment on docs/subinterpreters.rst (outdated, resolved)
Co-authored-by: Jordan Speicher <uSpike@users.noreply.github.com>
@richardsheridan
Contributor

> ... processes can automatically free up most resources, except for the OS process table bits that need waiting on.

I just reviewed the anyio.to_process implementation and noticed it lacks this ability as well, so I guess there's no strong reason to try to implement it for interpreters in this PR.

We can add it back later; it just needs to be consistent across the worker thread/interpreter/process APIs.
@agronholm agronholm requested a review from graingert January 4, 2025 23:26
@agronholm agronholm merged commit 264a6f9 into master Jan 5, 2025
17 checks passed
@agronholm agronholm deleted the subinterpreters branch January 5, 2025 12:54