task referencer #18914

altendky · 2024-11-20T20:26:59Z

Purpose:

@dataclasses.dataclass
class _TaskReferencer:
    """Holds strong references to tasks until they are done.  This compensates for
    asyncio holding only weak references.  This should be replaced by patterns using
    task groups such as from anyio.
    """

Current Behavior:

New Behavior:

Testing Notes:

benchmark runs:

Draft For:

enable flake8-tidy-imports #18913 so we can ban asyncio.create_task()
considering benchmark results

coveralls-official · 2024-11-22T14:44:42Z

Pull Request Test Coverage Report for Build 12483476002

Details

162 of 180 (90.0%) changed or added relevant lines in 39 files are covered.
55 unchanged lines in 9 files lost coverage.
Overall coverage decreased (-0.02%) to 91.529%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
chia/daemon/server.py	4	5	80.0%
chia/farmer/farmer.py	5	6	83.33%
chia/seeder/crawler.py	2	3	66.67%
chia/server/server.py	3	4	75.0%
chia/util/beta_metrics.py	1	2	50.0%
chia/full_node/full_node.py	14	16	87.5%
chia/timelord/timelord.py	3	5	60.0%
chia/server/ws_connection.py	10	13	76.92%
chia/util/task_referencer.py	25	28	89.29%
chia/wallet/wallet_node.py	6	9	66.67%

Files with Coverage Reduction	New Missed Lines	%
chia/data_layer/data_store.py	1	95.47%
chia/server/node_discovery.py	1	81.87%
chia/daemon/server.py	1	84.3%
chia/wallet/wallet_node.py	1	88.32%
chia/timelord/timelord_launcher.py	4	89.29%
chia/server/address_manager.py	5	90.48%
chia/full_node/full_node.py	7	86.09%
chia/timelord/timelord.py	11	78.82%
chia/_tests/core/util/test_lockfile.py	24	77.31%

Totals
Change from base Build 12475392303:	-0.02%
Covered Lines:	105309
Relevant Lines:	114878

💛 - Coveralls

emlowe · 2024-11-26T20:53:48Z

I like this idea as a for now thing - I would like @arvidn to consider it though

github-actions · 2024-12-11T16:09:56Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2024-12-11T16:13:48Z

Conflicts have been resolved. A maintainer will review the pull request shortly.

github-actions · 2024-12-11T16:16:56Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2024-12-11T16:19:53Z

Conflicts have been resolved. A maintainer will review the pull request shortly.

arvidn

I'm not excited about keeping this task list global, but I understand it's still an improvement over keeping hidden references in the asyncio scheduler as we do today.

One concern I have is that, for tasks that we currently handle correctly, it seems this would unnecessarily extend their lifetime until the next culling. i.e. tasks that we create and hold a reference to, and then await or gather(), will still be kept alive longer than they would today.

Seeing that you wrapped create_task(), at first I also expected that you'd wrap the Task object itself as well, just to be able to hook on it being awaited, and immediately remove it from the task referencer.
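The lifetime concern above can be illustrated with a minimal sketch. It assumes a hypothetical list-based holder, here called `ListReferencer`, that culls finished tasks only when the next task is created; this is an illustration of the concern, not the code in this PR.

```python
import asyncio
from typing import Any, Coroutine, List, Tuple


class ListReferencer:
    """Hypothetical holder that culls finished tasks only on the next create_task()."""

    def __init__(self) -> None:
        self.tasks: List["asyncio.Task[Any]"] = []

    def create_task(self, coro: Coroutine[Any, Any, Any]) -> "asyncio.Task[Any]":
        # Drop finished tasks, then hold a strong reference to the new one.
        self.tasks = [task for task in self.tasks if not task.done()]
        task = asyncio.create_task(coro)
        self.tasks.append(task)
        return task


async def work() -> int:
    await asyncio.sleep(0)
    return 1


async def demo() -> Tuple[bool, bool]:
    referencer = ListReferencer()
    first = referencer.create_task(work())
    await first  # the caller held and awaited the task correctly
    still_held = first in referencer.tasks  # finished, but not culled yet
    second = referencer.create_task(work())  # the next cull happens here
    held_after_cull = first in referencer.tasks
    await second
    return still_held, held_after_cull


result = asyncio.run(demo())
```

In this sketch the already-awaited task stays referenced until some later call happens to trigger a cull, which is the extended lifetime described above.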

altendky · 2024-12-19T15:09:24Z

drafting so we can discuss the raised ideas before a merge happens.

i agree that it will keep a completed task object alive longer. if we have some situation where the task object keeps something else significant alive because of this that could indeed cause trouble. i think the most likely case for that is an exception result that references other problematic objects.

it appears that the task object already provides a solution to this, so perhaps there would be no need to wrap it (see https://docs.python.org/3/library/asyncio-task.html#asyncio.Task.add_done_callback). the cost would be more frequent single-item culling, which would be less efficient with the existing list. it looks like task objects are hashable. i don't recall at the moment if there was another reason i didn't use a set. switching the list to a set should alleviate the cost of more frequent and smaller culling.
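A minimal sketch of the set-plus-callback idea follows. The class name `SetReferencer` and its method are assumptions made for illustration and are not the PR's actual `_TaskReferencer`; only `asyncio.create_task()`, `Task.add_done_callback()` and `set.discard()` are real APIs.

```python
import asyncio
from typing import Any, Coroutine, Set, Tuple


class SetReferencer:
    """Hypothetical sketch: hold tasks in a set and drop each one when it finishes."""

    def __init__(self) -> None:
        self.tasks: Set["asyncio.Task[Any]"] = set()

    def create_task(self, coro: Coroutine[Any, Any, Any]) -> "asyncio.Task[Any]":
        task = asyncio.create_task(coro)
        self.tasks.add(task)
        # The callback receives the finished task; discard() removes it in
        # average constant time and does not raise if it is already gone.
        task.add_done_callback(self.tasks.discard)
        return task


async def work() -> None:
    await asyncio.sleep(0)


async def demo() -> Tuple[int, int]:
    referencer = SetReferencer()
    referencer.create_task(work())  # "detached": the caller keeps no reference
    held_while_pending = len(referencer.tasks)
    await asyncio.sleep(0.05)  # let the task finish and its callback run
    held_after_done = len(referencer.tasks)
    return held_while_pending, held_after_done


counts = asyncio.run(demo())
```

With a set, each single-item removal avoids rebuilding or scanning a list, which is the cost trade-off discussed above.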

arvidn · 2024-12-20T11:00:21Z

we would still need to look through the tasks and remove the ones that are done though, right? To simulate "detach", which seems to be how we commonly use new tasks

altendky · 2024-12-20T15:49:41Z

i expected that somewhere around the time a task changes to the done state it would call the done callback, where we would remove it from the dict. are you concerned that the done callback relates to a task being awaited? I believe that state is independent and represents the completion of execution, not the awaiting of a result.

Python 3.12.7 (main, Nov 20 2024, 20:26:30) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import asyncio
>>> async def af():
...     await asyncio.sleep(0.1)
... 
>>> async def main():
...     task = asyncio.create_task(af())
...     task.add_done_callback(print)
...     await asyncio.sleep(5)
...     print(task.done())
...     await task
... 
>>> asyncio.run(main())
<Task finished name='Task-2' coro=<af() done, defined at <stdin>:1> result=None>
True

there's still room for a bug where a task somehow finishes without triggering the callback, or the callback somehow fails to remove it properly, i suppose. an occasional explicit culling would then catch those tasks that fell through the cracks. i had decided not to leave such behavior in, but it could be added back.
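As a rough sketch of what such a fallback sweep could look like, assuming the tasks are held in a plain set: the helper name `cull_done` is hypothetical, and the "missed callback" case is simulated here by simply not attaching one.

```python
import asyncio
from typing import Any, List, Set, Tuple


def cull_done(tasks: Set["asyncio.Task[Any]"]) -> List["asyncio.Task[Any]"]:
    """Remove and return tasks that are done but are still being held."""
    leftover = [task for task in tasks if task.done()]
    tasks.difference_update(leftover)
    return leftover


async def demo() -> Tuple[int, int]:
    tasks: Set["asyncio.Task[Any]"] = set()
    # Simulate the bug case: the task is held but no done callback removes it.
    task = asyncio.create_task(asyncio.sleep(0))
    tasks.add(task)
    await asyncio.sleep(0.05)  # the task finishes and stays in the set
    leftover = cull_done(tasks)
    return len(leftover), len(tasks)


swept = asyncio.run(demo())
```

Returning the leftover tasks lets a caller log or warn about them, since each one indicates that the callback path did not do its job.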

arvidn · 2024-12-23T20:12:09Z

oh, I see. I assumed the callback would only be called when the task was awaited on. but if it isn't, is the callback called on the same task that's done? and I imagine the remaining lifespan of the task is so short it doesn't matter that we drop the reference to it from within itself.

altendky · 2024-12-24T15:18:11Z

the callback is sync and couldn't be injected into any existing task afaik. also note that the callback is executed when the task is already marked as done. that could still leave us dropping our reference before the task is awaited, but if it is going to be awaited by something else then that other thing must also have a reference that would keep it alive.

>>> import asyncio
>>> async def af():
...     print(f"in task: {asyncio.current_task()}")
...     await asyncio.sleep(0.1)
... 
>>> async def main():
...     print(f"main task: {asyncio.current_task()}")
...     task = asyncio.create_task(af())
...     task.add_done_callback(lambda task: print(f"in done callback: {task.done()} {asyncio.current_task()}"))
...     await asyncio.sleep(5)
...     print(task.done())
...     await task
... 
>>> asyncio.run(main())
main task: <Task pending name='Task-5' coro=<main() running at <stdin>:2> cb=[_run_until_complete_cb() at /home/altendky/.pyenv/versions/3.12.7/lib/python3.12/asyncio/base_events.py:182]>
in task: <Task pending name='Task-6' coro=<af() running at <stdin>:2> cb=[main.<locals>.<lambda>() at <stdin>:4]>
in done callback: True None
True
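The point about a later awaiter holding its own reference can be sketched as follows. The names `held` and `compute` are illustrative assumptions; the set stands in for whatever container the referencer uses.

```python
import asyncio
from typing import Any, Set, Tuple


async def compute() -> int:
    await asyncio.sleep(0)
    return 42


async def demo() -> Tuple[bool, int]:
    held: Set["asyncio.Task[Any]"] = set()
    task = asyncio.create_task(compute())
    held.add(task)
    task.add_done_callback(held.discard)
    await asyncio.sleep(0.05)  # the task finishes; the callback drops the set's reference
    dropped = task not in held
    # The local variable still references the finished task, so awaiting works.
    value = await task
    return dropped, value


outcome = asyncio.run(demo())
```

The result is still retrievable after the container has let go, because the code that intends to await the task necessarily keeps a reference to it.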

github-actions · 2024-12-24T17:34:08Z

File	Coverage	Missing Lines
`chia/daemon/server.py`	80.0%	lines 1060
`chia/farmer/farmer.py`	83.3%	lines 202
`chia/full_node/full_node.py`	87.5%	lines 288, 298
`chia/seeder/crawler.py`	66.7%	lines 223
`chia/server/server.py`	75.0%	lines 506
`chia/server/ws_connection.py`	76.9%	lines 681, 728, 733
`chia/timelord/timelord.py`	60.0%	lines 171, 1140
`chia/util/beta_metrics.py`	50.0%	lines 88
`chia/util/task_referencer.py`	89.3%	lines 23, 53-54
`chia/wallet/wallet_node.py`	66.7%	lines 430, 433, 1242

Total	Missing	Coverage
180 lines	18 lines	90%

altendky added 2 commits November 20, 2024 14:52

put otherwise unheld tasks into the pit

dffd386

and the rest too

143664a

altendky added the Changed and Exclude_Notes labels Nov 20, 2024

altendky added 2 commits November 20, 2024 16:02

cull less often

942d345

task referencer

101533f

altendky changed the title ~~task pit~~ task referencer Nov 20, 2024

private

e50023c

github-actions bot added the coverage-diff label Nov 20, 2024

altendky added 7 commits November 21, 2024 09:20

report unexpectedly unreferenced tasks

f7c3d0c

also warn for automated catching in tests

0c81b6d

reporting task

53dea53

undo some

f890efb

fixup for 3.9

764fbdd

Merge branch 'main' into task_pit

9de4cd4

ban asyncio.create_task

700caf9

altendky marked this pull request as ready for review November 25, 2024 20:21

altendky requested a review from a team as a code owner November 25, 2024 20:21

altendky requested review from AmineKhaldi, arvidn and emlowe November 25, 2024 20:22

emlowe reviewed Nov 26, 2024

chia/_tests/core/server/test_dos.py (outdated, resolved)

emlowe reviewed Nov 26, 2024

pytest.ini (outdated, resolved)

emlowe reviewed Nov 26, 2024

ruff.toml (outdated, resolved)

oof

e631061

altendky mentioned this pull request Dec 2, 2024

Track weight proof tasks #18896

Merged

github-actions bot added the merge_conflict label Dec 11, 2024

Merge branch 'main' into task_pit

b3c016d

github-actions bot removed the merge_conflict label Dec 11, 2024

github-actions bot added the merge_conflict label Dec 11, 2024

Merge branch 'main' into task_pit

9c350ab

github-actions bot removed the merge_conflict label Dec 11, 2024

altendky closed this Dec 18, 2024

altendky reopened this Dec 18, 2024

arvidn previously approved these changes Dec 19, 2024

altendky marked this pull request as draft December 19, 2024 15:04

altendky added 2 commits December 19, 2024 10:10

Merge branch 'main' into task_pit

f10ecc4

simplify including using Task.add_done_callback() for culling

05ea24b

altendky dismissed arvidn’s stale review via 05ea24b December 19, 2024 15:47

altendky marked this pull request as ready for review December 19, 2024 17:09

altendky requested review from arvidn and emlowe December 19, 2024 17:09

arvidn approved these changes Dec 24, 2024

altendky closed this Dec 24, 2024

altendky reopened this Dec 24, 2024

altendky removed the coverage-diff label Dec 25, 2024
