Disable ORM access from Tasks, DAG processing and Triggers #47320

ashb · 2025-03-03T23:23:12Z

It's about time we delivered on one of the key points of AIP-72: DB isolation from workers.
(To be honest, it's probably past time, but now is the second best time)

All of these use the Workload supervisor from the TaskSDK and the main paths
XCom, Variables and Secrets) have all been ported to use the Execution API,
so it's about time we disabled DB access.

Note: this will almost certainly break a few things, like Skip mixin based tasks in particular - that is WIP in 46584

Also closes #47232 as that was failing if configure_orm was never called.

^ Add meaningful description above
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.

airflow/settings.py

airflow/dag_processing/manager.py

jscheffl

LGTM!

All of these use the Workload supervisor from the TaskSDK and the main paths (XCom, Variables and Secrets) have all been ported to use the Execution API, so it's about time we disabled DB access.

ashb · 2025-03-06T13:30:00Z

Oof that was a bit of a mission to land.

jscheffl · 2025-03-06T14:45:00Z

Oof that was a bit of a mission to land.

Wohooo!

jedcunningham · 2025-03-06T15:08:42Z

#protm

potiuk · 2025-03-06T19:03:32Z

Oof that was a bit of a mission to land.

Indeed

This change seems innocuous, and possibly even wrong, but it is the correct behaviour since #47320 landed. We _do not_ want to call dispose_orm, as that ends up reconnecting, and sometimes this results in the wrong connection being shared between the parent and the child. I don't love the "sometimes" nature of this bug, but the fix seems sound. Prior to this running one or two runs concurrently would result in the scheduler handing (stuck in SQLA code trying to roll back) or an error from psycopg about "error with status PGRES_TUPLES_OK and no message from the libpq". With this change we were able to repeatedly run 10 runs concurrently. The reason we don't want this is that we registered an at_fork handler already that closes/discards the socket object (without closing the DB level session) so calling dispose can, perversely, resurrect that object and try reusing it! Co-authored-by: Jed Cunningham <66968678+jedcunningham@users.noreply.github.com> Co-authored-by: Kaxil Naik <kaxilnaik@apache.org>

) All of these use the Workload supervisor from the TaskSDK and the main paths (XCom, Variables and Secrets) have all been ported to use the Execution API, so it's about time we disabled DB access.

This change seems innocuous, and possibly even wrong, but it is the correct behaviour since apache#47320 landed. We _do not_ want to call dispose_orm, as that ends up reconnecting, and sometimes this results in the wrong connection being shared between the parent and the child. I don't love the "sometimes" nature of this bug, but the fix seems sound. Prior to this running one or two runs concurrently would result in the scheduler handing (stuck in SQLA code trying to roll back) or an error from psycopg about "error with status PGRES_TUPLES_OK and no message from the libpq". With this change we were able to repeatedly run 10 runs concurrently. The reason we don't want this is that we registered an at_fork handler already that closes/discards the socket object (without closing the DB level session) so calling dispose can, perversely, resurrect that object and try reusing it! Co-authored-by: Jed Cunningham <66968678+jedcunningham@users.noreply.github.com> Co-authored-by: Kaxil Naik <kaxilnaik@apache.org>

Some recent changes in main and documentation published in the inventories, made the 2.10 doc building fail as references to non-existing docs in the new inventories were still used in the documentation for 2.10 This PR fixes it by changing the docs to not refer to those changed docs any more. The PRs that removed the links: apache#47320 and apache#47399 Co-authored-by: LIU ZHE YOU <zhu424.dev@gmail.com> Co-authored-by: Jarek Potiuk <jarek@potiuk.com>

Some recent changes in main and documentation published in the inventories, made the 2.10 doc building fail as references to non-existing docs in the new inventories were still used in the documentation for 2.10 This PR fixes it by changing the docs to not refer to those changed docs any more. The PRs that removed the links: #47320 and #47399 Co-authored-by: Jarek Potiuk <jarek@potiuk.com>

ashb requested review from eladkal, ephraimbuddy, jscheffl, o-nikolas and pierrejeambrun as code owners March 3, 2025 23:23

boring-cyborg bot added area:API Airflow's REST/HTTP API area:logging area:providers area:task-sdk provider:amazon AWS/Amazon - related issues provider:edge Edge Executor / Worker (AIP-69) / edge3 provider:fab labels Mar 3, 2025

ashb force-pushed the disable-db-access-tasks branch from 18faa85 to eb14f67 Compare March 3, 2025 23:24

ashb changed the title ~~disable db access tasks~~ Disable ORM access from Tasks, DAG processing and Triggers Mar 3, 2025

ashb force-pushed the disable-db-access-tasks branch from eb14f67 to 2a77c31 Compare March 4, 2025 11:21

ashb requested a review from hussein-awala as a code owner March 4, 2025 11:21

ashb requested a review from jedcunningham as a code owner March 4, 2025 14:01

ashb commented Mar 4, 2025

View reviewed changes

airflow/settings.py Outdated Show resolved Hide resolved

vincbeck approved these changes Mar 4, 2025

View reviewed changes

jedcunningham approved these changes Mar 4, 2025

View reviewed changes

ashb force-pushed the disable-db-access-tasks branch from e55d781 to 1176f81 Compare March 4, 2025 18:23

Lee-W self-requested a review March 5, 2025 02:55

ashb force-pushed the disable-db-access-tasks branch from 7d81e91 to 0f19bf7 Compare March 5, 2025 12:56

ashb commented Mar 5, 2025

View reviewed changes

airflow/dag_processing/manager.py Show resolved Hide resolved

ashb added the full tests needed We need to run full set of tests for this PR to merge label Mar 5, 2025

ashb force-pushed the disable-db-access-tasks branch 3 times, most recently from 1a9f977 to 93aacd2 Compare March 5, 2025 17:06

jedcunningham force-pushed the disable-db-access-tasks branch from 93aacd2 to 63cb614 Compare March 5, 2025 19:20

jscheffl approved these changes Mar 5, 2025

View reviewed changes

ashb force-pushed the disable-db-access-tasks branch 2 times, most recently from 5c5e1ea to c127964 Compare March 6, 2025 11:24

ashb added 2 commits March 6, 2025 12:27

Disable ORM access from Tasks, DAG processing and Triggers

eef128c

All of these use the Workload supervisor from the TaskSDK and the main paths (XCom, Variables and Secrets) have all been ported to use the Execution API, so it's about time we disabled DB access.

fixup! Disable ORM access from Tasks, DAG processing and Triggers

34b7152

ashb force-pushed the disable-db-access-tasks branch from c127964 to 34b7152 Compare March 6, 2025 12:28

ashb merged commit c440959 into main Mar 6, 2025
89 checks passed

ashb deleted the disable-db-access-tasks branch March 6, 2025 13:29

Lee-W mentioned this pull request Mar 7, 2025

feat(api_fastapi): include asset id in asset nodes when calling "/ui/dependencies" and "/ui/structure/structure_data" #47381

Merged

ashb mentioned this pull request Mar 12, 2025

Make LocalExecutor work under heavy load #47678

Merged

potiuk mentioned this pull request Apr 12, 2025

[v2-10-test] Fix Building docs CI #48568

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable ORM access from Tasks, DAG processing and Triggers #47320

Disable ORM access from Tasks, DAG processing and Triggers #47320

Uh oh!

ashb commented Mar 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

jscheffl left a comment

Uh oh!

Uh oh!

ashb commented Mar 6, 2025

Uh oh!

jscheffl commented Mar 6, 2025

Uh oh!

jedcunningham commented Mar 6, 2025

Uh oh!

potiuk commented Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Disable ORM access from Tasks, DAG processing and Triggers #47320

Disable ORM access from Tasks, DAG processing and Triggers #47320

Uh oh!

Conversation

ashb commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jscheffl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ashb commented Mar 6, 2025

Uh oh!

jscheffl commented Mar 6, 2025

Uh oh!

jedcunningham commented Mar 6, 2025

Uh oh!

potiuk commented Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ashb commented Mar 3, 2025 •

edited

Loading