Introduce separate channel for trigger workloads #48835

gopidesupavan · 2025-04-05T14:31:39Z

Why

Introduce a separate channel for trigger workloads.
Currently, when multiple tasks enter the triggerer, their messages get mixed up with the trigger workloads. IMHO this happens on the read side, because both kinds of message are read from the same sys.stdin. When they mix, we lose trigger workloads because they are read in the wrong place.
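A rough sketch of the failure mode, assuming newline-delimited JSON on a single shared stream. The message types and the `read_message` helper are made up for illustration and are not Airflow's actual classes:

```python
import io
import json

# One shared stream carrying both API responses and trigger workloads.
# The message shapes here are hypothetical.
shared = io.BytesIO(
    b'{"type": "ConnectionResult", "conn_id": "conn_a"}\n'
    b'{"type": "StartTrigger", "id": 1}\n'
)


def read_message(stream):
    """Read one newline-delimited JSON message from the stream."""
    return json.loads(stream.readline())


# Two different places read from the same stream. Whoever reads first
# gets the next line, regardless of which reader it was meant for.
workload_reader_got = read_message(shared)  # wanted a StartTrigger
trigger_got = read_message(shared)          # wanted a ConnectionResult

print(workload_reader_got["type"])
print(trigger_got["type"])
```

Each reader ends up with the message intended for the other one, which matches the symptom described above.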

What

Created two new sockets to handle trigger workloads: the main process writes workloads to trigger_stdin, and the child process reads them from trigger_requests_fd.
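A minimal sketch of the idea, run in a single process for simplicity. It assumes newline-delimited JSON and uses `socket.socketpair()`; the variable names, message shapes, and helpers are illustrative only and do not reproduce the supervisor code in this PR:

```python
import json
import socket

# Two independent channels (names are hypothetical):
# one for API requests/responses, one for trigger workloads only.
parent_comms, child_comms = socket.socketpair()
parent_workloads, child_workloads = socket.socketpair()


def send(sock, msg):
    """Write one message as a single JSON line."""
    sock.sendall(json.dumps(msg).encode("utf-8") + b"\n")


def read_line(reader):
    """Read one JSON line from a buffered reader."""
    return json.loads(reader.readline())


# Parent side: each kind of message goes to its own socket.
send(parent_comms, {"type": "ConnectionResult", "conn_id": "conn_a"})
send(parent_workloads, {"type": "StartTrigger", "id": 1})

# Child side: each reader only ever sees its own kind of message,
# no matter in which order the reads happen.
comms_reader = child_comms.makefile("rb")
workload_reader = child_workloads.makefile("rb")

workload = read_line(workload_reader)
response = read_line(comms_reader)

# Close everything explicitly once done.
for closable in (comms_reader, workload_reader, parent_comms, child_comms,
                 parent_workloads, child_workloads):
    closable.close()
```

Because the workload reader has its own file descriptor, a response arriving at the same time can no longer be consumed in its place.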

I observed this behaviour while testing DagStateTrigger/WorkflowTrigger.

Connections fail to be retrieved in the triggerer when multiple tasks are running in it.

After

Connections are retrieved properly for every task, each with its own connection ID. I created a separate connection for each task to verify whether messages were getting mixed up. Looking good.

Add tests


gopidesupavan · 2025-04-05T14:33:31Z

Is this approach okay?

gopidesupavan · 2025-04-05T22:54:36Z

task-sdk/src/airflow/sdk/execution_time/supervisor.py

        child_comms, read_msgs = mkpipe()
        child_logs, read_logs = mkpipe()

+        if "TriggerRunnerSupervisor" in cls.__name__:


Open these sockets only in the triggerer; the other supervisors don't need them.
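For illustration, a stripped-down sketch of that conditional, assuming a base class with a classmethod that sets up the channels. `Supervisor` and `open_channels` are hypothetical stand-ins; only the name check is taken from the diff above:

```python
import socket


class Supervisor:
    """Hypothetical stand-in for the base supervisor class."""

    @classmethod
    def open_channels(cls):
        channels = {"comms": socket.socketpair()}
        # Same name check as in the diff: only the triggerer's
        # supervisor gets the extra workload channel.
        if "TriggerRunnerSupervisor" in cls.__name__:
            channels["workloads"] = socket.socketpair()
        return channels


class TriggerRunnerSupervisor(Supervisor):
    """Hypothetical stand-in for the triggerer's supervisor."""


base = Supervisor.open_channels()
trig = TriggerRunnerSupervisor.open_channels()

print(sorted(base))  # ['comms']
print(sorted(trig))  # ['comms', 'workloads']

# Close every socket that was opened.
for channels in (base, trig):
    for parent_end, child_end in channels.values():
        parent_end.close()
        child_end.close()
```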

ashb · 2025-04-06T06:38:14Z

I'll take a closer look at this on Monday

gopidesupavan · 2025-04-06T14:29:34Z

I'll take a closer look at this on Monday

cool thanks :)

pierrejeambrun

I just made a couple of nits

Makes sense to me, I'll let Ash have the final word on this.

pierrejeambrun · 2025-04-07T11:53:29Z

task-sdk/src/airflow/sdk/execution_time/task_runner.py

+    """
+    It requires a special case to handle the workloads and api calls to api-server, due to mixing up messages
+    a separate channel is used to send the workloads from the parent process to the child process.
+    connect_stdin will use this channel to read the workloads in read_workload method.
+    """


If I remember correctly, field docstrings should go below the attribute they describe, not above.
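To illustrate the placement: PEP 257 calls a string literal that comes immediately after a simple assignment at class level an "attribute docstring", and documentation tools such as Sphinx autodoc pick it up from that position. A small sketch using a stdlib dataclass; the class name is a hypothetical stand-in and the field type is an assumption:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Decoder:
    """Hypothetical stand-in for the class being reviewed."""

    trigger_requests_fd: Optional[int] = None
    """
    File descriptor of the separate channel used to send workloads
    from the parent process to the child process.
    """


decoder = Decoder(trigger_requests_fd=3)
print(decoder.trigger_requests_fd)
```

The string has no effect at runtime; it only documents the field that precedes it.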

pierrejeambrun · 2025-04-07T11:57:50Z

airflow-core/src/airflow/jobs/triggerer_job_runner.py

        )

    def _send(self, msg: BaseModel):
+        self.trigger_stdin.write(msg.model_dump_json().encode("utf-8") + b"\n")  # type: ignore[union-attr]


That type ignore seems off. It wasn't necessary before, and the code looks really similar now:

gopidesupavan · 2025-04-07T18:22:03Z

Thanks @pierrejeambrun for the review. I will be closing this in favour of #48880.

Introduce separate channel for trigger workloads

edb0d73

gopidesupavan requested review from amoghrajesh, ashb, dstandish, hussein-awala and kaxil as code owners April 5, 2025 14:31

boring-cyborg bot added area:task-sdk area:Triggerer labels Apr 5, 2025

gopidesupavan added 2 commits April 5, 2025 17:24

Fix tests

da6c247

Add test to check multiple deferrable tasks gets correct messages

665b665

gopidesupavan requested a review from pierrejeambrun April 5, 2025 21:12

gopidesupavan added 2 commits April 5, 2025 22:22

Add comment to new fd

b151939

Close sockets properly

970f8a2

gopidesupavan commented Apr 5, 2025

gopidesupavan mentioned this pull request Apr 5, 2025

Fix dagstate trigger to work with TaskSDK #48747

Merged

pierrejeambrun reviewed Apr 7, 2025

gopidesupavan closed this Apr 7, 2025
