Implement XComArg concat() #40172

uranusjr · 2024-06-11T09:37:26Z

This is useful when you want to do a thing against more than one list of things. You can sort of already do this with an extra task, but this is lazier and saves some resources both in the XCom storage and memory needed to run tasks.

hussein-awala

To be honest, I'm not completely convinced of the importance of this feature, but it might be useful in some case.

Could you update the documentation by describing the new XCom operation chain?

josh-fell

My initial reaction was the semantics could be confused with a TaskFlow-specific implementation of airflow.models.baseoperator.chain but it doesn't function the same way. This seems to mirror itertools.chain(), but because there is already a chain() method at the operator level we might need a different semantic? Maybe .combine() is another option?

Curious what others think too.

This is cool, and I would definitely use this personally.

josh-fell · 2024-06-13T03:32:15Z

Oddly enough, this seems useful for the discussion in #40124?

uranusjr · 2024-06-13T04:15:37Z

I briefly searched how other ecosystems call this and concat seems popular (behind chain). combine is more often like join but I think it still is an option in this case since the joining semantic does not make sense.

potiuk · 2024-06-13T05:34:18Z

I briefly searched how other ecosystems call this and concat seems popular (behind chain). combine is more often like join but I think it still is an option in this case since the joining semantic does not make sense.

Maybe append is a better name ? I agree with @hussein-awala -> we need some docs to make it discoverable, it's a rather useful feature, but having a description and example of use is pretty much a "MUST HAVE" here.

uranusjr · 2024-06-13T11:03:27Z

I don’t like append personally since in Python it’s a method that modifies the object in-place, not creating a new object. (Same for extend although nobody mentions it yet.) I plan to add docs after we agree on the interface.

potiuk · 2024-06-13T20:03:57Z

Maybe just xcom_chain() or chain_xcom() ? That would remove confusion and keep itertools relation - at the expense of somewhat redundancy.

uranusjr · 2024-06-17T04:51:50Z

I think I’m going to go with concat. This is used in JavaScript. Java has Stream.concat (which is more like itertools.chain, close enough). Various languages also have concat for strings. Probably among the most solid choices beside chain.

potiuk · 2024-06-17T08:54:32Z

SGTM

uranusjr · 2024-06-17T10:09:56Z

Alright, I’ve made the name change and added docs.

eladkal · 2024-06-17T10:17:29Z

This is useful when you want to do a thing against more than one list of things.

Very nice! BTW I think it's also interesting to be able to do thing against specific items in the XCOM rather than all of it.

josh-fell

Looks great!

Just curious if DAG authors would also be able to concat XComArgs from two sets of mapped tasks? If yes, then maybe worth adding to the docs too?

airflow/models/xcom_arg.py

uranusjr · 2024-06-17T13:58:00Z

concat XComArgs from two sets of mapped tasks

Not sure I follow this

josh-fell · 2024-06-17T16:08:11Z

concat XComArgs from two sets of mapped tasks

Not sure I follow this

Related to a question I could think of DAG authors asking: "Does this work with mapped-task inputs?".

  graph LR;
      mapped_task_1-->aggregate_mapped_tasks;
      mapped_task_2-->aggregate_mapped_tasks;

meaning

aggregate_mapped_tasks(input=mapped_task_1.concat(mapped_task_2))

Looking again the "... concat function takes arbitrary positional arguments ..." statement covers this so ignore. Still looks great.

uranusjr · 2024-06-18T03:36:26Z

I added some text to call out chaining calls.

Implement ChainXComArg

dcbc175

hussein-awala reviewed Jun 11, 2024

View reviewed changes

josh-fell reviewed Jun 13, 2024

View reviewed changes

uranusjr added 2 commits June 17, 2024 17:38

Rename to concat()

0ad5608

Add documentation for concat()

4498833

uranusjr requested a review from potiuk as a code owner June 17, 2024 10:09

uranusjr changed the title ~~Implement ChainXComArg~~ Implement XComArg concat() Jun 17, 2024

eladkal approved these changes Jun 17, 2024

View reviewed changes

josh-fell approved these changes Jun 17, 2024

View reviewed changes

airflow/models/xcom_arg.py Outdated Show resolved Hide resolved

Test index access

5d158bb

potiuk approved these changes Jun 17, 2024

View reviewed changes

uranusjr added 2 commits June 18, 2024 10:55

Typo

fcbe1f5

Tweak docs to add examples

0a4675a

uranusjr merged commit 5a3823d into apache:main Jun 18, 2024
52 checks passed

uranusjr deleted the mapped-chain branch June 18, 2024 03:36

utkarsharma2 added the type:improvement Changelog: Improvements label Jul 1, 2024

utkarsharma2 added this to the Airflow 2.10.0 milestone Jul 1, 2024

romsharon98 pushed a commit to romsharon98/airflow that referenced this pull request Jul 26, 2024

Implement XComArg concat() (apache#40172)

81e7593

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement XComArg concat() #40172

Implement XComArg concat() #40172

uranusjr commented Jun 11, 2024

hussein-awala left a comment

josh-fell left a comment •

edited

Loading

josh-fell commented Jun 13, 2024

uranusjr commented Jun 13, 2024

potiuk commented Jun 13, 2024

uranusjr commented Jun 13, 2024

potiuk commented Jun 13, 2024

uranusjr commented Jun 17, 2024

potiuk commented Jun 17, 2024

uranusjr commented Jun 17, 2024

eladkal commented Jun 17, 2024 •

edited

Loading

josh-fell left a comment

uranusjr commented Jun 17, 2024

josh-fell commented Jun 17, 2024

uranusjr commented Jun 18, 2024

Implement XComArg concat() #40172

Implement XComArg concat() #40172

Conversation

uranusjr commented Jun 11, 2024

hussein-awala left a comment

Choose a reason for hiding this comment

josh-fell left a comment • edited Loading

Choose a reason for hiding this comment

josh-fell commented Jun 13, 2024

uranusjr commented Jun 13, 2024

potiuk commented Jun 13, 2024

uranusjr commented Jun 13, 2024

potiuk commented Jun 13, 2024

uranusjr commented Jun 17, 2024

potiuk commented Jun 17, 2024

uranusjr commented Jun 17, 2024

eladkal commented Jun 17, 2024 • edited Loading

josh-fell left a comment

Choose a reason for hiding this comment

uranusjr commented Jun 17, 2024

josh-fell commented Jun 17, 2024

uranusjr commented Jun 18, 2024

josh-fell left a comment •

edited

Loading

eladkal commented Jun 17, 2024 •

edited

Loading