Skip to content

Conversation

@mobuchowski
Copy link
Contributor

This PR extends OpenLineage integration in Airflow to support root parent lineage metadata via the newly added root field in the ParentRunFacet.

Previously, consumers had to recursively traverse parent references to determine root job in lineage hierarchy. With this update, consumers can immediately identify the DAG-level root job and run for any depth of children task.
More on the underlying OpenLineage feature: https://docs.google.com/document/d/1JypxrJc0MkrarzYfMopK-TF8QiPpjPnk2je4mgq9e4k

This updates both openlineage provider as few others that generate OL events by themselves.

@amoghrajesh
Copy link
Contributor

@mobuchowski test failures on this one. Is it important before today's provider wave?

@mobuchowski mobuchowski force-pushed the handle-root-parent-information branch from b800438 to fe603ef Compare April 16, 2025 07:49
@mobuchowski
Copy link
Contributor Author

@amoghrajesh looking at this right now.

@mobuchowski
Copy link
Contributor Author

@amoghrajesh rebase fixed those issues.

@mobuchowski mobuchowski force-pushed the handle-root-parent-information branch from fe603ef to 39966e0 Compare April 23, 2025 10:43
@mobuchowski mobuchowski force-pushed the handle-root-parent-information branch 5 times, most recently from 1c8d095 to def2bf1 Compare April 24, 2025 09:01
Copy link
Contributor

@kacpermuda kacpermuda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for all the changes, LGTM.

@mobuchowski
Copy link
Contributor Author

@amoghrajesh want to take another look at that one? @kacpermuda reviewed it and approved.

@potiuk
Copy link
Member

potiuk commented May 5, 2025

needs conflict resolving

@mobuchowski mobuchowski force-pushed the handle-root-parent-information branch 2 times, most recently from c3511a3 to 7a5a871 Compare May 5, 2025 17:27
@mobuchowski
Copy link
Contributor Author

@potiuk done.

Copy link
Member

@potiuk potiuk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One NIT - with version consistency

Signed-off-by: Maciej Obuchowski <maciej.obuchowski@datadoghq.com>
@mobuchowski mobuchowski force-pushed the handle-root-parent-information branch from 7a5a871 to 9760a17 Compare May 5, 2025 19:36
@mobuchowski mobuchowski merged commit 3b3dbe9 into apache:main May 5, 2025
69 checks passed
mvfc pushed a commit to mvfc/airflow that referenced this pull request May 6, 2025
Signed-off-by: Maciej Obuchowski <maciej.obuchowski@datadoghq.com>
potiuk added a commit to potiuk/airflow that referenced this pull request May 6, 2025
The apache#49237 did not bump openlineage provider. This PR fixes it.
potiuk added a commit that referenced this pull request May 6, 2025
The #49237 did not bump openlineage provider. This PR fixes it.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants