Skip to content

Conversation

@kacpermuda
Copy link
Contributor

@kacpermuda kacpermuda commented Mar 13, 2025

closes: #47488

Prevent loss of information when using OpenLineage with Snowflake that is currently happening. When multiple query ids are encountered, pair of dedicated OL events will be sent for each query_id (with execution metadata whenever available).

Not sure how we should name OpenLineage job for each of this Snowflake query. For job namespace we can use the same OpenLineage namespace as Airflow is using, but for job name ? For now I wrote snowflake_query:{query_id} - but I have no idea if it makes sense. We agreed on OpenLineage committer sync to go with job_name = {dag_id}.{task_id}.query.{counter} and this is how it's implemented in this PR.

cc @mobuchowski


^ Add meaningful description above
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.

@boring-cyborg boring-cyborg bot added area:providers provider:snowflake Issues related to Snowflake provider labels Mar 13, 2025
@kacpermuda kacpermuda force-pushed the feat-ol-snowflake-child-events branch 3 times, most recently from fd96163 to fb983ee Compare March 14, 2025 15:56
@kacpermuda kacpermuda marked this pull request as ready for review March 14, 2025 16:01
@kacpermuda kacpermuda marked this pull request as draft March 18, 2025 13:07
@kacpermuda
Copy link
Contributor Author

Let's wait for #47909 to be merged, so that I can adjust tests.

@kacpermuda kacpermuda force-pushed the feat-ol-snowflake-child-events branch from fb983ee to 20ea002 Compare March 18, 2025 16:54
@kacpermuda kacpermuda marked this pull request as ready for review March 20, 2025 12:12
@kacpermuda kacpermuda force-pushed the feat-ol-snowflake-child-events branch 3 times, most recently from 94094b2 to 05c6058 Compare March 20, 2025 14:14
@eladkal eladkal requested a review from mobuchowski March 23, 2025 18:17
@kacpermuda kacpermuda force-pushed the feat-ol-snowflake-child-events branch from 05c6058 to 1623b48 Compare March 27, 2025 13:09
@kacpermuda kacpermuda force-pushed the feat-ol-snowflake-child-events branch from 1623b48 to 179605e Compare March 31, 2025 11:45
@mobuchowski mobuchowski merged commit dce4ab4 into apache:main Apr 1, 2025
153 checks passed
@kacpermuda kacpermuda deleted the feat-ol-snowflake-child-events branch April 2, 2025 08:41
nailo2c pushed a commit to nailo2c/airflow that referenced this pull request Apr 4, 2025
diogotrodrigues pushed a commit to diogotrodrigues/airflow that referenced this pull request Apr 6, 2025
simonprydden pushed a commit to simonprydden/airflow that referenced this pull request Apr 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:providers provider:snowflake Issues related to Snowflake provider

Projects

None yet

Development

Successfully merging this pull request may close these issues.

OpenLineage can silently lose Snowflake query_ids and can't support multiple query_ids

2 participants