-
Notifications
You must be signed in to change notification settings - Fork 196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support OpenLineage in spark-3.x-bigquery connectors #1212
Support OpenLineage in spark-3.x-bigquery connectors #1212
Conversation
codelixir
commented
Apr 15, 2024
- Add openlineage properties to Spark31BigQueryTable class
- Add BigQueryRelationProvider as an abstract class to v2 module, to be extended by BaseBigQuerySource (parent class of all the Spark BigQuery Table Provider classes).
/gcbrun |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add an integration test testing that the lineage events are created.
...uery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/BigQueryRelationProvider.java
Outdated
Show resolved
Hide resolved
...igquery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/v2/BaseBigQuerySource.java
Show resolved
Hide resolved
...uery-dsv2-common/src/main/java/com/google/cloud/spark/bigquery/BigQueryRelationProvider.java
Show resolved
Hide resolved
/gcbrun |
…gquery-connector-common
I have moved the logic to the common module, as discussed, so that both dsv1 and dsv2 connectors call the same method internally. |
/gcbrun |
1 similar comment
/gcbrun |
Hey @codelixir, thank you for your contribution! We appreciate your effort. Have you thought about leveraging the |
Hi @ddebowczyk92 , thanks for the input! We try to keep the DataSource v2 connectors Scala agnostic in order to simplify the usage for customers due to the incompatibility between Scala 2.12 and 2.13. Once this is PR is done, we can think how to incorporate the interface into the connector. |
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java
Show resolved
Hide resolved
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java
Outdated
Show resolved
Hide resolved
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
/gcbrun |
...rc/test/java/com/google/cloud/spark/bigquery/integration/OpenLineageIntegrationTestBase.java
Outdated
Show resolved
Hide resolved
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
/gcbrun |
/gcbrun |
558f18f
into
GoogleCloudDataproc:master
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>
Signed-off-by: Pahulpreet Singh <pahulpreets@google.com>