-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BigQuery Storage Write API SchemaTransform wrapper for Python SDK #24783
BigQuery Storage Write API SchemaTransform wrapper for Python SDK #24783
Conversation
Codecov Report
@@ Coverage Diff @@
## master #24783 +/- ##
==========================================
+ Coverage 72.95% 73.04% +0.09%
==========================================
Files 745 742 -3
Lines 99174 98942 -232
==========================================
- Hits 72353 72276 -77
+ Misses 25455 25303 -152
+ Partials 1366 1363 -3
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
…e_wrapper_python marge timestamp microsecond precision PR
R: @chamikaramj |
Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control |
21adb6f
to
bc17b47
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks. Looks great. Just nits.
...o/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/BigQueryUtils.java
Show resolved
Hide resolved
@@ -367,6 +367,32 @@ def discover(expansion_service): | |||
inputs=proto_config.input_pcollection_names, | |||
outputs=proto_config.output_pcollection_names) | |||
|
|||
@staticmethod | |||
def discover_one(expansion_service, name): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about "discover_config()" ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sure SGTM
Did you try triggering the Jenkins integration test suites for the test with this PR ? |
@chamikaramj let me know if the mock service unit test is along the lines of what you were thinking |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks. I meant to try test_xlang_storage_write using a Jenkins trigger.
_LOGGER.info( | ||
"Created dataset %s in project %s", self.dataset_id, self.project) | ||
|
||
self.expansion_service = ('localhost:%s' % os.environ.get('EXPANSION_PORT')) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should be using this expansion service when running the pipeline ?
Also, does this service already get created ?
Run XVR_PythonUsingJava_Dataflow PostCommit |
Closing this PR in favor of #25521 as this one has gotten too messy. |
Implementing a wrapper for Python SDK that uses the Storage API SchemaTransform (#23988) to write to BigQuery
Fixes #21961