Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request]: Add support for BigQuery Storge Write API in Python #21961

Closed
esciara opened this issue Jun 21, 2022 · 2 comments · Fixed by #25685
Closed

[Feature Request]: Add support for BigQuery Storge Write API in Python #21961

esciara opened this issue Jun 21, 2022 · 2 comments · Fixed by #25685

Comments

@esciara
Copy link

esciara commented Jun 21, 2022

What would you like to happen?

Much like #20687 to read to BigQuery, The Java SDK contains an implementation which uses the BigQuery Storage API to write to BigQuery. We should implement the same for Python.

In our case, we would like to use it instead of using the FILE_LOADS ingestion method in streaming mode, where errors during the loading process are not propagated to the pipeline. We are currently using it as we do not need STREAMING_INSERT features, and hence want to save on cost.

Issue Priority

Priority: 3

Issue Component

Component: io-py-gcp

@rizenfrmtheashes
Copy link

I took a look at this, and it seems like the bigquery storage api has a straightforward jsonstreamwriter but for java. for python, it only exists as protobufs. Hence why the bigquery storage write api for dataflow in python is missing, but exists in java. That api will likely need to be made before the dataflow functions can be made by the core contributors group. Sad that this blog post made it seem like it was in GA for everything.

@kennknowles
Copy link
Member

@chamikaramj FYI for if/when this can be done via xlang

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment