Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix batching for BigQueryToPostgresOperator #39233

Merged
merged 1 commit into from
Apr 27, 2024

Conversation

moiseenkov
Copy link
Contributor

Fixed batching for the following operators:

  • BigQueryToPostgresOperator
  • BigQueryToMsSqlOperator
  • BigQueryToMySqlOperator

Previously, if users specify the batch_size parameter it is used only for reading from the BigQuery, however, the writing to the database is performed with the default batch size of 1000 rows. This PR fixes that and the specified batch_size now used for both reading from BigQuery and writing to a database.

Additionally, updated the system test.

@boring-cyborg boring-cyborg bot added area:providers area:system-tests provider:google Google (including GCP) related issues labels Apr 24, 2024
@moiseenkov moiseenkov force-pushed the bigquery_to_postgres_fix_batch branch 3 times, most recently from b7ea281 to f12a935 Compare April 26, 2024 10:34
@moiseenkov moiseenkov force-pushed the bigquery_to_postgres_fix_batch branch from f12a935 to 9017c15 Compare April 26, 2024 14:58
@eladkal eladkal merged commit 78fa365 into apache:main Apr 27, 2024
40 checks passed
@prithvi-git
Copy link

Performance is still very poor. Opened new ticket #40948

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:providers area:system-tests provider:google Google (including GCP) related issues
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants