Description
I hit a LoadJob bug in BigQueryInsertJobOperator after upgrading from Airflow 2.2.5 to 2.3.2, and tried to use GCSToBigQueryOperator instead.
Use case/motivation
Most BigQuery-related operators recommend using insert_job(), but GCSToBigQueryOperator (gcs_to_bq_operator) still uses run_load().
GCSToBigQueryOperator still builds its job config from a large number of parameters, unlike BigQueryInsertJobOperator, which takes a single clean configuration that is more straightforward and easier to follow.
I would like to know whether the plan is to keep GCSToBigQueryOperator and replace run_load with insert_job, or to use BigQueryInsertJobOperator instead. If run_load were replaced with insert_job in GCSToBigQueryOperator, all BigQuery-related operators would be more consistent.
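To make the comparison concrete, here is a minimal sketch of how a handful of GCSToBigQueryOperator-style parameters could be folded into the single `configuration` dict that BigQueryInsertJobOperator accepts. The helper name `build_load_configuration` and its defaults are hypothetical, chosen for illustration only, and this is not the provider's actual implementation. The keys under `load` follow the BigQuery REST API load job configuration.

```python
from typing import Any, Dict, List


def build_load_configuration(
    bucket: str,
    source_objects: List[str],
    destination_project_dataset_table: str,
    source_format: str = "CSV",
    write_disposition: str = "WRITE_EMPTY",
    create_disposition: str = "CREATE_IF_NEEDED",
    skip_leading_rows: int = 0,
    autodetect: bool = True,
) -> Dict[str, Any]:
    """Hypothetical helper: map operator-style arguments to a load job config."""
    # Assumes a fully qualified "project.dataset.table" destination.
    project_id, dataset_id, table_id = destination_project_dataset_table.split(".")

    load: Dict[str, Any] = {
        # Each GCS object becomes a gs:// URI.
        "sourceUris": [f"gs://{bucket}/{obj}" for obj in source_objects],
        "destinationTable": {
            "projectId": project_id,
            "datasetId": dataset_id,
            "tableId": table_id,
        },
        "sourceFormat": source_format,
        "writeDisposition": write_disposition,
        "createDisposition": create_disposition,
        "autodetect": autodetect,
    }
    # skipLeadingRows is only set for CSV sources in this sketch.
    if source_format == "CSV":
        load["skipLeadingRows"] = skip_leading_rows

    return {"load": load}


configuration = build_load_configuration(
    bucket="my-bucket",
    source_objects=["data/a.csv", "data/b.csv"],
    destination_project_dataset_table="my-project.my_dataset.my_table",
    skip_leading_rows=1,
)

# With Airflow and the Google provider installed, the dict could be passed as:
# from airflow.providers.google.cloud.operators.bigquery import BigQueryInsertJobOperator
# load_task = BigQueryInsertJobOperator(task_id="gcs_to_bq", configuration=configuration)
```

The bucket, object, and table names above are placeholders. The point of the sketch is that everything the load job needs lives in one dict, which is the consistency this request is asking for.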
Related issues
No response
Are you willing to submit a PR?
Code of Conduct