-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FR: GCP Component for Arbitrary BQ Op without export job #2640
Comments
I think once #2606 is fixed then you should be able to not specify a GCS path and thus no extract job is run. |
Makes sense, I did try that. The component doc should then be updated to indicate the component isn't solely for BQ query + extract job use case. |
More generally speaking, once #2616 is released, customizing the query job should be doable and support the use cases you described. That being said, I do wonder if there's a balance between having a component being able to everything versus components that do a limited set of actions. |
In general I think it's more clear / readable to have a bigquery_query_op(...) and bigquery_query_to_gcs_op(...) or biqquery_query_to_export_op(...). It is likely that BQ to GCS is the most common use case anyway, so there is an argument for leaving as is. There are more options that users might use in the future that could overload a single op, such as export to various formats like JSON. |
+1 Also |
I will look in to this. /assign |
@Ark-kun as I understand it we would like to have the following three components:
is this correct? |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
This issue has been automatically closed because it has not had recent activity. Please comment "/reopen" to reopen it. |
lightgbm version defined as 3.3.2, but runtime YAML was not updated and only supports major version 2. https://github.com/kserve/kserve/blob/master/python/lgbserver/setup.py#L41 Signed-off-by: alexagriffith <agriffith50@bloomberg.net> Signed-off-by: alexagriffith <agriffith50@bloomberg.net>
FR for a more generic BQ Op component that doesn't try to run an export job every time. Use cases include intermediary steps in BQ. Also, AutoML Tables pipelines, where AutoML tables expects entire tables in BQ and does not support a select statement. It would be nice to provide AutoML tables with the temp table or create a separate table vs. pull data out to GCS.
Current component:
https://github.com/kubeflow/pipelines/blob/master/components/gcp/bigquery/query/README.md
The text was updated successfully, but these errors were encountered: