-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DM-47375: Run query_all_datasets in a single request for RemoteButler #1114
Commits on Nov 14, 2024
-
Use advanced query system in query_all_datasets
Switch from the query_datasets convenience method to the advanced query system in query_all_datasets. This lets us get the results one page at a time, which will be needed to prevent memory exhaustion when running these queries on the server.
Configuration menu - View commit details
-
Copy full SHA for 39e9d02 - Browse repository at this point
Copy the full SHA 39e9d02View commit details -
Remove with_dimension_records from query-datasets
It turns out that the query-datasets CLI was not actually using dimension records, and it will simplify the implementation to not support this.
Configuration menu - View commit details
-
Copy full SHA for 6b10874 - Browse repository at this point
Copy the full SHA 6b10874View commit details -
Restrict --order-by in query-datasets to single type
The backend for querying multiple dataset types will not support "order by", so restrict the CLI to match the implementation.
Configuration menu - View commit details
-
Copy full SHA for 8da61ab - Browse repository at this point
Copy the full SHA 8da61abView commit details -
Remove order_by from query_all_datasets
The upcoming implementation of query_all_datasets will not support order_by, so remove it. This requires modifying the query-datasets CLI to use the single dataset type query_datasets when order by needs to be supported.
Configuration menu - View commit details
-
Copy full SHA for 0a92c87 - Browse repository at this point
Copy the full SHA 0a92c87View commit details -
Make streaming query logic reusable
In preparation for implementing query_all_datasets on the server, make the streaming response and timeout logic from the existing query handler re-usable.
Configuration menu - View commit details
-
Copy full SHA for f3e2e9d - Browse repository at this point
Copy the full SHA f3e2e9dView commit details -
Move query streaming logic to its own file
After the refactor in the previous commit, this is somewhat independent of the query routes.
Configuration menu - View commit details
-
Copy full SHA for 7aaee6c - Browse repository at this point
Copy the full SHA 7aaee6cView commit details -
Move query streaming client code to its own file
This will be shared by the RemoteButler query_all_datasets implementation in an upcoming commit.
Configuration menu - View commit details
-
Copy full SHA for 5c4d54d - Browse repository at this point
Copy the full SHA 5c4d54dView commit details -
Define a dataclass for query_all_datasets args
This will be used in an upcoming commit to prevent excessive duplication of function parameters between implementations of query_all_datasets.
Configuration menu - View commit details
-
Copy full SHA for f5aa116 - Browse repository at this point
Copy the full SHA f5aa116View commit details -
Add server-side implementation of query_all_datasets
query_all_datasets can potentially involve hundreds or thousands of separate dataset queries. We don't want clients slamming the server with that many HTTP requests, so add a server-side endpoint that can handle these queries in a single request.
Configuration menu - View commit details
-
Copy full SHA for ad26503 - Browse repository at this point
Copy the full SHA ad26503View commit details -
Add back dimension records to QueryDatasets
It turns out the QueryDatasets class is shared by multiple CLI scripts, some of which need dimension records included. So add back `with_dimension_records` to the internal implementation of query_all_datasets.
Configuration menu - View commit details
-
Copy full SHA for c9fbdb9 - Browse repository at this point
Copy the full SHA c9fbdb9View commit details -
Configuration menu - View commit details
-
Copy full SHA for d40596b - Browse repository at this point
Copy the full SHA d40596bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ffbb4b - Browse repository at this point
Copy the full SHA 3ffbb4bView commit details