[DNM] benchmarks against object storage #1472
Closed
I'm using this PR as a space to collect some info about running the TPC-H queries against object storage. Goals are to compare
Against storage backends
Changes
This PR creates a new binary that runs every TPC-H query while logging IOs in our objectstore reader, allowing us to examine both request sizes and request counts for each query.
The file format (Parquet or Vortex) is selectable, and the bucket is also configurable.
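As a rough illustration of the kind of bookkeeping described above, here is a minimal sketch of a per-query IO log that records the size of each range request and reports request counts and total bytes. The `IoLog` type and its `record` and `summary` methods are hypothetical names chosen for this example; they are not the types used by the binary in this PR, and the byte ranges in `main` are made-up inputs rather than measurements.

```rust
use std::collections::BTreeMap;
use std::ops::Range;

/// Hypothetical per-query IO log (illustrative only, not the PR's actual type).
#[derive(Debug, Default)]
struct IoLog {
    /// TPC-H query number -> size in bytes of each range request issued.
    requests: BTreeMap<u32, Vec<u64>>,
}

impl IoLog {
    /// Record one byte-range read issued while running `query`.
    fn record(&mut self, query: u32, range: Range<u64>) {
        let size = range.end.saturating_sub(range.start);
        self.requests.entry(query).or_default().push(size);
    }

    /// Return (request count, total bytes) for `query`, if any reads were logged.
    fn summary(&self, query: u32) -> Option<(usize, u64)> {
        self.requests
            .get(&query)
            .map(|sizes| (sizes.len(), sizes.iter().sum()))
    }
}

fn main() {
    let mut log = IoLog::default();

    // Made-up example reads; a real reader would call `record` on every request.
    log.record(1, 0..4096);
    log.record(1, 4096..1_052_672);
    log.record(2, 0..8);

    for query in log.requests.keys() {
        if let Some((count, bytes)) = log.summary(*query) {
            println!("q{query}: {count} requests, {bytes} bytes");
        }
    }
}
```

Keeping the individual request sizes, rather than only a running total, makes it possible to look at the size distribution per query as well as the counts.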
To run the test that uses S3 Express One Zone, you need to set `AWS_S3_EXPRESS=true` in your `.env` file or directly in your shell environment.