Add IMDB(JOB) Benchmark [2/N] (imdb queries) #12529
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Which issue does this PR close?
Partially closes #12311.
.csv
,.parquet
).query_id
5
indicatesquery
2a
,Rationale for this change
imdb.slt
, just like what we did totpch.slt
.Unlike TPC-H, IMDB dataset is not generated and it's fixed sized, so no scaling factor and we don't need another docker container to generate data and answers.
I have also cross-checked answers in csv files from https://github.com/duckdb/duckdb/tree/main/benchmark/imdb/answers .
What changes are included in this PR?
IMDB(JOB) queries don't have incremental
query_id
, so I hard-coded the benchmark runnerquery_id
(1,2,3,4, ... 113 in integer) to actual IMDB query name (1a, 1b, 1c, 1d, 2a, ... 33c in string, there is no pattern) mapping via lots ofif
.Currently, I've only add SLT for:
Are these changes tested?
Yes, please check
test_files/imdb
for details.Are there any user-facing changes?
No.