You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: benchmark/README.md
+86-31Lines changed: 86 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
## Overview
2
2
3
-
This benchmark tool executes queries derived from TPC-H against both Snowflake and Embucket with cache-clearing operations to ensure clean, cache-free performance measurements. For Snowflake, it uses warehouse suspend/resume operations. For Embucket, it restarts the Docker container before each query to eliminate internal caching. It provides detailed timing metrics including compilation time, execution time, and total elapsed time.
3
+
This benchmark tool executes queries from multiple benchmark suites (TPC-H, ClickBench, TPC-DS) against both Snowflake and Embucket with cache-clearing operations to ensure clean, cache-free performance measurements. For Snowflake, it uses warehouse suspend/resume operations. For Embucket, it restarts the Docker container before each query to eliminate internal caching. It provides detailed timing metrics including compilation time, execution time, and total elapsed time.
4
4
5
5
## TPC Legal Considerations
6
6
@@ -14,9 +14,12 @@ Throughout this document and when talking about these benchmarks, you will see t
14
14
15
15
## Features
16
16
17
+
-**Multiple Benchmark Types**: Supports TPC-H, ClickBench, and TPC-DS benchmark suites
17
18
-**Cache Isolation**:
18
19
-**Snowflake**: Suspends and resumes warehouse before each query
19
20
-**Embucket**: Restarts Docker container before each query to clear internal cache
21
+
-**Flexible Caching Options**: Can run with or without cache clearing (`--no-cache` flag)
22
+
-**Command Line Interface**: Full CLI support for system selection, benchmark type, and run configuration
20
23
-**Result Cache Disabled**: Ensures no result caching affects benchmark results
21
24
-**Comprehensive Metrics**: Tracks compilation time, execution time, and row counts
22
25
-**CSV Export**: Saves results to CSV files for further analysis
**Current Behavior**: By default, the benchmark runs **only Embucket** benchmarks for 3 iterations. To run both Snowflake and Embucket with comparisons, you need to modify the `__main__` section in `benchmark.py` to call `run_benchmark(i + 1)` instead of `run_embucket_benchmark(i + 1)`.
parser.add_argument("--dataset-path", help="Override the DATASET_PATH environment variable")
438
441
parser.add_argument("--no-cache", action="store_true", help="Disable caching (force warehouse suspend and USE_CACHED_RESULT=False for Snowflake, force container restart for Embucket)")
0 commit comments