Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
47093: ycsb: only use column families when beneficial r=nvanbenschoten a=nvanbenschoten This commit updates the `workload/ycsb` load generator to default its `--families` flag (if unspecified) to the value that is most beneficial for performance of that workload. The determination of whether a workload benefits from column families or not is based on the following experiments: #### ycsb/[A-F]/nodes=3 (3x n1-standard-8 VMs): | workload | --families=false | --families=true | better with families? | |----------|-----------------:|----------------:|-----------------------| | A | 11,743.5 | 17,760.5 | true | | B | 35,232.3 | 32,982.2 | false | | C | 45,454.7 | 44,112.5 | false | | D | 36,091.0 | 35,615.1 | false | | E | 5,774.9 | 2,604.8 | false | | F | 4,933.1 | 8,259.7 | true | #### ycsb/[A-F]/nodes=3/cpu=32 (3x n1-standard-32 VMs): | workload | --families=false | --families=true | better with families? | |----------|-----------------:|----------------:|-----------------------| | A | 14,144.1 | 27,179.4 | true | | B | 96,669.6 | 104,567.5 | true | | C | 137,463.3 | 131,953.7 | false | | D | 103,188.6 | 95,285.7 | false | | E | 10,417.5 | 7,913.6 | false | | F | 5,782.3 | 15,532.1 | true | This addresses the recent regression we saw on `ycsb/E/nodes=3` due to 9124c4f. There's an argument to be made that this determination should go in the roachtest instead to avoid complicating the `--families` flag. I've decided not to do that, but would be happy to discuss if others feel like this is problematic. Co-authored-by: Nathan VanBenschoten <nvanbenschoten@gmail.com>
- Loading branch information