store: reduce memory footprint for chunks queries #3937
Conversation
Signed-off-by: Vladimir Kononov <krya-kryak@users.noreply.github.com>
Force-pushed from 4eaa961 to 8c0d7c6.
Linter failed with: `saviour` is a misspelling of `savior` (misspell) Signed-off-by: Vladimir Kononov <krya-kryak@users.noreply.github.com>
Force-pushed from d59543d to 6818d9a.
pkg/store/bucket.go (outdated)

```diff
 		chunks: map[uint64]chunkenc.Chunk{},
 	}
 }
 
-// addPreload adds the chunk with id to the data set that will be fetched on calling preload.
-func (r *bucketChunkReader) addPreload(id uint64) error {
+func (r *bucketChunkReader) Chunk(id uint64) (chunkenc.Chunk, error) {
```
The `Chunk` and `Close` methods are unchanged; I just moved the exported functions closer to the receiver definition.
Thanks!
Actually, scratch that. First of all, I've broken `Chunk()` while changing the preload flow. Secondly, `Chunk()` is no longer used, so I've deleted it altogether.
```diff
@@ -1318,8 +1319,6 @@ func benchBucketSeries(t testutil.TB, skipChunk bool, samplesPerSeries, totalSer
 
 	if !t.IsBenchmark() {
-		if !skipChunk {
-			// Make sure the pool is correctly used. This is expected for 200k numbers.
-			testutil.Equals(t, numOfBlocks, int(st.chunkPool.(*mockedPool).gets.Load()))
```
Checking the number of slabs allocated by the chunk pool is now non-trivial, as it depends on the time series.
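For context on the deleted assertion: the test pool counts `Get` calls so the test can assert how many slabs were handed out. A minimal sketch of such a counting wrapper (`countingPool` and its fields are illustrative names, not the actual `mockedPool` from the Thanos test suite):

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// countingPool is an illustrative stand-in for a test's mocked slab pool:
// it hands out byte slices of the requested capacity and counts Get calls,
// so a test can assert how many slabs were allocated.
type countingPool struct {
	gets atomic.Int64
}

// Get returns an empty slice with the requested capacity and bumps the counter.
func (p *countingPool) Get(capacity int) []byte {
	p.gets.Add(1)
	return make([]byte, 0, capacity)
}

func main() {
	p := &countingPool{}
	for i := 0; i < 3; i++ {
		_ = p.Get(64 * 1024)
	}
	// Once variable-size chunks are packed into shared slabs, this count
	// depends on the data itself, which is why a fixed assertion on it
	// no longer works.
	fmt.Println("gets:", p.gets.Load()) // gets: 3
}
```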
Failing e2e test
Thanks for your work! It seems like you have mixed up the order of two benchmarks in your PR's description. The times have decreased, not increased, right? Also, I was able to reproduce the CI failure locally :( Will attach more info once I am able to find something
CHANGELOG.md (outdated)

```diff
@@ -19,8 +19,11 @@ We use _breaking :warning:_ to mark changes that are not backward compatible (re
 ### Fixed
 - [#3204](https://github.com/thanos-io/thanos/pull/3204) Mixin: Use sidecar's metric timestamp for healthcheck.
 - [#3922](https://github.com/thanos-io/thanos/pull/3922) Fix panic in http logging middleware.
+- [#3937](https://github.com/thanos-io/thanos/pull/3937) Store: Fix race condition in chunk pool.
+- [#3937](https://github.com/thanos-io/thanos/pull/3937) Testutil: Fix race condition encountered during benchmarking.
```
We typically don't add non user-facing changes here :P
> It seems like you have mixed up the order of two benchmarks in your PR's description. The times have decreased, not increased, right? Also, I was able to reproduce the CI failure locally :( Will attach more info once I am able to find something
time/op did in fact increase in some benchmarks (and went down a bit in the others); it's mostly allocated bytes/op I was aiming at here. I'll look into the possibility of shaving off some CPU time (larger slabs, probably).
pkg/store/bucket.go (outdated)

```diff
-	c, ok := r.chunks[id]
-	if !ok {
-		return nil, errors.Errorf("chunk with ID %d not found", id)
+func (r *bucketChunkReader) savior(b []byte) ([]byte, error) {
```
I find the terminology here interesting. Maybe a simple name like `allocate()` would suit it more? I'm not sure who/what is being saved from what 😄

Also, we could potentially save even more allocations, because this only checks the last `r.chunkBytes` slab. So it should perform worse with allocations of sizes:

`42 12345 1 2 3 4 5`

i.e. where the sizes are decreasing, because we will always allocate more, whereas we could potentially store them at the beginning, unless I am mistaken. Maybe it would be worth making this change and putting together a small benchmark to see if it optimizes things even more?
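The limitation being described can be sketched as a toy bump allocator that only ever appends to the most recent slab. This is an illustrative simplification (tiny slab size, invented `lastSlabAlloc` type), not the actual `bucketChunkReader` code:

```go
package main

import "fmt"

const slabSize = 16 // tiny slab for illustration; the PR uses 64KiB slabs

// lastSlabAlloc only ever checks the free tail of the newest slab,
// mirroring the behaviour discussed in this review thread.
type lastSlabAlloc struct {
	slabs [][]byte
}

// alloc reserves n bytes, opening a new slab whenever the request does
// not fit in the tail of the last one. Earlier tails are never revisited.
func (a *lastSlabAlloc) alloc(n int) []byte {
	if len(a.slabs) == 0 || slabSize-len(a.slabs[len(a.slabs)-1]) < n {
		a.slabs = append(a.slabs, make([]byte, 0, slabSize))
	}
	last := len(a.slabs) - 1
	off := len(a.slabs[last])
	a.slabs[last] = a.slabs[last][:off+n]
	return a.slabs[last][off : off+n]
}

func main() {
	a := &lastSlabAlloc{}
	for _, n := range []int{12, 6, 3} {
		a.alloc(n)
	}
	// The 3-byte request went into slab 1 even though slab 0 still had
	// 4 free bytes: with decreasing sizes, tails are left wasted.
	fmt.Println("slabs:", len(a.slabs))                // slabs: 2
	fmt.Println("free tail in slab 0:", slabSize-12)   // free tail in slab 0: 4
}
```

A first-fit variant that scans all slabs would have placed the 3-byte request into slab 0, at the cost of scanning on every allocation.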
Yes, there is a way to optimize this further, as currently slab "tails" are wasted. This would potentially save up to max(raw chunk / subchunk size) - 1 bytes per slab (slab size is set to 64k in this PR). What is that, 3200 bytes? If so, the maximum gain is 4.9% per slab.

However, for the benchmarks we have (filled with random values), the average size tends to be less than half of that, which would cut our estimated maximum profit to, say, 1500 bytes per slab (2.3%). Given that not every case would be that bad, I would estimate the actual memory waste to be around 1.5%.

Moreover, we would need to be lucky and only write chunks that are small enough to fit into the wasted space, so we would not be able to reclaim all of it. All in all, I would consider it a win if we reduced that waste from 1.5% to 1%. So, a 5MB saving on a 1GB allocation. Do we have bigger fish to fry?
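The percentages quoted above can be reproduced directly; this is just the arithmetic, assuming the 64KiB slab size from the PR and the ~3200-byte maximum chunk size estimated in the comment:

```go
package main

import "fmt"

func main() {
	const slabSize = 64 * 1024 // slab size used in this PR
	maxChunk := 3200.0         // rough upper bound on a raw chunk, per the comment
	avgWaste := 1500.0         // assumed average tail waste with random data

	// Worst case: a slab's tail is one byte too small for the biggest chunk.
	fmt.Printf("worst-case tail waste: %.1f%%\n", (maxChunk-1)/slabSize*100)
	fmt.Printf("estimated average waste: %.1f%%\n", avgWaste/slabSize*100)
	// worst-case tail waste: 4.9%
	// estimated average waste: 2.3%
}
```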
Makes sense; also, one thing at a time (:
This reverts commit 194d234. Signed-off-by: Vladimir Kononov <krya-kryak@users.noreply.github.com>
Thanks a lot! Solid work, some nits only. Otherwise looks generally good 💪🏽
pkg/store/bucket.go (outdated)

```diff
 	return nil
 }
 
+// appPreload adds the chunk with id to the data set that will be fetched on calling preload.
```
Suggested change:

```diff
-// appPreload adds the chunk with id to the data set that will be fetched on calling preload.
+// addPreload adds the chunk with id to the data set that will be fetched on calling preload.
```
Can you update this with the `i,j` meanings too?
Done. Also, renamed `addPreload()` / `preload()`, as nothing is preloaded anymore.
Force-pushed from 9cddbfd to 18d9fed.
Force-pushed from 18d9fed to b6379f7, then to 91a52a4.
LGTM, all comments have been addressed. Is this still a WIP, @krya-kryak? I would like to merge this nice optimization. Plus, I couldn't reproduce the e2e test failures anymore, so they are either flaky or the GH Actions runners had some issues.
@GiedriusS, thank you. I've set WIP with thoughts of optimizing CPU usage (#3937 (comment)). Unfortunately, I was not able to dedicate time to it this week :(
Given that most, if not all, CPU usage happens on the Thanos Query side, and the increase here is so minuscule in comparison with what happens there, I think that merging this as-is should be OK. But let's wait for the opinion of others.
Force-pushed from cc48150 to 4b42644.
Looks amazing, thanks! LGTM 💪🏽
Changes

- Fixed ~~two data races~~ one data race found during work on the main part.

Verification

A `DownsampledBlockSeries` benchmark was added. Note the reduction in allocated memory amount.
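For readers unfamiliar with how such memory reductions are verified: Go benchmarks report allocated bytes and allocations per operation when asked to. Below is a minimal hedged sketch of that mechanism; `queryChunks` is a hypothetical stand-in, not the actual `DownsampledBlockSeries` setup from this PR:

```go
package main

import (
	"fmt"
	"testing"
)

// queryChunks simulates a chunk-producing code path; it is purely
// illustrative and allocates one small slice per "chunk".
func queryChunks(n int) [][]byte {
	out := make([][]byte, 0, n)
	for i := 0; i < n; i++ {
		out = append(out, make([]byte, 120))
	}
	return out
}

func main() {
	// testing.Benchmark lets us run a benchmark function outside `go test`.
	res := testing.Benchmark(func(b *testing.B) {
		b.ReportAllocs() // record allocs/op and B/op, as `go test -benchmem` would
		for i := 0; i < b.N; i++ {
			_ = queryChunks(100)
		}
	})
	fmt.Println("allocs/op:", res.AllocsPerOp())
	fmt.Println("bytes/op:", res.AllocedBytesPerOp())
}
```

Comparing these per-op numbers before and after a change is how a reduction in allocated memory, like the one claimed in this PR, is demonstrated.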