CachedValuesHistogram memory overhead #1296

Closed
mattnelson opened this issue Aug 1, 2016 · 5 comments

@mattnelson

The CachedValuesHistogram class preloads[1] 1000 instances of HDRHistogram. Profiling shows that each HDRHistogram instance uses about 41 KB of memory, which works out to roughly 41 MB just for preloading these histograms. The class will create a new histogram on a pool miss[2], so I'm wondering how the default of 1000 was decided on in #1047. Would you need 1000 unique commands in order to exhaust the pool?

[1] https://github.com/Netflix/Hystrix/blob/v1.5.3/hystrix-core/src/main/java/com/netflix/hystrix/metric/CachedValuesHistogram.java#L27-L31
[2] https://github.com/Netflix/Hystrix/blob/v1.5.3/hystrix-core/src/main/java/com/netflix/hystrix/metric/CachedValuesHistogram.java#L165-L167
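
For reference, a minimal sketch of the preloading and pool-miss pattern described above. This is not the actual CachedValuesHistogram code; the class name, pool type, and constants are assumptions for illustration:

```java
// Sketch only: a fixed-size pool of HdrHistogram instances filled up front,
// with a fresh histogram constructed whenever the pool is empty.
import org.HdrHistogram.Histogram;

import java.util.concurrent.ConcurrentLinkedQueue;

class HistogramPoolSketch {
    private static final int POOL_SIZE = 1000;        // the preload count in question
    private static final int SIGNIFICANT_DIGITS = 3;  // histogram precision discussed in this thread

    private final ConcurrentLinkedQueue<Histogram> pool = new ConcurrentLinkedQueue<>();

    HistogramPoolSketch() {
        // 1000 histograms at ~41 KB each is where the ~41 MB figure comes from.
        for (int i = 0; i < POOL_SIZE; i++) {
            pool.add(new Histogram(SIGNIFICANT_DIGITS));
        }
    }

    Histogram borrow() {
        Histogram h = pool.poll();
        // Pool miss: fall back to constructing a brand-new histogram.
        return (h != null) ? h : new Histogram(SIGNIFICANT_DIGITS);
    }
}
```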

@mattrjacobs
Contributor

These values were put in because they worked for our use case. I don't think they would necessarily be appropriate for all use cases, though. I will add 2 pieces of configuration:

  1. Size of the histogram pool
  2. Number of significant digits of each histogram

I think setting the defaults to 0 / 3 is probably the right thing to do. That way, you opt in to the decision to use the pool. Thanks for the catch on this - it was probably a bad decision to apply this change to all Hystrix consumers.
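
A rough sketch of how the two proposed knobs could be surfaced, using plain JVM system properties; the property keys below are hypothetical placeholders, not the actual Hystrix/Archaius property names:

```java
// Sketch only: hypothetical property keys, with defaults of 0 (pool disabled) and 3 significant digits.
final class HistogramPoolConfigSketch {
    // Pool size of 0 makes pooling opt-in: nothing is preloaded unless configured.
    static int poolSize() {
        return Integer.getInteger("hystrix.histogram.pool.size", 0);
    }

    // Number of significant digits for each histogram.
    static int significantDigits() {
        return Integer.getInteger("hystrix.histogram.significant.digits", 3);
    }
}
```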

The way this works is that each distribution stream (concurrency/latency) generates a new distribution on an interval. There are only a few distributions active at a time (10 is the default). When a new distribution is needed, it tries to get one from the pool. When an old distribution is no longer referenced, it is cleared and returned to the pool.
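
A minimal sketch of that lifecycle (borrow on demand, reset and return when a distribution rolls off); the method names are illustrative, not the real Hystrix API:

```java
// Sketch only: how a distribution stream might recycle histograms on a rolling interval.
import org.HdrHistogram.Histogram;

import java.util.concurrent.ConcurrentLinkedQueue;

class DistributionRecyclingSketch {
    private final ConcurrentLinkedQueue<Histogram> pool = new ConcurrentLinkedQueue<>();

    // Called when a new rolling-window distribution starts (only ~10 are live at once).
    Histogram acquire() {
        Histogram h = pool.poll();
        return (h != null) ? h : new Histogram(3);
    }

    // Called when an old distribution is no longer referenced.
    void release(Histogram h) {
        h.reset();      // clear recorded values before reuse
        pool.offer(h);  // return it so a later interval can reuse the allocation
    }
}
```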

Once this is configurable, I'll run some benchmarks to get a more accurate accounting of the tradeoffs of different pool sizes.

@mattrjacobs
Contributor

@mattnelson Rather than add configuration, I have a PR which just removes the pooling entirely: #1351. I would rather reduce complexity than add to it. Any concerns with this approach?
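
For comparison, the no-pooling approach reduces to constructing a fresh histogram per interval and letting the old one be garbage collected; a sketch of the idea, not the code in #1351:

```java
// Sketch only: no pool, just a new histogram per rolling interval.
import org.HdrHistogram.Histogram;

class NoPoolingSketch {
    Histogram newDistribution() {
        // ~41 KB allocation per interval, reclaimed by GC once the old distribution is dereferenced.
        return new Histogram(3);
    }
}
```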

@mattnelson
Author

I'm fine with either approach. Was there profiling done when this feature was introduced to warrant using the pool? I don't want to see a performance regression caused by the construction of the histograms.

@mattrjacobs
Contributor

I think at some point in the development of the metrics functionality, it was warranted. I just ran a preliminary set of JMH tests and the results looked fine without the pooling. I'll merge this in and publish the perf delta here.
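
As an illustration of the kind of JMH microbenchmark that could measure the per-construction cost pooling was meant to avoid (this is not the actual test that was run):

```java
// Sketch only: average-time benchmark for constructing a single histogram.
import org.HdrHistogram.Histogram;
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.infra.Blackhole;

import java.util.concurrent.TimeUnit;

public class HistogramConstructionBench {
    @Benchmark
    @BenchmarkMode(Mode.AverageTime)
    @OutputTimeUnit(TimeUnit.MICROSECONDS)
    public void constructHistogram(Blackhole bh) {
        // The work the pool was intended to amortize.
        bh.consume(new Histogram(3));
    }
}
```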

@mattrjacobs
Contributor

I added the jmh data for 1.5.6 here: https://docs.google.com/spreadsheets/d/1a0ERBQJZzlmVqMpuvvdSwbJuXwt0UmkPsFZ97pvgA8o/edit#gid=1200952544

No large delta in memory usage when reading metrics, so closing.
