More Flexible Metrics strategy #854

mattrjacobs · 2015-08-06T18:28:49Z

Currently, HystrixCommandMetrics / HystrixThreadPoolMetrics / HystrixCollapserMetrics are all concrete classes which proscribe a single metrics strategy. In each case, they do in-memory summarization of counts and latencies. This is generally fine, but it provides no flexibility if anything else is desired. See #333 for an example.

mattrjacobs · 2015-08-06T18:29:37Z

#843 should solve this issue. This will be the first commit after I branch off 1.4.x

mattrjacobs · 2015-08-25T04:54:20Z

I've spent the last 2 weeks prototyping using this change, and that has refined my thinking. I'm leaving this open until I get a few more tasks done.

I would like Hystrix 1.5.0 to support multiple modes of operation w.r.t metrics. Here are a few examples:

A) Work as-is today, which is to aggregate into in-memory data structures for commands / threadpools / collapsers. Circuit-breakers are based on a rolling window of command outcomes. A metrics publisher plugin gets the metrics off-box on some sort of interval.

B) Shift as much metrics aggregation off-box as possible. Per-request, flush all state that got built over a request (command executions / collapser executions). Provide a way to access longer-lived metrics, such as thread pool / queue utilization or concurrency experienced by a command. The only reason to keep on-box data structures is for circuit-breaking. This has the advantage of never losing any data. Interesting data can be directly computed by the off-box aggregator, such as a true histogram of command latency / thread-pool utilization / interarrival time of a command.

C) Keep metrics aggregation on-box, but allow for different representations. Circuit-breaking should still be supported, so a rolling window of command outcomes should still be there, but everything else is up for grabs. Collapser metrics may be dropped, for instance. Or you could store each Command / Collapser event in a List and publish that List periodically for downstream processing.

From this, I'm creating some concrete tasks to get done for 1.5.0

Add collapser executions to request state (supports B)
Add command startTime and distinct latency info to request state (supports B)
Create text and binary representation of full Hystrix data on per-request-basis (supports B)
Add semantic metric type to metrics (Rolling Sum / Cumulative Sum / Snapshot / etc). This allows for more generic code to be written in each of the metrics publisher plugins. (supports A, B, C)
Give HystrixRollingNumber and HystrixRollingPercentile a way to share the logic for bucket-rolling (supports A, C)
Cache commonly-read values for HystrixRollingPercentile (A, C)
Tie HealthCounts to bucket-rolling for HystrixCommandMetrics. I don't see a ton of value in allowing these to be computed independently. (A, B, C)
Evaluate performance impact of a background thread performing the bucket-rolling algorithm. This would save every metrics write/read from having to check the current time to determine if it should do a bucket-roll. (A, B, C)

If there are other cases to consider, or any concerns the above does not address, I'd love to hear them

mattrjacobs added enhancement hystrix-core labels Aug 6, 2015

mattrjacobs added this to the 1.5.x milestone Aug 6, 2015

mattrjacobs closed this as completed Aug 8, 2015

mattrjacobs reopened this Aug 25, 2015

mattrjacobs mentioned this issue Oct 14, 2015

Hystrix Metric Streams for Asynchronous Consumers #943

Closed

mattrjacobs mentioned this issue Dec 11, 2015

Added threshold under which errorPercentage is kept at zero #1017

Closed

mattrjacobs mentioned this issue Jan 12, 2016

Hystrix metrics as streams #1047

Merged

mattrjacobs closed this as completed Jan 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More Flexible Metrics strategy #854

More Flexible Metrics strategy #854

mattrjacobs commented Aug 6, 2015

mattrjacobs commented Aug 6, 2015

mattrjacobs commented Aug 25, 2015

More Flexible Metrics strategy #854

More Flexible Metrics strategy #854

Comments

mattrjacobs commented Aug 6, 2015

mattrjacobs commented Aug 6, 2015

mattrjacobs commented Aug 25, 2015