
Produce metrics for ThreadPoolExecutors #947

Status: Closed
batsatt opened this issue on Aug 23, 2019 · 5 comments
Labels: enhancement (New feature or request), MP, P3, SE

@batsatt (Contributor) commented on Aug 23, 2019:

At a minimum, create metrics for pool sizes and task counts, and possibly active threads (which is more expensive to read). Additional metrics can be created for known subtypes.

Executors should not be modified; rather, metrics should be pulled from them so that any type can be supported. This implies some form of registration (e.g. at the point of installation via a builder, etc.), as sketched below.
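
Below is a minimal sketch of that pull model (class and method names are illustrative, not Helidon API): a small sampler wraps a plain java.util.concurrent.ThreadPoolExecutor and reads its state only when asked, so the executor itself stays unmodified.

    import java.util.concurrent.LinkedBlockingQueue;
    import java.util.concurrent.ThreadPoolExecutor;
    import java.util.concurrent.TimeUnit;

    // Illustrative sampler: reads executor state on demand without modifying the executor.
    final class ExecutorPoolSampler {

        private final ThreadPoolExecutor executor;

        ExecutorPoolSampler(ThreadPoolExecutor executor) {
            this.executor = executor;
        }

        int poolSize()        { return executor.getPoolSize(); }
        int activeThreads()   { return executor.getActiveCount(); }  // the "more expensive" reading noted above
        int queueSize()       { return executor.getQueue().size(); }
        long completedTasks() { return executor.getCompletedTaskCount(); }

        public static void main(String[] args) {
            ThreadPoolExecutor pool = new ThreadPoolExecutor(
                    2, 4, 60, TimeUnit.SECONDS, new LinkedBlockingQueue<>(100));
            ExecutorPoolSampler sampler = new ExecutorPoolSampler(pool);
            System.out.printf("pool=%d active=%d queued=%d completed=%d%n",
                    sampler.poolSize(), sampler.activeThreads(),
                    sampler.queueSize(), sampler.completedTasks());
            pool.shutdown();
        }
    }

At registration time, a metrics registry (MicroProfile Metrics gauges, for example) could be pointed at accessors like these, so the same approach works for any executor type that exposes the relevant counters.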

@batsatt added the enhancement (New feature or request), MP, and SE labels on Aug 23, 2019
@spericas (Member) commented:
Likely stating the obvious, but we should do this after the 2.0 work is merged.

@m0mus m0mus added P3 P4 and removed P3 labels Aug 29, 2019
@tomas-langer tomas-langer removed the P4 label Jun 24, 2021
@tomas-langer (Member) commented:
Requested recently by a customer through a different channel; let's re-triage.

@gmpatter commented:
We are seeing in our app that when load increases on an instance, requests are put on the thread pool queue. When the queue reaches full capacity, the app responds with a 503, which is all expected behaviour. But when we deploy on Kubernetes, this can result in k8s restarting the pod because it is returning 503 for the health check.
It might be useful to have a metric for the queue size so that we can alert when the queue is growing, and possibly also use it in our scaling decisions.

@tjquinno (Member) commented:

@gmpatter (and others)

Helidon 2.x already has an optional feature for enabling some additional key performance indicator metrics. For some reason the documentation for this is missing from our published doc site, but below I've pasted the details from our doc source.

Note that the KPI deferred Meter would capture queued requests.

Key Performance Indicator (KPI) Metrics

Any time you include the Helidon metrics module in your application, Helidon tracks two basic performance indicator metrics:

  • a Counter of all requests received (requests.count), and
  • a Meter of all requests received (requests.meter).

Helidon also includes additional, extended KPI metrics which are disabled by default:

  • current number of requests in-flight - a ConcurrentGauge (requests.inFlight) of requests currently being processed
  • long-running requests - a Meter (requests.longRunning) measuring the rate at which Helidon processes requests which take at least a given amount of time to complete; configurable, defaults to 10000 milliseconds (10 seconds)
  • load - a Meter (requests.load) measuring the rate at which requests are worked on (as opposed to received)
  • deferred - a Meter (requests.deferred) measuring the rate at which a request's processing is delayed after Helidon receives the request

You can enable and control these metrics using configuration:

metrics.key-performance-indicators.extended = true
metrics.key-performance-indicators.long-running.threshold-ms = 2000
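
If the application reads configuration from a YAML source (e.g. application.yaml), the same two keys would nest as shown below; this is just the dotted form above rewritten, nothing additional:

    metrics:
      key-performance-indicators:
        extended: true
        long-running:
          threshold-ms: 2000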

Further, for SE apps:

Your Helidon SE application can also control the behavior of the KPI metrics programmatically.

  • Prepare a KeyPerformanceIndicatorSettings object using its builder and pass that builder to the MetricsSupport.Builder#keyPerformanceIndicatorMetricsSettings() method, or

  • Prepare a Config object containing settings such as the following and pass it to the MetricsSupport.Builder#keyPerformanceIndicatorMetricsConfig() method:

    extended = true
    long-running.threshold-ms = 2000
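
As a rough SE sketch of the second (config-based) option, assuming Helidon 2.x with a Routing/WebServer setup and taking the builder method name from the doc text above (it may differ slightly by release):

    import io.helidon.config.Config;
    import io.helidon.metrics.MetricsSupport;
    import io.helidon.webserver.Routing;
    import io.helidon.webserver.WebServer;

    public class Main {
        public static void main(String[] args) {
            Config config = Config.create();

            // Hand the KPI config subtree to the MetricsSupport builder
            // (builder method name taken from the doc text above).
            MetricsSupport metrics = MetricsSupport.builder()
                    .keyPerformanceIndicatorMetricsConfig(
                            config.get("metrics.key-performance-indicators"))
                    .build();

            Routing routing = Routing.builder()
                    .register(metrics)  // exposes /metrics, including the KPI metrics
                    .build();

            WebServer.builder(routing).build().start();
        }
    }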
    

@tjquinno (Member) commented on Jul 8, 2021:

Possibly nearly equivalent to #2688 and #2689 (as described in the earlier comment).

While it's true that the recently-added KPI metrics do not directly report information about the executor used to queue and run requests, the executor behavior can be inferred from the KPI metrics.

Is it worthwhile to invest the work to add the nearly-equivalent executor-based metrics?

Would they add sufficient actionable information over the existing KPI metrics?
