-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bell Curve Histogram #2704
Comments
This would be a really powerful feature! I'd love to be able to see what my response time distribution would be. It would be even better if you could overlay multiple queries (so compare the performance distribution curve of one data center vs another for example) |
How would this be different to using a bar chart with time-based ranges on the X-Axis and 'count of documents' on the Y-Axis? |
It would always be a bell curve, but the x axis scale would vary with what the percentiles were. Also, this is not for count, but for the evaluation of a metric. |
The "Bell Curve" AKA probability density function differs from a Count aggregate over Histogram buckets, because the Y axis shows relative frequencies instead of absolute counts, e.g.: |
For a summary of statistical properties of the data, we could also consider the boxplot alternative. It shows similar metrics, but makes less explicit of an assumption that the data is normally distributed. If you'd agree, I would close this one in favor of #4157 then. |
@thomasneirynck the box plot would be really helpful but really all that is needed to to divide each bucket by the total values seen in the whole chart. This would just make the Y axis a percentile of the total instead of a raw count. |
Is this related to the Cumulative Distribution function? #3905 |
Lens now has support for formulas, which allow you to normalize the values per bucket, either compared to the overall, or on a per-bucket basis. https://www.elastic.co/guide/en/kibana/current/lens.html#lens-formulas this allows you to compute relative frequencies. E.g. something like For full in-app docs, please find the little info-button in the As you can add multiple lines, this also allows you to compare relative frequencies of multiple distributions. |
The idea is that, for a given query, to build a bell curve.
The X-axis would be the metric value (time) and Y-axis is relative frequency (or reversed if that makes more sense).
I'm not looking for a perfect bell curve, but two queries on the same filter (one for percentiles and one with a range aggregation based on the histogram) could do it nicely.
The text was updated successfully, but these errors were encountered: