-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Boxplot Aggregation #33112
Comments
Pinging @elastic/es-search-aggs |
subscribe |
sub |
After more thought, I think it’d be useful to provide a
The IQR could then be easily calculated through In a basic boxplot, the whiskers extend to the min and max values. But in another popular style, the whiskers extend to the furthest points within [Q1 - 1.5 * IQR, Q3 + 1.5 * IQR]. Points that are outside this interval are considered outliers and usually displayed on the plot. Perhaps we could start with the basic five-number summary, then look into supporting the style with outliers if there’s interest. |
Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes elastic#33112
Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112
Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112
Add support for the histogram field type to boxplot aggs. Closes elastic#52233 Relates to elastic#33112
Add support for the histogram field type to boxplot aggs. Closes elastic#52233 Relates to elastic#33112
(Previous title: 'Interquartile Range Aggregation')
The interquartile range is a common robust measure of statistical dispersion. Compared to the standard deviation, the IQR is less sensitive to outliers in the data, with a breakdown point of 0.25. Along with the median, it is often used in creating a box plot, a simple yet common way to summarize data and identify potential outliers.
The IQR is equal to the third minus the first quartile of a dataset, and could be calculated from the output of a
percentiles
aggregation. Even though it can be easily calculated from quantile information, it may still be useful to provide it as an aggregation for convenience, and to increase its visibility. An alternative option would be to describe the IQR as part of thepercentiles
documentation.Compared to the MAD (#26681), the IQR has a lower breakdown point (0.25, compared to 0.5). However, it is simple to calculate and is better equipped to handle skewed (asymmetric) data.
Relates to #26681.
The text was updated successfully, but these errors were encountered: