Add max/min eval functions #4333
Signed-off-by: Ritvi Bhatt <ribhatt@amazon.com>
:local:
:depth: 1

.. versionadded:: 3.3.0
boolean aIsNumeric = isNumeric(a);
boolean bIsNumeric = isNumeric(b);

if (aIsNumeric != bIsNumeric) {
What is the expectation for max(4, "2")? It should be 4, right?
max(4, "2") should return "2", since strings are always considered larger than numeric values.
array_min/max require all arguments to have the same type. max/min require special handling, and an implicit cast may not work, e.g. the expectation is
@ritvibhatt I synced with @penghuo offline. For Q2, it doesn’t seem to be a type conversion issue as I originally thought, but rather a data sorting one. For example, we could define a custom comparator in Java to handle the new sorting rule for numeric and string values, and then apply it in min/max/sort APIs. Perhaps we can do something similar within the Calcite accumulator?
Signed-off-by: ritvibhatt <53196324+ritvibhatt@users.noreply.github.com>
import java.util.Comparator;

/** Comparator for MAX operations where strings have higher precedence than numbers. */
public class MaxTypeComparator implements Comparator<Object> {
Is MinTypeComparator simply the inverse of MaxTypeComparator? I'm thinking of keeping only one comparator and using it in Min/MaxFunction below via Java Stream's min(comparator)/max(comparator) API.
Also, it may be worth adding a dedicated UT to show its behavior for num-num, num-string, string-string, etc.
Updated and added tests, thank you!
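For readers following the thread, here is a minimal sketch of the single-comparator idea suggested above. It is not the code merged in this PR: the class name `MixedTypeComparator` and its `max`/`min` helpers are hypothetical, and it assumes every argument is non-null and is either a `Number` or a value that can be compared through `toString()`.

```java
import java.util.Arrays;
import java.util.Comparator;

/** Hypothetical comparator for illustration: every number orders below every string. */
final class MixedTypeComparator implements Comparator<Object> {
  static final MixedTypeComparator INSTANCE = new MixedTypeComparator();

  @Override
  public int compare(Object a, Object b) {
    boolean aIsNumeric = a instanceof Number;
    boolean bIsNumeric = b instanceof Number;
    if (aIsNumeric != bIsNumeric) {
      // Mixed pair: the numeric operand is always the smaller one.
      return aIsNumeric ? -1 : 1;
    }
    if (aIsNumeric) {
      // num-num: compare by double value.
      return Double.compare(((Number) a).doubleValue(), ((Number) b).doubleValue());
    }
    // string-string: plain lexicographic comparison.
    return a.toString().compareTo(b.toString());
  }

  /** Largest argument under the ordering above, or null when no arguments are given. */
  static Object max(Object... args) {
    return Arrays.stream(args).max(INSTANCE).orElse(null);
  }

  /** Smallest argument under the same ordering, reusing the same comparator. */
  static Object min(Object... args) {
    return Arrays.stream(args).min(INSTANCE).orElse(null);
  }
}
```

Under these assumptions, `MixedTypeComparator.max(4, "2")` yields `"2"` and `MixedTypeComparator.min(4, "2")` yields `4`, so a separate min comparator is not needed. The num-num and string-string cases fall through to the numeric and lexicographic branches.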
dai-chen left a comment
Thanks for the changes!
if (aIsNumeric) {
  return Double.compare(((Number) a).doubleValue(), ((Number) b).doubleValue());
} else {
  return Integer.compare(a.toString().compareTo(b.toString()), 0);
What does this Integer.compare with 0 mean?
It normalizes the result: if the string comparison returns a negative value it becomes -1, and if it returns a positive value it becomes 1. I don't think that is necessary, so I can remove it and just leave the string comparison.
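To illustrate the normalization being discussed, the sketch below contrasts the raw `String.compareTo` result with the value wrapped in `Integer.compare(..., 0)`. The class and method names are hypothetical and exist only for this example.

```java
/** Hypothetical helper contrasting raw and normalized string comparison results. */
final class CompareSignDemo {
  /** Raw lexicographic result; its magnitude depends on the first differing characters. */
  static int raw(String a, String b) {
    return a.compareTo(b);
  }

  /** Same ordering, collapsed to -1, 0 or 1. */
  static int normalized(String a, String b) {
    return Integer.compare(a.compareTo(b), 0);
  }
}
```

For example, `raw("a", "c")` is -2 while `normalized("a", "c")` is -1. Since `Comparator` callers such as `Stream.max` only inspect the sign of the result, both forms pick the same element, which is consistent with the author's remark that the wrapper can be dropped.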
}

private static boolean isNumeric(Object obj) {
  return obj instanceof Number;
Could you confirm whether a comparison between "1" and "2" is also treated as a numerical comparison?
No, they will be compared as strings, so max("9", "21") will return "9".
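The difference between the two orderings can be sketched with standard library calls only. The class and method names below are hypothetical. The example assumes the values really are `String` instances in one case and `Integer` instances in the other, which mirrors the `instanceof Number` check in `isNumeric`.

```java
import java.util.Collections;
import java.util.List;

/** Hypothetical demo: the same digits order differently as strings and as numbers. */
final class StringVsNumericOrderDemo {
  /** Lexicographic maximum: "9" beats "21" because '9' sorts after '2'. */
  static String maxAsStrings(List<String> values) {
    return Collections.max(values);
  }

  /** Numeric maximum of the same digits: 21 beats 9. */
  static int maxAsNumbers(List<Integer> values) {
    return Collections.max(values);
  }
}
```

So numeric-looking strings are never parsed: `maxAsStrings(List.of("9", "21"))` returns `"9"`, while `maxAsNumbers(List.of(9, 21))` returns 21.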
The backport to
To backport manually, run these commands in your terminal:
# Navigate to the root of your repository
cd $(git rev-parse --show-toplevel)
# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add ../.worktrees/sql/backport-2.19-dev 2.19-dev
# Navigate to the new working tree
pushd ../.worktrees/sql/backport-2.19-dev
# Create a new branch
git switch --create backport/backport-4333-to-2.19-dev
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 fae06873a057f3bcdea1a7a113c74932fc801deb
# Push it to GitHub
git push --set-upstream origin backport/backport-4333-to-2.19-dev
# Go back to the original working tree
popd
# Delete the working tree
git worktree remove ../.worktrees/sql/backport-2.19-dev
Then, create a pull request where the
(cherry picked from commit fae0687)
Description
Add support for min/max statistical eval functions, allowing users to find maximum and minimum values among multiple arguments within a single row.
Usage Examples
-- Returns the larger value between age field and 30 for each row
source=accounts | eval max_age = MAX(age, 30) | fields age, max_age
-- Returns either 'John' or value in firstname depending on what is larger lexicographically
source=accounts | eval result = MAX(age, 'John', firstname) | fields age, firstname, result
-- Returns either the value in the age field or 35
source=accounts | eval result = MIN(age, 35, firstname) | fields age, firstname, result
Related Issues
Resolves #4341
Check List
--signoff or -s.
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.