Move pipeline agg validation to coordinating node #53669

nik9000 · 2020-03-17T14:41:16Z

This moves the pipeline aggregation validation from the data node to the
coordinating node so that we, eventually, can stop sending pipeline
aggregations to the data nodes entirely. In fact, it moves it into the
"request validation" stage so multiple errors can be accumulated and
sent back to the requester for the entire request. We can't always take
advantage of that, but it'll be nice for folks not to have to play
whack-a-mole with validation.

This is implemented by replacing PipelineAggregationBuilder#validate
with:

protected abstract void validate(ValidationContext context);

The ValidationContext handles the accumulation of validation failures,
provides access to the aggregation's siblings, and implements a few
validation utility methods.
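
As a rough illustration of the shape described above, here is a minimal sketch of a context that accumulates failures and exposes sibling names. Every class name in it (`SketchValidationContext`, `SketchPipelineBuilder`, `SketchSiblingPipelineBuilder`, `SketchRequestValidation`) and every error message is made up for the example; only the `validate(ValidationContext context)` method shape and the `addValidationError` call come from this PR.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for the ValidationContext in this PR; not the real class.
final class SketchValidationContext {
    private final Set<String> siblingNames;
    private final List<String> errors = new ArrayList<>();

    SketchValidationContext(Set<String> siblingNames) {
        this.siblingNames = siblingNames;
    }

    // Record a failure instead of throwing, so later checks still run.
    void addValidationError(String message) {
        errors.add(message);
    }

    // Lets a pipeline aggregation look at the names of its siblings.
    Set<String> siblingNames() {
        return siblingNames;
    }

    List<String> errors() {
        return List.copyOf(errors);
    }
}

// Hypothetical builder base class with the new method shape.
abstract class SketchPipelineBuilder {
    protected final String name;
    protected final String bucketsPath;

    SketchPipelineBuilder(String name, String bucketsPath) {
        this.name = name;
        this.bucketsPath = bucketsPath;
    }

    protected abstract void validate(SketchValidationContext context);
}

// A sibling pipeline aggregation whose buckets path must name a sibling.
final class SketchSiblingPipelineBuilder extends SketchPipelineBuilder {
    SketchSiblingPipelineBuilder(String name, String bucketsPath) {
        super(name, bucketsPath);
    }

    @Override
    protected void validate(SketchValidationContext context) {
        if (bucketsPath.isEmpty()) {
            context.addValidationError("[" + name + "] requires a buckets path");
            return;
        }
        if (context.siblingNames().contains(bucketsPath) == false) {
            context.addValidationError("[" + name + "] references unknown sibling [" + bucketsPath + "]");
        }
    }
}

final class SketchRequestValidation {
    // One shared context for the whole request, so every failure is reported together.
    static List<String> validateAll(List<SketchPipelineBuilder> builders, Set<String> siblingNames) {
        SketchValidationContext context = new SketchValidationContext(siblingNames);
        for (SketchPipelineBuilder builder : builders) {
            builder.validate(context);
        }
        return context.errors();
    }
}
```

Calling `SketchRequestValidation.validateAll` with two misconfigured builders returns two messages in one pass, which is the "no whack-a-mole" behaviour the description is after.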

elasticmachine · 2020-03-17T14:41:18Z

Pinging @elastic/es-analytics-geo (:Analytics/Aggregations)

nik9000 · 2020-03-17T14:43:39Z

This'll probably change some errors from 500s to 400s. I believe that'll be a net positive because I don't imagine anyone is relying on 500 errors for bad aggregation configuration.

polyfractal

++ I like this. Left a few tiny notes and optional nits.

Should we add a note to the breaking changes doc? I'm onboard with the fact these should have been 4xx errors anyway, and it's unlikely someone was depending on these particular 5xx errors in their code...but it might be nice to note it in the docs anyway?

polyfractal · 2020-03-19T17:48:53Z

server/src/main/java/org/elasticsearch/search/aggregations/AggregatorFactories.java

+                orderedPipelineAggregators = resolvePipelineAggregatorOrder(pipelineAggregatorBuilders, aggregationBuilders);
+            } catch (IllegalArgumentException iae) {
+                context.addValidationError(iae.getMessage());
+                return;


Should we allow the validations to keep running down the tree, so we can tell the user all the problems at once?

I was tempted but I think the tree is pretty borked at this point and you'll end up with duplicate error messages all about the same thing. And I figured we were just returning a single error message right now so it probably isn't worse than it was before and we could do it later if we wanted it.

Makes sense to me 👍
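
The trade-off discussed in this thread can be sketched roughly as follows: when resolving the ordering fails, record that one error and stop, because checks further down the tree would only repeat it. This is a simplified stand-in and not the `AggregatorFactories` code; `SketchOrderingValidation`, its map-based input, and the error messages are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

final class SketchOrderingValidation {
    // Hypothetical ordering step: `references` maps each pipeline aggregation
    // name to the name its buckets path points at. Throws on a cycle.
    static List<String> resolveOrder(Map<String, String> references) {
        List<String> ordered = new ArrayList<>();
        for (String name : references.keySet()) {
            visit(name, references, ordered, new ArrayList<>());
        }
        return ordered;
    }

    private static void visit(String name, Map<String, String> references,
                              List<String> ordered, List<String> inProgress) {
        if (ordered.contains(name)) {
            return;
        }
        if (inProgress.contains(name)) {
            throw new IllegalArgumentException(
                "cyclic dependency involving pipeline aggregation [" + name + "]");
        }
        inProgress.add(name);
        String target = references.get(name);
        if (references.containsKey(target)) {
            visit(target, references, ordered, inProgress); // dependency goes first
        }
        inProgress.remove(name);
        ordered.add(name);
    }

    static List<String> validate(Map<String, String> references, Set<String> plainAggNames) {
        List<String> errors = new ArrayList<>();
        List<String> ordered;
        try {
            ordered = resolveOrder(references);
        } catch (IllegalArgumentException iae) {
            errors.add(iae.getMessage());
            return errors; // the tree is unreliable here, so skip the per-aggregation checks
        }
        for (String name : ordered) {
            String target = references.get(name);
            if (references.containsKey(target) == false && plainAggNames.contains(target) == false) {
                errors.add("[" + name + "] references unknown aggregation [" + target + "]");
            }
        }
        return errors;
    }
}
```

In this sketch a request with a cycle yields exactly one message, while a request that orders cleanly still gets every per-aggregation failure accumulated.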

polyfractal · 2020-03-19T18:17:21Z

server/src/main/java/org/elasticsearch/search/aggregations/PipelineAggregationBuilder.java

+            }
+
+            @Override
+            public void validateParentAggSequentiallyOrdered(String type, String name) {


Ahh yes, this thing. Would be nice someday if we could get rid of these instanceofs with some kind of isSequential() method on the agg.

Battle for another day :)

+++++++++++++
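
For what the `isSequential()` idea might look like, here is a speculative sketch. Nothing in it exists in the codebase as far as this thread shows: `SketchParentAgg`, `SketchHistogramLike`, `SketchTermsLike`, and the error message are invented, and only the `validateParentAggSequentiallyOrdered(String type, String name)` name comes from the diff above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical capability method; the real builders do not necessarily expose this.
interface SketchParentAgg {
    String name();

    // Aggregations are assumed non-sequential unless they say otherwise.
    default boolean isSequential() {
        return false;
    }
}

// Stand-in for a histogram-style aggregation whose buckets are ordered.
final class SketchHistogramLike implements SketchParentAgg {
    private final String name;

    SketchHistogramLike(String name) {
        this.name = name;
    }

    @Override
    public String name() {
        return name;
    }

    @Override
    public boolean isSequential() {
        return true;
    }
}

// Stand-in for an aggregation with no inherent bucket order.
final class SketchTermsLike implements SketchParentAgg {
    private final String name;

    SketchTermsLike(String name) {
        this.name = name;
    }

    @Override
    public String name() {
        return name;
    }
}

final class SketchSequentialCheck {
    // Asks the parent for the capability instead of testing its concrete type.
    static void validateParentAggSequentiallyOrdered(SketchParentAgg parent, String type,
                                                     String name, List<String> errors) {
        if (parent.isSequential() == false) {
            errors.add(type + " aggregation [" + name + "] needs a sequentially ordered parent, but ["
                + parent.name() + "] is not");
        }
    }
}
```

The appeal of this shape is that a new sequential aggregation type would opt in by overriding one method, rather than being added to a list of `instanceof` checks.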

polyfractal · 2020-03-19T18:20:11Z

.../org/elasticsearch/search/aggregations/pipeline/BucketMetricsPipelineAggregationBuilder.java

                .findAny();
        if (aggBuilder.isPresent()) {
            if ((aggBuilder.get() instanceof MultiBucketAggregationBuilder) == false) {
-                throw new IllegalArgumentException("The first aggregation in " + PipelineAggregator.Parser.BUCKETS_PATH.getPreferredName()
+                context.addValidationError("The first aggregation in " + PipelineAggregator.Parser.BUCKETS_PATH.getPreferredName()
                        + " must be a multi-bucket aggregation for aggregation [" + name + "] found :"
                        + aggBuilder.get().getClass().getName() + " for buckets path: " + bucketsPaths[0]);
            }
        } else {


Optional: should we remove this else statement and just leave the context error as the last statement in the method?

I always feel like else are trappy/unnecessary when it's just doing some kind of error or return statement. Less rightward drift if we remove.

Totally optional, this might just be a quirk I've picked up :)

Or, re-arrange so there's an isPresent() == false check first, and an instanceof check next, which avoids nesting.

But yeah, optional pending your preferences :D
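
The rearrangement suggested here (absence check first, then the `instanceof` check, with no `else`) might look like the sketch below. It is not the code from `BucketMetricsPipelineAggregationBuilder`: the marker interfaces, the `List<String>` error sink, and both messages are placeholders, and the message for the missing-aggregation case in particular is invented because the diff above is cut off before it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

// Hypothetical marker types standing in for the real aggregation builders.
interface SketchAgg {}

interface SketchMultiBucketAgg extends SketchAgg {}

final class SketchTermsAgg implements SketchMultiBucketAgg {}

final class SketchMaxAgg implements SketchAgg {}

final class SketchBucketsPathCheck {
    // Guard clauses: each failure is handled at the top level, so nothing nests.
    static void validate(Optional<? extends SketchAgg> aggBuilder, String name,
                         String bucketsPath, List<String> errors) {
        if (aggBuilder.isPresent() == false) {
            // Placeholder message; the real one is not shown in this thread.
            errors.add("no aggregation found for buckets path [" + bucketsPath
                + "] in aggregation [" + name + "]");
            return;
        }
        if ((aggBuilder.get() instanceof SketchMultiBucketAgg) == false) {
            errors.add("the first aggregation in buckets path [" + bucketsPath
                + "] must be a multi-bucket aggregation for aggregation [" + name + "]");
        }
    }
}
```

Both forms report the same failures; the flat version just keeps each condition and its message side by side.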

nik9000 · 2020-03-19T21:06:39Z

> Should we add a note to the breaking changes doc? I'm onboard with the fact these should have been 4xx errors anyway, and it's unlikely someone was depending on these particular 5xx errors in their code...but it might be nice to note it in the docs anyway?

I think I have to do that as part of the backport, right?

nik9000 · 2020-03-19T21:40:52Z

@polyfractal I believe I've done the things you asked! Thanks so much for the review.

polyfractal · 2020-03-23T18:18:26Z

> I think I have to do that as part of the backport, right?

🤦‍♂️ yes, yes I believe you're right.

nik9000 added >enhancement :Analytics/Aggregations Aggregations v8.0.0 v7.7.0 labels Mar 17, 2020

nik9000 requested review from not-napoleon and polyfractal March 17, 2020 14:41

nik9000 added >refactoring and removed >enhancement labels Mar 17, 2020

nik9000 added 2 commits March 17, 2020 11:09

Fix tests

3dbada8

Fix test

7aaae7a

nik9000 mentioned this pull request Mar 18, 2020

Pipeline aggregations are weird #53742

Closed

3 tasks

polyfractal reviewed Mar 19, 2020

Merge branch 'master' into pipeline_validate_builders

055ed19

Less nesting

c686fe0

nik9000 force-pushed the pipeline_validate_builders branch from 2b846b9 to c686fe0 Compare March 19, 2020 21:40

polyfractal approved these changes Mar 23, 2020

nik9000 merged commit 569dffc into elastic:master Mar 23, 2020

nik9000 added the backport pending label Mar 23, 2020

nik9000 removed the backport pending label Mar 23, 2020

nik9000 added a commit that referenced this pull request Mar 25, 2020

Add breaking change note for #53669

4d7d3ef

nik9000 added a commit that referenced this pull request Mar 25, 2020

Add breaking change note for #53669

16e4bd5

nik9000 added the >breaking label Apr 8, 2020

jakelandis mentioned this pull request Feb 22, 2021

DRAFT [META] REST Compatible API V7 completeness #68905

Closed

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

jrodewig mentioned this pull request Sep 27, 2021

[DOCS] Status code change for pipeline validation errors #78324

Merged

jrodewig added a commit that referenced this pull request Sep 27, 2021

[DOCS] Status code change for pipeline validation errors (#78324)

4434730

Adds a note to the pipeline aggregation docs for error status codes changed with #53669.

jrodewig added a commit that referenced this pull request Sep 27, 2021

[DOCS] Status code change for pipeline validation errors (#78324) (#7…

72955eb
