Make sure non-collecting aggs include sub-aggs (backport of #64214) #64244

nik9000 · 2020-10-27T20:10:25Z

Now that we're consistently using can_match to filter which shards we
run on we can get this confusing case:

1. You have a search with, say, a range and a sub-agg.
2. That search has a query that can_match can recognize will match no
   docs. On any shard.
3. So we dutifully run it on a single shard so it can produce the
   "empty" aggs.
4. The shard we pick happens to not have the target of the range mapped.
5. This kicks in the special range aggregator that doesn't collect any
   documents.
6. Before this commit, that range aggregator also never produced any
   sub-aggs.

So, without this change, it was quite possible for a search that
happened to match no documents to "throw away" the sub-aggs of a range
and a few other aggs.
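For illustration, a request of the shape described above might look like the sketch below. The field names (`price`, `rating`, `@timestamp`), the agg names, and both response fragments are assumptions made up for this example, not output captured from a cluster. The helper only reports which requested sub-aggs are absent from each bucket.

```python
# Hypothetical search body: a range agg with an avg sub-agg, plus a query
# that is assumed to match no documents on any shard.
search_body = {
    "size": 0,
    "query": {"range": {"@timestamp": {"gte": "2200-01-01"}}},
    "aggs": {
        "price_ranges": {
            "range": {"field": "price", "ranges": [{"to": 100}, {"from": 100}]},
            "aggs": {"avg_rating": {"avg": {"field": "rating"}}},
        }
    },
}

# Illustrative response fragments (assumed shapes, not captured output).
# "before": the buckets come back without the requested sub-agg.
aggs_before = {
    "price_ranges": {
        "buckets": [
            {"key": "*-100.0", "to": 100.0, "doc_count": 0},
            {"key": "100.0-*", "from": 100.0, "doc_count": 0},
        ]
    }
}
# "after": every bucket carries an empty result for the sub-agg.
aggs_after = {
    "price_ranges": {
        "buckets": [
            {"key": "*-100.0", "to": 100.0, "doc_count": 0,
             "avg_rating": {"value": None}},
            {"key": "100.0-*", "from": 100.0, "doc_count": 0,
             "avg_rating": {"value": None}},
        ]
    }
}


def buckets_missing_sub_aggs(request_aggs, response_aggs):
    """Return (agg name, bucket key, sub-agg name) for each requested
    sub-agg that is absent from a bucket in the response."""
    missing = []
    for name, definition in request_aggs.items():
        wanted = definition.get("aggs", {})
        for bucket in response_aggs.get(name, {}).get("buckets", []):
            for sub_name in wanted:
                if sub_name not in bucket:
                    missing.append((name, bucket.get("key"), sub_name))
    return missing


print(buckets_missing_sub_aggs(search_body["aggs"], aggs_before))
print(buckets_missing_sub_aggs(search_body["aggs"], aggs_after))
```

A check like this is handy in a client-side test because the "thrown away" sub-aggs otherwise only show up as a missing key somewhere deep in the response.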

We've had this problem for a long, long time but it is more confusing
now because can_match is really kicking in and causing us to see cases
where it looks like you are targeting a lot of shards but you really are
only targeting a couple. It used to be that to get the "no sub-aggs"
behavior you had to explicitly target only shards that didn't map the
target field of the range agg. And, like, in that case it isn't too
bad because you targeted a sort of degenerate shard. But now that
can_match is doing its thing you can end up with the confusing steps
above. It took me several hours to track down what was happening, even
though I know how the individual pieces of all of this work. It took four
hours to figure out how they fit together in this case....

Anyway! This replaces all the aggregator implementations that throw out
the sub-aggregators with ones that keep them. I think this'll be less
confusing in the future.
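As a rough mental model of the change, here is a toy sketch in Python. It is not the Elasticsearch code (which is Java), and every class and method name in it is hypothetical. It only contrasts a non-collecting aggregator that discards its sub-aggregators with one that keeps them and asks each for its empty result.

```python
class Avg:
    """Toy stand-in for a metric sub-aggregator (hypothetical)."""

    def build_empty(self):
        # An avg over zero documents has no value.
        return {"value": None}


class DroppingNonCollectingRange:
    """Toy model of the old behavior: sub-aggregators are thrown away."""

    def __init__(self, range_keys, sub_aggs):
        self.range_keys = range_keys
        # sub_aggs is deliberately ignored, mirroring the old behavior.

    def build_empty(self):
        return [{"key": key, "doc_count": 0} for key in self.range_keys]


class KeepingNonCollectingRange:
    """Toy model of the new behavior: sub-aggregators are kept."""

    def __init__(self, range_keys, sub_aggs):
        self.range_keys = range_keys
        self.sub_aggs = sub_aggs

    def build_empty(self):
        buckets = []
        for key in self.range_keys:
            bucket = {"key": key, "doc_count": 0}
            # Each sub-agg contributes its own empty result to the bucket.
            for name, agg in self.sub_aggs.items():
                bucket[name] = agg.build_empty()
            buckets.append(bucket)
        return buckets


keys = ["*-100.0", "100.0-*"]
subs = {"avg_rating": Avg()}
print(DroppingNonCollectingRange(keys, subs).build_empty())
print(KeepingNonCollectingRange(keys, subs).build_empty())
```

In this model both variants still collect no documents; the only difference is whether the empty buckets carry entries for their sub-aggs.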

Closes #64142


Backport is coming. Please wait.

nik9000 · 2020-10-28T13:31:39Z

run elasticsearch-ci/default-distro

nik9000 · 2020-10-28T16:18:17Z

@elasticmachine, update branch

nik9000 · 2020-10-28T20:01:56Z

@elasticmachine update branch

nik9000 · 2020-10-28T20:52:06Z

run elasticsearch-ci/2

nik9000 · 2020-10-28T21:33:17Z

@elasticmachine update branch

nik9000 · 2020-10-29T13:47:43Z

run elasticsearch-ci/packaging-sample-unix

nik9000 added backport v7.11.0 labels Oct 27, 2020

nik9000 added 2 commits October 27, 2020 16:39

hush bwc tests

a7fc0d3

Backport is coming. Please wait.

Update fix after backport

4b1a031

Merge branch '7.x' into unmapped_has_sub_factories_7_x

60bb87d

Merge branch '7.x' into unmapped_has_sub_factories_7_x

a362620

nik9000 merged commit cc693e4 into elastic:7.x Oct 29, 2020
