-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pipeline aggregations shouldn't serialize to remote clusters in cross-cluster search #73680
Comments
Pinging @elastic/es-analytics-geo (Team:Analytics) |
It is always executed on the coordinating node. But old coordinating node versions they don't know how to read the pipelines from the request - only from a response from a response sent back from the data nodes. The thing I haven't reasoned through is what cases we don't have to send them. |
We should probably talk about this in the next team meeting. |
A few of us talked and think that we do actually need to serialize these responses. We thought that we didn't, but async search serializes the response with binary. So! We have to make sure these are all serializable. Sadness. I'll see about adding more tests to catch places that aren't. |
This tries to automatically detect problems seriealization aggregation results by round tripping the results in our usual `AggregatorTestCase`. It's "free" testing in that we already have the tests written and we'll get round trip testing "on the side". But it's kind of sneaky because we aren't *trying* to test serialization here. So they failures can be surprising. But surprising test failures are better than bugs. At least that is what I tell myself so I can sleep at night. Closes elastic#73680
This tries to automatically detect problems seriealization aggregation results by round tripping the results in our usual `AggregatorTestCase`. It's "free" testing in that we already have the tests written and we'll get round trip testing "on the side". But it's kind of sneaky because we aren't *trying* to test serialization here. So they failures can be surprising. But surprising test failures are better than bugs. At least that is what I tell myself so I can sleep at night. Closes #73680
This tries to automatically detect problems seriealization aggregation results by round tripping the results in our usual `AggregatorTestCase`. It's "free" testing in that we already have the tests written and we'll get round trip testing "on the side". But it's kind of sneaky because we aren't *trying* to test serialization here. So they failures can be surprising. But surprising test failures are better than bugs. At least that is what I tell myself so I can sleep at night. Closes elastic#73680
This tries to automatically detect problems seriealization aggregation results by round tripping the results in our usual `AggregatorTestCase`. It's "free" testing in that we already have the tests written and we'll get round trip testing "on the side". But it's kind of sneaky because we aren't *trying* to test serialization here. So they failures can be surprising. But surprising test failures are better than bugs. At least that is what I tell myself so I can sleep at night. Closes elastic#73680
This tries to automatically detect problems seriealization aggregation results by round tripping the results in our usual `AggregatorTestCase`. It's "free" testing in that we already have the tests written and we'll get round trip testing "on the side". But it's kind of sneaky because we aren't *trying* to test serialization here. So they failures can be surprising. But surprising test failures are better than bugs. At least that is what I tell myself so I can sleep at night. Closes #73680
Pipeline aggregations have long been a way to "do one more thing" on aggregated data. I THINK that "one more thing" is always computed on the coordinating node and the coordinating node is always on the "local" cluster.
Consequently, there seems to be an opportunity here by not serializing pipeline aggregations to remote clusters in cross-cluster search. This way, newer pipeline aggs could be used when querying older clusters.
//CC @elastic/ml-core
The text was updated successfully, but these errors were encountered: