Performance regression on LatLonPoint#newPolygonQuery #11824

iverase · 2022-09-27T07:57:27Z

Description

I just noticed a big performance regression on polygon queries using the LatLonPoint field in the Lucene geo benchmarks.

I checked and the regression was introduced by this change: #1017.

My suspicion is that before this change, SpatialQuery called the method #getSpatialVisitor() once for the whole index, whereas the new version calls it once per segment. This method can be expensive for LatLonPoint queries, hence the regression.
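To make the suspected difference concrete, here is a toy model of the two call shapes. It is not Lucene code: `VisitorCostSketch`, `buildSpatialVisitor`, and `scoreSegment` are hypothetical stand-ins, and the segment names are made up. It only counts how many times an expensive setup step runs when it sits inside the per-segment loop versus outside it.

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/** Toy model, not Lucene code: every name here is a hypothetical stand-in. */
final class VisitorCostSketch {

    /** Counts how often the (pretend) expensive visitor is built. */
    static final AtomicInteger BUILDS = new AtomicInteger();

    /** Stand-in for an expensive setup step such as building a spatial visitor. */
    static Object buildSpatialVisitor() {
        BUILDS.incrementAndGet();
        return new Object();
    }

    /** Stand-in for the per-segment work that consumes the visitor. */
    static void scoreSegment(String segment, Object visitor) {
        // Intentionally empty: only the number of builds matters here.
    }

    /** Suspected regressed shape: the visitor is rebuilt for every segment. */
    static int buildsWhenPerSegment(List<String> segments) {
        BUILDS.set(0);
        for (String segment : segments) {
            Object visitor = buildSpatialVisitor(); // runs once per segment
            scoreSegment(segment, visitor);
        }
        return BUILDS.get();
    }

    /** Previous shape: the visitor is built once and shared by all segments. */
    static int buildsWhenPerQuery(List<String> segments) {
        BUILDS.set(0);
        Object visitor = buildSpatialVisitor(); // runs once for the whole index
        for (String segment : segments) {
            scoreSegment(segment, visitor);
        }
        return BUILDS.get();
    }

    public static void main(String[] args) {
        List<String> segments = List.of("_0", "_1", "_2", "_3");
        System.out.println("per segment: " + buildsWhenPerSegment(segments));
        System.out.println("per query:   " + buildsWhenPerQuery(segments));
    }
}
```

With four segments, the first shape builds the visitor four times and the second builds it once, so the extra cost grows with the number of segments in the index.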

@nknize FYI

Version and environment details

No response


iverase · 2022-09-27T09:32:26Z

Closed in #11825.

nknize · 2022-09-27T16:15:18Z

Just seeing this. That's exactly what it would be! It snuck in with one of those last commits on the long-running PR. Thanks for refactoring and merging @iverase!

iverase · 2022-09-28T09:53:41Z

The fix seems to bring performance back to previous levels.

mikemccand · 2022-09-28T11:48:23Z

Thanks for catching this @iverase and the quick fix, and the follow-on issue to better detect such regressions before release: #11827

nknize · 2022-09-28T13:44:17Z

What's annoying is how incredibly trappy this override logic is. That simply moving a method call from createWeight to getScorerSupplier results in a 72.2% regression, and that it slipped by even me before merge, doesn't bode well for new committers. And then it sat as a regression until an entire company was interested in releasing.

I wonder if we can do better? Like maybe figure out better guardrails in these methods? Perhaps by something as simple as a rename (e.g., getScorerSupplierPerSegment) to signal that it happens once per segment? This isn't the first and will certainly not be the last time an expensive operation accidentally slips onto a critical path. Any other ideas how to lower the bar here for new committers?
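As one illustration of what a guardrail could look like, beyond the rename suggested above, the sketch below wraps an expensive setup step in a memoizing `Supplier`, so the work runs at most once even if the call site ends up on a per-segment path. This is an assumption for illustration only, not something proposed in this thread or present in Lucene: `MemoizedSetupSketch`, `memoize`, and `delegateRunsFor` are hypothetical names.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

/** Illustrative sketch only: a generic memoizing wrapper, not a Lucene API. */
final class MemoizedSetupSketch {

    /** Wraps a supplier so that the delegate is invoked at most once. */
    static <T> Supplier<T> memoize(Supplier<T> delegate) {
        return new Supplier<T>() {
            private T value;
            private boolean computed;

            @Override
            public synchronized T get() {
                if (!computed) {
                    value = delegate.get(); // expensive work happens only here
                    computed = true;
                }
                return value;
            }
        };
    }

    /** Simulates per-segment callers and returns how often the delegate ran. */
    static int delegateRunsFor(int segmentCount) {
        AtomicInteger runs = new AtomicInteger();
        Supplier<Object> visitor = memoize(() -> {
            runs.incrementAndGet();
            return new Object();
        });
        for (int i = 0; i < segmentCount; i++) {
            visitor.get(); // called once per simulated segment
        }
        return runs.get();
    }

    public static void main(String[] args) {
        System.out.println("delegate runs for 5 segments: " + delegateRunsFor(5));
    }
}
```

The trade-off is that memoization hides the cost rather than signalling it, so it complements a clearer name or a benchmark check and does not replace them.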

iverase added the type:bug label Sep 27, 2022

iverase mentioned this issue Sep 27, 2022

Build SpatialVisitor once per index #11825

Merged

iverase closed this as completed Sep 27, 2022

iverase added this to the 9.4.0 milestone Sep 27, 2022

mikemccand mentioned this issue Sep 28, 2022

Release manager should review lucene benchmarks before building release candidates #11827

Open
