Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rare Terms Aggregation Performance Optimization #13122

Open
sandeshkr419 opened this issue Apr 8, 2024 · 2 comments
Open

Rare Terms Aggregation Performance Optimization #13122

sandeshkr419 opened this issue Apr 8, 2024 · 2 comments

Comments

@sandeshkr419
Copy link
Contributor

Unsure about existing performance of Rare Terms Aggregation at the moment, but looking through initial code at high level, it looks like that this aggregation also utilizes iterating through each document.

The idea is to utilize the terms frequency from Lucene similar to #11643 and avoid iterating through individual documents.

Next Steps:

  • Measure/gather existing performance of rare terms aggregation
  • Improve upon the implementation if it can be done with above ideation
@peternied
Copy link
Member

[Triage - attendees 1 2 3 4 5 6]

@sandeshkr419 Lets make these issues distinct if they need to be tracked separately, but overall idea capture around aggregation perf seems like a single topic

@sandeshkr419
Copy link
Contributor Author

Hi @peternied - keeping these issues separate since the underlying search operations, their code flows and ideas to optimize will be different. They do fall under the aggregation category and there is a probablity that these may share some optimization ideas but for now lets track each of them separately without one being influenced by the other.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Untriaged
Status: 🏗 In progress
Development

No branches or pull requests

2 participants