-
Notifications
You must be signed in to change notification settings - Fork 125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637
Comments
Moving this to k-nn repo. Do you have any more info about the dataset? |
@tomhamer In addition to previous ask please share details about your OpenSearch cluster configuration: num of data/leader nodes, hardware type, RAM size, RAM for java heap and params used for mapping and indexing: number of shards/replicas, for lucene hnsw value for m and ef_construction. |
@tomhamer Could you please answer one more question - did you run indexing while upgrading from 2.3 to 2.4? |
@tomhamer We have identified the reason for latencies you're seeing. While our team is working on a long-term solution you can use following workaround: For new indexes that you're going to create:
For existing indexes you need to do re-indexing in order to keep data. You need to create new index with updated "hybrid.mmap.extensions" setting and then re-index data.
|
I'd like to add to my previous post that it's possible to add new file extensions via
Similarly to update via API, this setting will override pre-delivered list, please make sure you include list of standard extensions along with |
Thanks Martin, this is really useful. We are testing it at the moment. |
With mentioned PR things should be improved in 2.5 release. |
Thanks @martin-gaievski this is excellent - it solved the problem. Really appreciate the work put into the investigation here! |
Describe the bug
A clear and concise description of what the bug is.
We are using lucene HNSW indices and recently upgraded from opensearch 2.3 to 2.4. Since upgrading to 2.4 we have had latencies 10-150 times higher than we had previously. For example, searches that used to take 100ms are taking 4-15 seconds.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
A clear and concise description of what you expected to happen.
Search latency of approx 200ms
Host/Environment (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: