[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637

tomhamer · 2022-11-30T10:45:13Z

Describe the bug
A clear and concise description of what the bug is.

We are using lucene HNSW indices and recently upgraded from opensearch 2.3 to 2.4. Since upgrading to 2.4 we have had latencies 10-150 times higher than we had previously. For example, searches that used to take 100ms are taking 4-15 seconds.

To Reproduce
Steps to reproduce the behavior:

Create a large lucene knn index (100 mil vectors)
Search it using approximate knn
Search latency is very high (5-15 seconds)

Expected behavior
A clear and concise description of what you expected to happen.
Search latency of approx 200ms

Host/Environment (please complete the following information):

OS: amazon linux
Version Opensearch 2.4

Additional context
Add any other context about the problem here.

dblock · 2022-11-30T15:51:37Z

Moving this to k-nn repo. Do you have any more info about the dataset?

martin-gaievski · 2022-11-30T18:24:09Z

@tomhamer In addition to previous ask please share details about your OpenSearch cluster configuration: num of data/leader nodes, hardware type, RAM size, RAM for java heap and params used for mapping and indexing: number of shards/replicas, for lucene hnsw value for m and ef_construction.
Is this a static data set or you're changing data in parallel with search requests? Did you enable force merge for the index?

martin-gaievski · 2022-12-05T20:21:17Z

@tomhamer Could you please answer one more question - did you run indexing while upgrading from 2.3 to 2.4?

martin-gaievski · 2022-12-19T17:34:08Z

@tomhamer We have identified the reason for latencies you're seeing.
With Lucene 9.4 (that is the base for OpenSearch/kNN 2.4) Lucene community made choose of making segment merges in parallel with data ingestion to optimize indexing time. This approach creates more segments comparing to 9.3/OpenSearch/kNN 2.3. On OpenSearch side segment files are not picked up for a fast MMap reading mode, we fall back to default slower but less resource consuming NIO mode.

While our team is working on a long-term solution you can use following workaround:

For new indexes that you're going to create:

add "index.store.hybrid.mmap.extensions" setting to list of index settings. User value for this setting will override default one, please make sure you include existing list of extensions as well as any additional file extension that needed to be read with MMap. You need to add "vec" and "vex" extensions, e.g.:

{
  "settings": {
    "index": {
      "knn": true,
      "refresh_interval": "30s",
      "number_of_shards": 3,
      "number_of_replicas": 0,
      "store.hybrid.mmap.extensions" :  ["nvd", "dvd", "tim", "tip", "dim", "kdd", "kdi", "cfs", "doc", "vec", "vex"]
    }
  },

For existing indexes you need to do re-indexing in order to keep data. You need to create new index with updated "hybrid.mmap.extensions" setting and then re-index data.

create second index "updated_index" with "store.hybrid.mmap.extensions" setting:

PUT /updated_index
{
  "settings": {
    "index": {
      "knn": true,
      "refresh_interval": "30s",
      "number_of_shards": 24,
      "number_of_replicas": 1,
      "store.hybrid.mmap.extensions" :  ["nvd", "dvd", "tim", "tip", "dim", "kdd", "kdi", "cfs", "doc", "vec", "vex"]
    }
  },
  "mappings": {
    "properties": {
      "target_field": {
        "type": "knn_vector",
        "dimension": 128,
        "method": {
          "name": "hnsw",
          "space_type": "l2",
          "engine": "lucene"
        }
      }
    }
  }
}

run re-index request from existing to updated index

POST _reindex
{
   "source":{
      "index":"current_index"
   },
   "dest":{
      "index":"updated_index"
   }
}

forward requests to "updated_index". Original index can be deleted after this.

martin-gaievski · 2023-01-09T18:43:58Z

I'd like to add to my previous post that it's possible to add new file extensions via opensearch.yml file. Exact line should be:

index.store.hybrid.mmap.extensions: [nvd, dvd, tim, tip, dim, kdd, kdi, cfs, doc, vec, vex]

Similarly to update via API, this setting will override pre-delivered list, please make sure you include list of standard extensions along with vec and vex files for vector values.

tomhamer · 2023-01-16T19:29:32Z

Thanks Martin, this is really useful. We are testing it at the moment.

martin-gaievski · 2023-01-24T01:20:16Z

With mentioned PR things should be improved in 2.5 release.

tomhamer · 2023-01-31T04:42:48Z

Thanks @martin-gaievski this is excellent - it solved the problem. Really appreciate the work put into the investigation here!

tomhamer added bug Something isn't working untriaged labels Nov 30, 2022

dblock transferred this issue from opensearch-project/OpenSearch Nov 30, 2022

navneet1v assigned martin-gaievski Nov 30, 2022

navneet1v removed the untriaged label Nov 30, 2022

martin-gaievski mentioned this issue Dec 20, 2022

Allow extension of core index setting for plugins opensearch-project/OpenSearch#5609

Closed

pandu-k mentioned this issue Jan 10, 2023

Decrease Marqo-os latencies via index settings[ENHANCEMENT] marqo-ai/marqo#265

Open

martin-gaievski mentioned this issue Jan 11, 2023

Add Lucene specific file extensions to core HybridFS #721

Merged

4 tasks

martin-gaievski closed this as completed Jan 24, 2023

martin-gaievski mentioned this issue Feb 14, 2023

[DOC] k-NN plugin: add reference to hybrid mmap extensions setting under performance tunning opensearch-project/documentation-website#2886

Closed

4 tasks

martin-gaievski mentioned this issue Mar 6, 2023

[FEATURE] Support pre-filter queries for Lucene-based approximate k-NN #376

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637

[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637

tomhamer commented Nov 30, 2022 •

edited

Loading

dblock commented Nov 30, 2022

martin-gaievski commented Nov 30, 2022 •

edited

Loading

martin-gaievski commented Dec 5, 2022

martin-gaievski commented Dec 19, 2022 •

edited

Loading

martin-gaievski commented Jan 9, 2023

tomhamer commented Jan 16, 2023

martin-gaievski commented Jan 24, 2023

tomhamer commented Jan 31, 2023

[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637

[BUG] Since upgrading to opensearch 2.4, having issues running knn search at scale #637

Comments

tomhamer commented Nov 30, 2022 • edited Loading

dblock commented Nov 30, 2022

martin-gaievski commented Nov 30, 2022 • edited Loading

martin-gaievski commented Dec 5, 2022

martin-gaievski commented Dec 19, 2022 • edited Loading

martin-gaievski commented Jan 9, 2023

tomhamer commented Jan 16, 2023

martin-gaievski commented Jan 24, 2023

tomhamer commented Jan 31, 2023

tomhamer commented Nov 30, 2022 •

edited

Loading

martin-gaievski commented Nov 30, 2022 •

edited

Loading

martin-gaievski commented Dec 19, 2022 •

edited

Loading