Use a separate Lucene term for each hash table #376
Closed
alexklibisz
started this conversation in
Ideas
Replies: 1 comment
-
Closing this because:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The LSH models currently encode the hash table in the hash and index them all under the same term.
For example, if the hash value for the 99th table is 42, the value that's stored in Lucene is actually something like "99.42" (more efficiently encoded, but that's the gist of it.
Another way to index the hash values would be to have a separate Lucene term for each hash table. So the 99 would be part of the term name, not the actual value. In theory this should save space, but it would require some experimentation to answer:
Beta Was this translation helpful? Give feedback.
All reactions