Speed of BM25 when increasing top-k retrieval results #1228

Answered by lintool
hieudx149 asked this question in Q&A

Yes, it is well known that query latency increases as k in top-k retrieval increases. Here's a relatively recent survey that provides lots of details: https://www.nowpublishers.com/article/Details/INR-057

The insight is that the top-k docs are kept in a heap during query evaluation: the larger the k, the more "work" the algorithm needs to do. Modern algorithms are fast because they are able to (heuristically) answer this question: can this document possibly be in the top k? If not, I can skip it. A larger k lowers the score a document needs to enter the heap, so fewer documents can be skipped.
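
To make the heap argument concrete, here is a minimal sketch in Python. It is not how Anserini or Lucene implement retrieval; it is a toy model that assumes each document arrives with a precomputed score upper bound and an exact score (the tuple layout and the name `top_k_with_pruning` are hypothetical, made up for illustration). A document is skipped when its upper bound cannot beat the lowest score currently in a full heap.

```python
import heapq


def top_k_with_pruning(docs, k):
    """Toy top-k selection with threshold-based skipping.

    docs: iterable of (doc_id, upper_bound, exact_score) tuples, where
          exact_score <= upper_bound (an assumption of this sketch).
    Returns (results sorted by descending score, number of docs fully scored).
    """
    heap = []  # min-heap of (score, doc_id); heap[0] holds the k-th best so far
    fully_scored = 0
    for doc_id, upper_bound, exact_score in docs:
        # Once the heap is full, heap[0][0] is the threshold to beat.
        if len(heap) == k and upper_bound <= heap[0][0]:
            continue  # this document cannot enter the top k, so skip it
        fully_scored += 1  # stands in for the cost of computing the exact score
        if len(heap) < k:
            heapq.heappush(heap, (exact_score, doc_id))
        elif exact_score > heap[0][0]:
            heapq.heapreplace(heap, (exact_score, doc_id))
    return sorted(heap, reverse=True), fully_scored
```

Running this over the same document list with a small k and a large k should show that the large k fully scores at least as many documents, because its threshold is never higher at any point in the scan.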

Answer selected by hieudx149