ranking: add tiebreakers to BM25 #914

stefanhengl · 2025-02-13T17:19:27Z

Relates to SPLF-838

This adds repo freshness and file order as tiebreakers to the final bm25 score, just like we have for Zoekt's default scoring.

During testing I found that it is a lot less likely for the tiebreakers to have an effect with BM25 because the score depends on qualities of the document, such as the relative length and number of matches, which usually differ even if the quality of the match is similar.

Note: I updated the format of the debug string a bit to make it (hopefully) more readable (see screenshot)

Test plan:

Score tests still pass
manual testing: see screenshot

This adds repo freshness and file order as tiebreakers to the final bm25 score, just like we have for Zoekt's default scoring. During testing I found that it is a lot less likely for the tiebreakers to have an effect with BM25 because the score depends on qualites of the document, such as the relative length and number of matches, which usually differ even with the quality of the match is similar. Test plan: - Score tests still pass - manual testing: see screenshots

jtibshirani · 2025-02-13T18:22:47Z

Nice ! Did you run our end-to-end evals to check this improves (or at least doesn't worsen) performance?

Also could you explain what the example shows? The queries are different between BM25 and Default, so not quite sure how to compare them.

index/score.go

jtibshirani · 2025-02-13T21:23:59Z

index/score.go

@@ -361,10 +349,26 @@ func (d *indexData) scoreFilesUsingBM25(fileMatch *zoekt.FileMatch, doc uint32,
 		sumTF += f
 		score += tfScore(k, b, L, f)
 	}
+	// 2 digits of precision


Are we using 2 digits here simply so we can add the tiebreaker? Or does it aid in "collapsing" similar BM25 scores together, to allow tiebreakers to actually become relevant?

We caught up about this over Zoom. We don't believe this will actually collapse BM25 scores, it's just so we can apply the tiebreaker.

The 2 digits are just to avoid an overlap with the tiebreaker. In my experiments and based on the evaluations, 2 digits seemed sufficient, but we can simply increase precision if we see that's necessary.

jtibshirani

Nice ! Did you run our end-to-end evals to check this improves (or at least doesn't worsen) performance?

@stefanhengl told me that he reran evals and didn't see any different in performance.

Over Zoom, we discussed whether we should even have this, because it kicks in so infrequently. I am still supportive of adding it, because I think it creates a simpler mental model for Zoekt scoring: we always have a "query/ file match" component, plus the same tiebreakers. Otherwise, BM25 would be an 'exception' compared to default scoring.

stefanhengl requested a review from jtibshirani February 13, 2025 17:21

stefanhengl marked this pull request as ready for review February 13, 2025 17:22

jtibshirani reviewed Feb 14, 2025

View reviewed changes

jtibshirani approved these changes Feb 14, 2025

View reviewed changes

tiebreaker -> repo-rank, file-rank

54fa03e

stefanhengl merged commit 914a27d into main Feb 17, 2025
10 checks passed

stefanhengl deleted the sh/add-tiebreakers-to-bm25 branch February 17, 2025 09:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ranking: add tiebreakers to BM25 #914

ranking: add tiebreakers to BM25 #914

stefanhengl commented Feb 13, 2025 •

edited

Loading

jtibshirani commented Feb 13, 2025 •

edited

Loading

jtibshirani Feb 13, 2025

jtibshirani Feb 14, 2025

stefanhengl Feb 17, 2025

jtibshirani left a comment

ranking: add tiebreakers to BM25 #914

ranking: add tiebreakers to BM25 #914

Conversation

stefanhengl commented Feb 13, 2025 • edited Loading

jtibshirani commented Feb 13, 2025 • edited Loading

jtibshirani Feb 13, 2025

Choose a reason for hiding this comment

jtibshirani Feb 14, 2025

Choose a reason for hiding this comment

stefanhengl Feb 17, 2025

Choose a reason for hiding this comment

jtibshirani left a comment

Choose a reason for hiding this comment

stefanhengl commented Feb 13, 2025 •

edited

Loading

jtibshirani commented Feb 13, 2025 •

edited

Loading