fix(elasticsearch): Ignore doc length when scoring _all field #1562

Merged
olovy merged 1 commit into develop from feature/lws-316-es-ignore-doc-length-in-_all on Feb 26, 2025

andersju commented Feb 26, 2025

Sets the BM25 similarity option b ("Controls to what degree document length normalizes tf values") to 0 (default 0.75), specifically for the _all field. _all contains all kinds of things, and without this change we are effectively penalizing records whose _all field is longer.

https://www.elastic.co/guide/en/elasticsearch/reference/current/index-modules-similarity.html#bm25

https://kbse.atlassian.net/browse/LWS-316
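
To make the effect of b concrete, here is a minimal sketch in Python. It uses the textbook BM25 term-frequency component with the IDF factor left out, which may differ by a constant factor from what Lucene actually computes. The function name, the example document lengths, and the similarity name `all_no_length_norm` are hypothetical; the settings body only illustrates how a custom BM25 similarity can be attached to a field and is not taken from this PR's diff.

```python
def bm25_tf_weight(tf, doc_len, avg_doc_len, k1=1.2, b=0.75):
    """Textbook BM25 term-frequency component (IDF omitted).

    b controls how strongly document length normalizes tf:
    b=0 disables length normalization, b=1 applies it fully.
    """
    length_norm = 1.0 - b + b * (doc_len / avg_doc_len)
    return tf * (k1 + 1.0) / (tf + k1 * length_norm)


# Same single term hit in a short and a long _all field (made-up lengths).
short_default = bm25_tf_weight(tf=1, doc_len=50, avg_doc_len=100)
long_default = bm25_tf_weight(tf=1, doc_len=400, avg_doc_len=100)

# With b=0 the document length drops out of the score entirely.
short_flat = bm25_tf_weight(tf=1, doc_len=50, avg_doc_len=100, b=0.0)
long_flat = bm25_tf_weight(tf=1, doc_len=400, avg_doc_len=100, b=0.0)

# Hypothetical index body: a custom similarity with b=0, referenced
# from the _all field mapping. The names here are assumptions.
index_body = {
    "settings": {
        "index": {
            "similarity": {
                "all_no_length_norm": {"type": "BM25", "b": 0}
            }
        }
    },
    "mappings": {
        "properties": {
            "_all": {"type": "text", "similarity": "all_no_length_norm"}
        }
    },
}

if __name__ == "__main__":
    print("b=0.75:", short_default, long_default)
    print("b=0:   ", short_flat, long_flat)
```

With the default b, the long record scores lower than the short one for the same hit; with b=0 both get the same weight, which is the behaviour described above for _all.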

andersju requested review from olovy and kwahlin February 26, 2025 10:45
olovy left a comment

🚢

olovy merged commit de9d742 into develop Feb 26, 2025
1 check passed
olovy deleted the feature/lws-316-es-ignore-doc-length-in-_all branch February 26, 2025 12:33
2 participants