Unsupported Elastic Search distribution on BEIR.ipynb #58
Comments
This is an annoying thing about the elasticsearch client: the newest Python ES client added a feature that makes it work only with the newest ES server, to enforce the new ES licensing. Either use the newest ES server or downgrade your Python ES client.
I see, it works when I downgrade the client to 7.9.1. Should the NDCG@10 computed in this notebook match the value on the leaderboard? The notebook gives 0.6843 (figure below), while the leaderboard gives 0.62 for SciFact.
I ran the script 3 times:
And got quite different results.
@NThakur20 I think the issue is that indexing has not yet finished when retrieval starts. Is there some sleep between indexing the documents and starting to query? Elasticsearch indexes docs in the background, i.e. we must wait until all docs are fully indexed before we can start to query.
Another issue could be the shards. Is the ES index created with a single shard?
Edit: Only one shard is created. If I run it on an existing ES index, I get:
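To illustrate the point about waiting for indexing to finish, here is a minimal sketch of one way to block until documents are searchable. It is not necessarily the fix adopted in BEIR. The function name `index_then_wait` and its parameters are hypothetical; `es` is assumed to be an `elasticsearch.Elasticsearch` client from the 7.x line (as used in this thread), whose `index`, `indices.refresh`, and `count` calls are used below.

```python
import time


def index_then_wait(es, index_name, docs, timeout_s=30.0, poll_s=0.5):
    """Index docs, force a refresh, then block until they are all searchable.

    `es` is assumed to be a 7.x `elasticsearch.Elasticsearch` client.
    `docs` maps document id -> document body (a dict).
    Returns the document count reported by the index.
    """
    for doc_id, body in docs.items():
        # 7.x clients accept the document via `body`.
        es.index(index=index_name, id=doc_id, body=body)

    # Ask Elasticsearch to make recently indexed documents visible to search.
    es.indices.refresh(index=index_name)

    # Poll the document count as an extra guard before querying.
    deadline = time.monotonic() + timeout_s
    while True:
        found = es.count(index=index_name)["count"]
        if found >= len(docs):
            return found
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"only {found} of {len(docs)} documents visible in {index_name!r}"
            )
        time.sleep(poll_s)
```

Retrieval would then start only after this function returns. Note that on an index that already holds documents, the count check passes trivially, so the explicit refresh is the part doing the work there.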
Thanks for bringing this issue up.
After adding both the parameters, the code is now producing reproducible scores. I will update the development branch for now. In the next version release of BEIR, these changes will be reflected in the master branch and PyPI version (
Kind Regards,
@NThakur20 Great, thanks for the quick fix. We should also either update the examples to the newest ES version, or freeze the Python ES client to e.g. version (in setup.py):
So that the above issues do not appear.
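Independently of how the dependency is pinned, a runtime check can warn when the installed client is newer than a version known to work. The sketch below is only an illustration: the helper names are hypothetical, and `KNOWN_GOOD` is assumed to be 7.9.1 solely because that is the client version reported working earlier in this thread, not because it is the pin proposed here.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Assumption: 7.9.1 is the client version reported to work in this thread.
KNOWN_GOOD = (7, 9, 1)


def parse_version(text):
    """Turn '7.9.1' into (7, 9, 1); suffixes such as 'rc1' are ignored."""
    parts = []
    for piece in text.split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def installed_client_version(package="elasticsearch"):
    """Return the installed version as a tuple, or None if not installed."""
    try:
        return parse_version(version(package))
    except PackageNotFoundError:
        return None


def is_newer_than_known_good(installed, known_good=KNOWN_GOOD):
    """True if a client is installed and newer than the known-good version."""
    return installed is not None and installed > known_good


if is_newer_than_known_good(installed_client_version()):
    print("Warning: elasticsearch client is newer than", KNOWN_GOOD)
```

A check like this does not replace pinning the dependency; it only surfaces the mismatch early instead of failing later at connection time.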
Hi,
Thanks for the great work. BEIR is extremely valuable!
I just tried to run BEIR.ipynb on Google Colab and was unable to complete the "Lexical Retrieval using BM25 (Elasticsearch)" section due to an unsupported-distribution error from Elasticsearch, as shown below:
I tried different versions but I couldn't get it to work. Any advice?