Search engine project for the Information Retrieval course at CentraleSupelec.
Python 3.5+ is needed.
For the libraries, see requirements.txt:
    pip install -r requirements.txt
You also need to put the data for both collections in a folder named data (more precisely: data/CACM and data/pa1-data).
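As a quick sanity check before opening the notebook, the layout described above could be verified with a few lines of Python. This is only a sketch: the helper name missing_data_dirs is hypothetical and not part of the project, and it assumes the paths are relative to the directory you run it from.

```python
from pathlib import Path

# The two collection folders named in this README.
EXPECTED_DIRS = ("data/CACM", "data/pa1-data")


def missing_data_dirs(root="."):
    """Return the expected data folders that do not exist under root.

    Hypothetical helper for illustration; the project itself does not ship it.
    """
    return [d for d in EXPECTED_DIRS if not (Path(root) / d).is_dir()]


if __name__ == "__main__":
    missing = missing_data_dirs()
    if missing:
        print("Missing data folders: " + ", ".join(missing))
    else:
        print("Both collections found.")
```

An empty list means both collections are where the notebook expects them.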
Please run Rapport.ipynb, or check the notebook.
We built our model using SciPy sparse matrices.
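
To give an idea of what a sparse-matrix model can look like, here is a minimal sketch of a document-term count matrix stored as a SciPy CSR matrix, with documents ranked by a plain dot product against the query. It is an illustration under assumptions, not the code from the notebook: the function names, the whitespace tokenizer and the raw-count weighting are all hypothetical choices made for the example.

```python
from collections import Counter

import numpy as np
from scipy.sparse import csr_matrix


def build_term_document_matrix(docs):
    """Return (matrix, vocabulary); matrix[i, j] counts term j in document i.

    Illustrative only: tokenization is a lowercase whitespace split.
    """
    vocabulary = {}
    rows, cols, counts = [], [], []
    for doc_id, text in enumerate(docs):
        for term, count in Counter(text.lower().split()).items():
            # Assign the next free column index to terms seen for the first time.
            term_id = vocabulary.setdefault(term, len(vocabulary))
            rows.append(doc_id)
            cols.append(term_id)
            counts.append(count)
    matrix = csr_matrix(
        (counts, (rows, cols)),
        shape=(len(docs), len(vocabulary)),
        dtype=np.float64,
    )
    return matrix, vocabulary


def rank(query, matrix, vocabulary):
    """Return (document order, scores) using raw term-count dot products."""
    q = np.zeros(matrix.shape[1])
    for term, count in Counter(query.lower().split()).items():
        if term in vocabulary:  # terms unseen in the collection are ignored
            q[vocabulary[term]] = count
    scores = matrix.dot(q)  # one score per document
    order = np.argsort(-scores, kind="stable")  # best match first
    return order, scores


if __name__ == "__main__":
    docs = ["sparse matrix search", "search engine search", "information retrieval"]
    matrix, vocabulary = build_term_document_matrix(docs)
    order, scores = rank("search engine", matrix, vocabulary)
    print(order, scores)
```

Only the non-zero counts are stored, which is what makes this representation practical when the vocabulary is large and each document uses a small part of it.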