This repository stores the project for the Curricular Unit of Information Processing and Retrieval project from FEUP.
The /api
directory contains a Python API using Flask to connect with Solr and provide data from it.
The /characterization
directory includes scripts and notebooks for data characterization.
The /docs
directory holds documentation and assets.
Furthermore it contains a small video showcasing our final product, 3 slideshows used to support each of the milestone's presentations and a detailed report which documents all of our process developing this product (more technical than the others).
The /retrieval
directory includes scripts related to data retrieval (container initialization, etc.)
The /src
directory contains source code for data processing. This includes all of the pipeline for parsing the data from public apis, getting the data, etc.
The /versevault
directory hosts the web-app that integrates the various components of the project.
Feel free to explore each directory for more details about the specific components of the project.
This project is licensed under the MIT LICENSE.