A model that predicts movie genres, developed by students for the Natural Language Processing (NLP) course taught at Instituto Superior Técnico (IST).
- knn.py - Implementation of the K-Nearest Neighbours (KNN) classifier.
- svm.py - Implementation of the Support Vector Machine (SVM) classifier.
- mgc_input - Dataset pre-processing.
- mgc_output - Writes the predicted genres to a text file.
- reviews.py - Main program to run the KNN and SVM models.
- reviews.ipynb - Main program to run the Bidirectional Encoder Representations from Transformers (BERT) model, which was the best model tested and the one submitted.
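
As a rough illustration of the idea behind a KNN text classifier, the sketch below labels a plot description by majority vote among its most similar training examples. It is not the code in knn.py: the bag-of-words features, the cosine similarity, the function names (`bag_of_words`, `cosine_similarity`, `knn_predict`) and the toy data (`TOY_TRAIN`) are all assumptions made for this example.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Return term counts for a lowercased, whitespace-tokenized text."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is similar to nothing
    return dot / (norm_a * norm_b)


def knn_predict(train, text, k=3):
    """Predict a genre by majority vote among the k most similar examples.

    `train` is a list of (plot_text, genre) pairs; this layout is
    hypothetical and not taken from the project's dataset.
    """
    query = bag_of_words(text)
    scored = sorted(
        ((cosine_similarity(query, bag_of_words(doc)), genre)
         for doc, genre in train),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(genre for _, genre in scored[:k])
    return votes.most_common(1)[0][0]


# Toy data invented for this sketch only.
TOY_TRAIN = [
    ("a detective hunts a killer in the city", "crime"),
    ("police chase a gang of bank robbers", "crime"),
    ("a killer escapes from the police", "crime"),
    ("two friends fall in love in paris", "romance"),
    ("a wedding brings old lovers together", "romance"),
    ("she finds love after a broken heart", "romance"),
]

if __name__ == "__main__":
    print(knn_predict(TOY_TRAIN, "the police hunt a killer"))
```

Raw term counts keep the example dependency-free. A real pipeline would more likely use weighted features such as TF-IDF and tune `k` on held-out data.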
The file report.pdf gives a more in-depth description of the work done.
Extra tests that are mentioned in the report, but whose output images did not make it into the report, are available here.
António Jotta - 99893
António Morais - 102643
Rúben Nobre - 99321