Information Retrieval exam repository.
Author: Lorenzo Basile
spam_detection.ipynb
contains the commented implementation of a web spam classifier and of a spam-robust version of PageRank algorithm, both based on the link structure of the web graph.spam_detection.py
contains the customized functions developed for this project.- Folder
data
contains the dataset WEBSPAM-UK2006. - Folder
papers
contains the key reference papers.