Skip to content

lorenzobasile/webspam_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web spam detection through link-based features

Information Retrieval exam repository.

Author: Lorenzo Basile

Structure of the repository

  • spam_detection.ipynb contains the commented implementation of a web spam classifier and of a spam-robust version of PageRank algorithm, both based on the link structure of the web graph.
  • spam_detection.py contains the customized functions developed for this project.
  • Folder data contains the dataset WEBSPAM-UK2006.
  • Folder papers contains the key reference papers.

About

Project for Information Retrieval exam

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published