Skip to content

francocco99/Spoiler_Detecion_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spoiler Detection Project

This is a project for the Human Language Technology 2023/2024 course. This project focuses on detecting spoiler reviews on IMDb. The dataset is taken from Kaggle and consists of two files:

  • Movie dataset: Contains information about the movies reviewed by the users, such as the movie plot synopsis.
  • Review dataset: Contains the text of the reviews and other metadata.

Implementation

The idea is to use different language models to classify the reviews. We start from a baseline using simple models like Logistic Regression and Naive Bayes, and gradually move to more complex models like LLama3. A more detailed description of the work is in the PDF This project is being made in group:

About

Human Language Project 2023/2024

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published