Predicting IMDb Ratings with Linear Regression

Description

This repository contains a working model to predict IMDb ratings for a movie using features available prior to the movie's release. The model uses linear regression and features obtained through scraping movie information from IMDb using BeautifulSoup.

Features and Target Variables

Target Variable: IMDb Rating
Features: Runtime, MPAA Rating, Genre, Director, Writer, Stars, Production Company, Release Month, Years Since Release

Data Used

Scraped over 8,000 IMDb pages to collect movie data.

Tools Used

Beautiful Soup for web scraping
Linear regression
Ridge regression
Polynomial regression
Supervised Machine Learning
Feature Engineering & Selection
Numpy
Pandas
Seaborn
Matplotlib

Potential Impact

This model is a good basis for producers or movie enthusiasts to understand what rating a movie will get after it is released based on variables that can all be determined prior to the release of the movie. Below is a visual illustrating the results from the analysis, highlighting what features had the largest effect on the model.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
data		data
helper_functions		helper_functions
images		images
.gitignore		.gitignore
README.md		README.md
imdb_rating_presentation.pdf		imdb_rating_presentation.pdf
imdb_rating_proj.ipynb		imdb_rating_proj.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting IMDb Ratings with Linear Regression

Contents

Description

Features and Target Variables

Data Used

Tools Used

Potential Impact

About

Releases

Packages

Languages

josephpcowell/cowell_proj_2

Folders and files

Latest commit

History

Repository files navigation

Predicting IMDb Ratings with Linear Regression

Contents

Description

Features and Target Variables

Data Used

Tools Used

Potential Impact

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages