How to use posters analyser

Code Explanation

The code is split into 5 notebooks. Each has its own responsibility, but they are not fully independent: most of them load dataframes saved by an earlier notebook. The notebooks are:

posters_fetcher
duplicate_remover
actor_fetcher
face_recognition_ethnicity
analysis

Below is a quick explanation of each notebook. They need to be run in the order listed above.

Posters_fetcher - Posters fetching

Download metadata from IMDb
Download poster images
Build movie_posters dataframe
Save tmdb_data

Duplicate_remover - Duplicates removal

Load movie_posters
Calculate a hash value for each image
Assign a “dup” value of True or False to each poster
Save to posters_with_dup dataframe
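
The duplicate-flagging steps above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: it assumes an exact byte-level hash (SHA-256), whereas the notebook may use a perceptual image hash instead, and the function name flag_duplicates and the sample file names are hypothetical.

```python
import hashlib


def flag_duplicates(images):
    """Return {name: dup}, where dup is True for every repeat of an earlier image.

    `images` maps a poster name to its raw bytes. Dicts keep insertion order,
    so the first poster seen with a given hash is the one that is kept.
    """
    seen = set()
    flags = {}
    for name, data in images.items():
        # Assumption: exact-content hashing; swap in a perceptual hash if
        # near-identical (resized or re-encoded) posters should also match.
        digest = hashlib.sha256(data).hexdigest()
        flags[name] = digest in seen
        seen.add(digest)
    return flags


# Toy stand-ins for poster files (hypothetical names and contents).
posters = {
    "poster_1.jpg": b"poster-bytes-A",
    "poster_2.jpg": b"poster-bytes-B",
    "poster_3.jpg": b"poster-bytes-A",  # same bytes as poster_1.jpg
}
dup_flags = flag_duplicates(posters)
print(dup_flags)
```

The resulting True/False values correspond to the “dup” column described above; only the later copy of a repeated image is flagged, so one instance of each poster remains usable.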

Actor_fetcher - Actors fetching

Load movie IDs from posters_with_dup
Save movie and actor metadata
Save actor images
Create the actors dataframe and save it as actors_df

Face_recognition_ethnicity - Face & Ethnicity recognition

Load posters_with_dup dataframe
Perform face detection on the posters and save as posters_face_encodings
Perform face encoding on the posters and save as posters_face_encodings
Load actors_df dataframe
Perform face detection & encoding on the actor images and save as actors_face_encodings
Recognize the actors who appear in the posters and save as match_poster_actor_cast_all
Use the FairFace 4-race model to predict each actor's ethnicity
Add ethnicity information to the posters dataframe and save as posters_new_races4_cast_all
Create a ranking dataframe that records each actor's position in the cast list, saved as ranking_posters_new_races4_cast_all
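
The recognition and ranking steps above can be illustrated with a small sketch. It is an assumption that faces are matched by cosine similarity between encodings; the function names, the 0.5 threshold, and the 3-dimensional toy vectors are all hypothetical and chosen only to keep the example readable (real face encodings are much longer).

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_face(face_encoding, cast_encodings, threshold=0.5):
    """Return (actor, cast_position) for the most similar cast member, or None.

    `cast_encodings` is a list of (actor_name, encoding) pairs in cast-list
    order, so the 1-based index doubles as the actor's cast position.
    `threshold` is a hypothetical cut-off, not a value taken from the notebook.
    """
    best = None
    best_score = threshold
    for position, (actor, encoding) in enumerate(cast_encodings, start=1):
        score = cosine_similarity(face_encoding, encoding)
        if score > best_score:
            best, best_score = (actor, position), score
    return best


# Toy cast list with made-up encodings.
cast = [
    ("Actor A", [1.0, 0.0, 0.0]),
    ("Actor B", [0.0, 1.0, 0.0]),
]
print(match_face([0.1, 0.9, 0.0], cast))  # closest to Actor B, cast position 2
print(match_face([0.0, 0.0, 1.0], cast))  # no cast member above the threshold
```

Returning the cast position together with the match keeps the information needed for a ranking table in one place; a face that resembles no cast member closely enough is left unmatched rather than forced onto the nearest actor.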

Analysis

Data preparation
Graph creation, separated by titles

Demo

There is a demo you can run to test the code. We uploaded the data it needs and the dataframes you should expect to get.

Stages of the demo:

In the posters_fetcher notebook, use sample_aids. The demo is the default, so comment out those lines for a full run.
In the duplicate_remover notebook, load the posters sample zip and the previously obtained dataframes.
In the actor_fetcher notebook, use the previously obtained dataframes.
In the face_recognition_ethnicity notebook, load the actors zip, movies zip, posters zip and the w600k_r50.onnx model, and use the previously obtained dataframes.
In the analysis notebook, use the previously obtained dataframes.

