GitHub - Zhilin123/similar_movie_characters: Find out movie characters that are most similar to the ones you love

Identifying similar movie characters

Ever had the feeling that a character in a movie feels pretty much the same as another character in another movie? Wouldn't it be amazing if we could find more of our favorite character roles, understand what these recurring themes are and perhaps enjoy our binge-watching a little more?

This code does exactly that.

More interestingly, we might also be able to relate our own experiences to those of movie characters. (Code to be released)

Data

In this task, we try to predict movie characters that are most likely from a common trope (theme in cinematic speak) as another character.

prepare_dataset/ : Shows how concise descriptions of characters in tropes are downloaded from allthetropes.org and post-processed. A prepared version of the dataset is available here.
get_candidate_embeddings.py: Demonstrates how paragraph embeddings can be obtained based on these descriptions of movie characters.
generate_candidates_for_refinement.py: Approximately identifies candidates are likely to be similar to a movie character (using their paragraph-level text embeddings)
generate_cosine_like_grid_for_siamesebert.py: Helper script (to generate_candidates_for_refinement.py) to generate candidates using the SiameseBERT model.
ccm_training.py: Trains a Character Comparison Model to more precisely determine whether two movie characters are similar.
siamesebert_training.py: Training a baseline model in identifying similarity between movie characters
siamesebert_model.py: The model architecture of the baseline model (trained in siamesebert_training.py)
eval/compare_overlap_with_exhaustive_comparison.py: Determine which approach in selecting candidates overlaps most with using CCM to exhaustive compare all possible character-pairs(for a tiny number of characters).
eval/evaluat*: Uses automated metrics (Recall @ k, normalized Discounted Cumulative Gain @ k and Mean Reciprocal Rank) to compare the performance between baseline models and our Select-and-Refine models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Identifying similar movie characters

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
eval		eval
prepare_dataset		prepare_dataset
candidate_refinement.py		candidate_refinement.py
ccm_training.py		ccm_training.py
generate_candidates_for_refinement.py		generate_candidates_for_refinement.py
generate_cosine_like_grid_for_siamesebert.py		generate_cosine_like_grid_for_siamesebert.py
get_candidate_embeddings.py		get_candidate_embeddings.py
readme.md		readme.md
requirements.txt		requirements.txt
siamesebert_model.py		siamesebert_model.py
siamesebert_training.py		siamesebert_training.py

Zhilin123/similar_movie_characters

Folders and files

Latest commit

History

Repository files navigation

Identifying similar movie characters

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages