In this project, we use unsupervised learning models to cluster synopses of Top 100 Greatest Movies of All Time from IMDB into different groups, visualize the results and identify their latent topics/structures.
A list of the 100 movies and their synopses from IMDB and Wikipedia are in data.
Data processing and modeling are shown in Document_Clustering.ipynb.