Final project for the Computer Vision course 2024
Downloading and downsampling the data.
Preprocessing data: converting the annotation file URLs into an image dataset constisting of youtube video thumbnails.
Image resize: Resizing all the thumbnail images to a unifrom 320x320.
Dataset cleaning: filtered only relevant attributes from the csv file. Reduced the thumbnail dataset samples to match the filtered annotations.
Linear Classifier: implemented LogisticRegression model using sklearn.
Nerual Network: implemented basic and complex NN model using Tensorflow.
CNN: implemented CNN models with and without regularisation. Also implemented pretrained VGG16 and resnet models (experimented with regularisation and finetuning).
Increasing the dataset size: the models were run on the larger dataset.
https://drive.google.com/drive/folders/1LLrRdI3bqBoqQ5Wul8DM9DGFgPbSsEKv?usp=sharing