This repository contains project & exercises files for Udacity's Computer Vision Nanodegree program which I enrolled on 10 August, 2019.
In this project, I build a facial keypoint detection system. The system consists of a face detector that uses Haar Cascades and a Convolutional Neural Network (CNN) that predict the facial keypoints in the detected faces. The facial keypoint detection system takes in any image with faces and predicts the location of 68 distinguishing keypoints on each face.
Some of my output from my Facial Keypoint Detection system:
NaimishNet
ResNet18(w Transfer learning)
Green points: Ground Truth
Purple points: Predicted points by my Model
Demo gifs on Facial Keypoint detection on videos
In this project, I design and train a CNN-RNN (Convolutional Neural Network - Recurrent Neural Network) model for automatically generating image captions. The network is trained on the Microsoft Common Objects in COntext (MS COCO) dataset. The image captioning model is displayed below.
Some output from my Model after training 4 epochs: