The model was trained on the Flickr30k image dataset.
Click here to download the dataset.
- A pretrained ResNet50 (ImageNet weights) is used to extract image features.
- Pretrained GloVe embeddings are used in the model's Embedding layer. Download from here.
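
The two pretrained components above could be wired up roughly as follows. This is a minimal sketch, not the repository's actual code: it assumes a Keras/TensorFlow setup, and the helper names (`load_glove`, `build_embedding_matrix`, `build_feature_extractor`), the `word_index` mapping, and the tiny in-memory sample are all hypothetical. The real GloVe file and vocabulary would come from the downloads mentioned above.

```python
import io


def load_glove(lines):
    """Parse GloVe's plain-text format: one 'word v1 v2 ... vd' entry per line."""
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(word_index, vectors, dim):
    """Row i holds the vector for the word with index i.

    Index 0 (assumed to be reserved for padding) and words missing
    from GloVe keep all-zero rows.
    """
    matrix = [[0.0] * dim for _ in range(len(word_index) + 1)]
    for word, idx in word_index.items():
        vec = vectors.get(word)
        if vec is not None and len(vec) == dim:
            matrix[idx] = vec
    return matrix


def build_feature_extractor():
    """Assumed Keras setup: ResNet50 without its classifier head.

    With pooling="avg" the model outputs one 2048-dimensional vector per image.
    TensorFlow is imported lazily so the GloVe helpers work without it.
    """
    from tensorflow.keras.applications import ResNet50
    return ResNet50(weights="imagenet", include_top=False, pooling="avg")


# Tiny in-memory stand-in for a GloVe file (hypothetical 3-d vectors).
sample = io.StringIO("dog 0.1 0.2 0.3\ncat 0.4 0.5 0.6\n")
glove = load_glove(sample)

# Hypothetical tokenizer vocabulary; "zebra" is absent from the sample vectors.
word_index = {"dog": 1, "cat": 2, "zebra": 3}
embedding_matrix = build_embedding_matrix(word_index, glove, dim=3)
```

In a Keras model, a matrix like `embedding_matrix` would typically be converted to a NumPy array and used to initialise the Embedding layer's weights, often with the layer frozen so the pretrained vectors are not updated during training.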