This is a work in progress
Implementation of the paper "Show and Tell: A Neural Image Caption Generator" (https://arxiv.org/pdf/1411.4555.pdf) by Oriol Vinyals et al.
MSCOCO dataset - http://cocodataset.org/#download
Flickr30k capions - http://web.engr.illinois.edu/~bplumme2/Flickr30kEntities/
Flickr30k cpations - https://github.com/eriche2016/image_caption_with_semantic_attenion/blob/master/flickr30k-caption/annotations/captions_flickr30k.json
Flickr8k dataset - http://nlp.cs.illinois.edu/HockenmaierGroup/Framing_Image_Description/KCCA.html