This API is used to load Visual StoryTelling Dataset (VIST). The dataset currently contains Description-in-Isolation (DII) and Story-in-Sequence (SIS) annotations.
# Change OUT_PATH to your preference directory, note the dataset is of big size ~300GB.
python download.py
# Download Description-in-Isolation dataset
wget http://visionandlanguage.net/VIST/json_files/description-in-isolation/DII-with-labels.tar.gz
# Download Story-in-Sequence dataset
wget http://visionandlanguage.net/VIST/json_files/story-in-sequence/SIS-with-labels.tar.gz
The "vist.py" is able to load both DII and SIS datasets.
# locate your vist_directory, which contains images and annotations
vist_dir = '/playpen/data/vist'
# SIS instance
sis = vist.Story_in_Sequence(vist_dir)
# DII instance
dii = vist.Description_in_Isolation(vist_dir)