The following repository contains the code for a flask app that writes captions for Images. Two image to text models have been implemented BILP and ViT-GPT2.
Make sure you have the following libraries to run the code:
- pytorch
- transformers
- Pillow
- Flask
- requests
To run the code, just run the app.py file.