Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
-
Updated
May 23, 2023 - Python
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
An end-to-end multimodal framework incorporating explicit knowledge graphs and OOD-detection. (NeurIPS23)
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
Violet is a Python-based library designed for generating Arabic image captions. The pipeline leverages state-of-the-art transformer models, providing an easy-to-use interface for researchers and developers working on tasks such as image captioning and visual question answering (VQA).
Add a description, image, and links to the okvqa topic page so that developers can more easily learn about it.
To associate your repository with the okvqa topic, visit your repo's landing page and select "manage topics."