Skip to content
@GT-Vision-Lab

Georgia Tech Visual Intelligence Lab

Popular repositories Loading

  1. VQA_LSTM_CNN VQA_LSTM_CNN Public

    Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.

    Lua 379 133

  2. VQA VQA Public

    Python 374 142

  3. abstract_scenes_v002 abstract_scenes_v002 Public

    The second version of the interface for Abstract Scenes research project.

    JavaScript 22 18

  4. GuessWhich GuessWhich Public

    Evaluating Visual Conversational Agents via Cooperative Human-AI Games

    Lua 21 6

  5. vision_language_in_the_wild vision_language_in_the_wild Public

    Python 5 1

  6. vqa_browser vqa_browser Public

    The VQA dataset browser back-end code, using nginx, Django, an PostgreSQL (running in Docker containers).

    Python 4 5

Repositories

Showing 9 of 9 repositories
  • GuessWhich Public

    Evaluating Visual Conversational Agents via Cooperative Human-AI Games

    Lua 21 6 0 7 Updated Nov 22, 2022
  • abstract_scenes_v002 Public

    The second version of the interface for Abstract Scenes research project.

    JavaScript 22 18 0 1 Updated May 16, 2022
  • VQA-Website Public

    Visual Question Answering Website

    HTML 4 1 0 0 Updated Aug 19, 2021
  • VQA Public
    Python 374 142 9 4 Updated Mar 11, 2021
  • VQA_LSTM_CNN Public

    Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.

    Lua 379 133 13 1 Updated Mar 22, 2019
  • Python 5 1 0 0 Updated Apr 26, 2018
  • MATLAB 2 2 0 0 Updated Mar 8, 2017
  • torch-utilities Public

    Utility functions for neural network implementations in Torch

    Lua 2 2 0 0 Updated Feb 16, 2016
  • vqa_browser Public

    The VQA dataset browser back-end code, using nginx, Django, an PostgreSQL (running in Docker containers).

    Python 4 5 0 0 Updated Feb 16, 2016