Classification evaluation for LLaVA #4
Comments
Hi, thanks for asking. We demonstrate zero-shot classification only for the CLIP models on their own, and consider LLaVA and OpenFlamingo for captioning/VQA tasks.
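For context, zero-shot classification with a CLIP model follows the usual recipe: embed the image and one text prompt per class, then pick the class with the highest cosine similarity. A rough sketch with open_clip (model choice, prompts, and class names are placeholders, not the exact evaluation setup):

```python
import torch
from PIL import Image
import open_clip

# Placeholder backbone/weights; any open_clip model works the same way
model, _, preprocess = open_clip.create_model_and_transforms("ViT-B-32", pretrained="openai")
tokenizer = open_clip.get_tokenizer("ViT-B-32")

class_names = ["cat", "dog", "car"]  # hypothetical classes
texts = tokenizer([f"a photo of a {c}" for c in class_names])
image = preprocess(Image.open("example.jpg")).unsqueeze(0)  # hypothetical image path

with torch.no_grad():
    img_feat = model.encode_image(image)
    txt_feat = model.encode_text(texts)
    # Normalize and compare image embedding against each class prompt
    img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    txt_feat = txt_feat / txt_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_feat @ txt_feat.T).softmax(dim=-1)

print(class_names[probs.argmax(dim=-1).item()])
```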
Thank you for the clarification. I have another question: why is the batch size hardcoded to 1? Is it just to avoid padding text tokens, or am I missing something?
You're right, it should definitely be possible to run with larger batch sizes. It's just hardcoded to batch_size 1 in a few places, since we couldn't fit much more on our devices anyway for adversarial evaluations.
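For anyone who wants to lift that restriction, batching variable-length prompts mainly requires a pad token and left padding, so that generation continues from the end of each prompt. A rough sketch in HuggingFace terms (the model name and prompts are placeholders, not the repository's actual code):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder text-only model, just to show the padding mechanics
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Decoder-only models should be left-padded for batched generation
tokenizer.padding_side = "left"
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

prompts = ["Describe the image.", "What is the main object in the picture?"]
batch = tokenizer(prompts, return_tensors="pt", padding=True)

with torch.no_grad():
    out = model.generate(
        **batch,
        max_new_tokens=32,
        pad_token_id=tokenizer.pad_token_id,
    )

print(tokenizer.batch_decode(out, skip_special_tokens=True))
```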
Hi, thank you so much for clarifying everything. Just one last question: does the code use beam search to generate the outputs?
No problem :) We basically stick to how the models are evaluated in their respective papers, so greedy decoding without beam search for LLaVA, and beam search with 3 beams for OpenFlamingo.
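In HuggingFace `generate` terms, the two decoding setups look roughly like this (a small text-only placeholder model stands in for LLaVA/OpenFlamingo here):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("A photo of", return_tensors="pt")

with torch.no_grad():
    # Greedy decoding (LLaVA-style): deterministic, single beam
    greedy = model.generate(**inputs, do_sample=False, num_beams=1, max_new_tokens=20)
    # Beam search with 3 beams (OpenFlamingo-style)
    beams = model.generate(**inputs, do_sample=False, num_beams=3, max_new_tokens=20)

print(tokenizer.decode(greedy[0], skip_special_tokens=True))
print(tokenizer.decode(beams[0], skip_special_tokens=True))
```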
Hi, currently, the code throws a NotImplementedError for LLaVA, but I believe the paper demonstrates zero-shot classification on LLaVA. When will the code be updated to include this feature? Alternatively, could you point out the main parts that would need significant changes to incorporate LLaVA?
Thank you.