Seems like it's compatible. I just tried running the llava-v1.6-mistral-7b.Q5_K_M.gguf model with the LLaVA 1.5 example code, and it has been working fine so far.
Edit: I also gave a 34B model a try (using the Q4_K_M quants from the Ollama models library) and got a segfault; that needs investigation.
I tried the 34B version. At first the model produced a lot of hallucinations at the end of its output, such as strange comments and emojis, and sometimes it appended extra chat turns between the system and the user.
I worked around it by constraining the model's output: I ask in the prompt for a special token at the end of the answer, and use the "stop" parameter so generation stops once that token appears.
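A minimal sketch of that workaround, assuming a hypothetical sentinel string `<END>` (the actual token is whatever you instruct the model to emit) and showing where llama-cpp-python's `stop` parameter would apply:

```python
# Stop-token workaround: ask the model to end its answer with a sentinel,
# then cut the output off at that sentinel so trailing hallucinated chat
# turns are discarded. "<END>" is an illustrative choice, not a fixed token.
SENTINEL = "<END>"

def build_prompt(question: str) -> str:
    # Instruct the model to finish with the sentinel.
    return (
        f"USER: {question}\n"
        f"ASSISTANT (end your answer with {SENTINEL}):"
    )

def truncate_at_sentinel(text: str, sentinel: str = SENTINEL) -> str:
    # Keep only the text before the sentinel, dropping anything after it.
    head, _, _ = text.partition(sentinel)
    return head.rstrip()

# With llama-cpp-python, the same cutoff can be done by the library itself
# via the `stop` parameter (paths/params below are placeholders):
#   llm = Llama(model_path="llava-v1.6-34b.Q4_K_M.gguf")
#   out = llm(build_prompt("Describe the image."), stop=[SENTINEL])
```

Trimming client-side (as in `truncate_at_sentinel`) is a safety net in case the backend returns the sentinel along with extra text.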
What I'm still unsure about is whether the correct LLaVA 1.6 prompt structure is being used, and whether LLaVA 1.6's new technique of splitting images into grids is actually exploited.
Hello,
I was wondering whether this is compatible with LLaVA 1.6 34B. If so, how can I implement it? Should I use the same code as for LLaVA 1.5?