Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLAVA v1.6 #1204

Closed
enrico07 opened this issue Feb 21, 2024 · 2 comments · Fixed by #1147
Closed

LLAVA v1.6 #1204

enrico07 opened this issue Feb 21, 2024 · 2 comments · Fixed by #1147
Labels
question Further information is requested

Comments

@enrico07
Copy link

enrico07 commented Feb 21, 2024

Hello,

I was wondering if it is compatible with LLAVA v1.6 34B. If it is so, how can I implement it? Should I use the same code of LLAVA v1.5?

@abetlen abetlen added the question Further information is requested label Feb 26, 2024
@Robzz
Copy link

Robzz commented Mar 17, 2024

Seems like it's compatible, I just tried running the llava-v1.6-mistral-7b.Q5_K_M.gguf model using the LLaVa 1.5 example code and it's been working fine so far.

Edit: I gave a 34B model a try (using the Q4_K_M quants from the ollama models library) a try and got a segfault, needs investigation.

@enrico07
Copy link
Author

I tried to use the 34B version. What I got at the beginning is that the model had a lot of hallucinations at the end like strange comments, emojis and also sometimes it appends the chats between the system and the user.

I solved it by controlling the output of the model by asking in the prompt to add a special token at the end and using the "stop" parameter to stop writing after that token.

My doubt is that if the right structure of LLAVA-v1.6 prompt is used and if the new technique of LLAVA-v1.6 of splitting the images into grids is exploited.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants