Convert to GGUF format to work with llama.cpp? #32
Comments
Hi, thank you for your suggestion. I will add compatibility with community tools to my to-do list.
GGUF format would be good for Ollama users. Any update?
It would be nice to have this model in GGUF format on Ollama.
Any updates on this? The 4B InternVL model is killer for its size! Would love to see it supported with llama.cpp.
Would love internvl-chat-v1-5 in GGUF format!
I second this.
@ErfeiCui why did you close this as completed?
Any update on this? InternVL2-Llama3-76B on Ollama/llama.cpp would be amazing!
If someone gives me a tutorial, I will write my own code to transform this from PyTorch to GGUF for llama.cpp myself.
It's more involved than that: you have to implement the model architecture and the image preprocessing logic in llama.cpp itself, which is written in C++. Writing the GGUF file is the easy half.
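To illustrate that "easy half": here is a minimal sketch of exporting a PyTorch checkpoint's tensors to a GGUF file with the `gguf` Python package that ships with llama.cpp (`pip install gguf`). The architecture string, file paths, and hyperparameter values below are illustrative assumptions, not a real InternVL mapping; llama.cpp has no InternVL architecture, so a working port would still need new C++ graph-building and image-preprocessing code on the llama.cpp side.

```python
# Hedged sketch: dump a PyTorch state dict into a GGUF file via the `gguf`
# package from the llama.cpp repo. Not a working InternVL converter.
import gguf
import torch

# Assumed checkpoint path; real InternVL checkpoints are sharded safetensors.
state_dict = torch.load("pytorch_model.bin", map_location="cpu")

writer = gguf.GGUFWriter("internvl.gguf", arch="llama")  # placeholder arch
writer.add_name("InternVL (sketch)")
writer.add_context_length(4096)    # assumed values -- a real script would
writer.add_embedding_length(4096)  # read these from the model's config.json
writer.add_block_count(32)
writer.add_head_count(32)

for name, tensor in state_dict.items():
    # Real converters also rename tensors to llama.cpp's scheme, e.g.
    # "model.layers.0.self_attn.q_proj.weight" -> "blk.0.attn_q.weight".
    # Original names are kept here purely for illustration.
    writer.add_tensor(name, tensor.to(torch.float32).numpy())

writer.write_header_to_file()
writer.write_kv_data_to_file()
writer.write_tensors_to_file()
writer.close()
```

Even with a valid GGUF file, llama.cpp would refuse to load it until the InternVL vision tower and its tile-based preprocessing are implemented in C++, the way LLaVA's were in clip.cpp.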
LLaVA has various quantized models in GGUF format, so it can be used with llama.cpp:
ggerganov/llama.cpp#3436
Is this possible for InternVL as well?
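For reference, this is roughly how those existing LLaVA GGUF builds are consumed today through the llama-cpp-python bindings; one would hope an InternVL port could expose something similar. The model and image paths are placeholders for whatever quantized model and mmproj pair you have downloaded.

```python
# Sketch: run an existing LLaVA GGUF build via llama-cpp-python.
# File paths below are placeholders, not shipped artifacts.
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

chat_handler = Llava15ChatHandler(clip_model_path="mmproj-model-f16.gguf")
llm = Llama(
    model_path="llava-v1.5-7b.Q4_K_M.gguf",
    chat_handler=chat_handler,
    n_ctx=2048,  # increased context to accommodate the image embedding tokens
)
response = llm.create_chat_completion(
    messages=[
        {"role": "user", "content": [
            {"type": "image_url", "image_url": {"url": "file:///path/to/image.jpg"}},
            {"type": "text", "text": "Describe this image."},
        ]},
    ]
)
print(response["choices"][0]["message"]["content"])
```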