
Local run instructions please #14

Open
ramkumarkoppu opened this issue Dec 2, 2024 · 2 comments

Comments

@ramkumarkoppu

Hi,

Can you please provide instructions to run this model locally on a Linux/Mac computer?

@XuGW-Kevin
Collaborator

Hi, to run this model locally on a Linux/Mac computer, a good way would be to convert the model to GGUF and run it with ollama.
However, llama.cpp does not currently support converting this model.
It would be very hard for me to implement that support myself, but the community seems to be working on it.
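
For reference, once a GGUF build does become available and is registered with ollama (via a Modelfile and `ollama create`), querying it from Python would look roughly like the sketch below. This is a generic ollama-client example, not something specific to this repository; the model name `local-vlm` and the image path are placeholders.

```python
# Rough sketch only: assumes a GGUF build has already been created and
# registered with ollama, e.g. `ollama create local-vlm -f Modelfile`.
# "local-vlm" and "example.jpg" are placeholders, not official names.
import ollama

response = ollama.chat(
    model="local-vlm",  # hypothetical name of the locally registered model
    messages=[
        {
            "role": "user",
            "content": "Describe this image step by step.",
            "images": ["example.jpg"],  # local image file for the vision model
        }
    ],
)
print(response["message"]["content"])
```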

@XuGW-Kevin
Collaborator

XuGW-Kevin commented Dec 2, 2024

However, you can try running a quantized version of the model locally.
I found that the unslothai repository supports quick quantization for models in the llama-3.2-vision series, though I haven't tried it myself. I also found a demo here where you can run the model on a single T4 GPU.
If you're willing to give it a try and run into any issues, I'd be happy to work through them together. Thank you for your interest!
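
If you go the unsloth route, the inference side looks roughly like the sketch below. It is adapted from unsloth's public Llama-3.2-Vision examples rather than from this repository, so the checkpoint name and image path are placeholders for whatever quantized checkpoint you end up using.

```python
# Rough sketch based on unsloth's public Llama-3.2-Vision examples.
# The checkpoint name and image path are placeholders, not official releases.
from unsloth import FastVisionModel
from PIL import Image

model, tokenizer = FastVisionModel.from_pretrained(
    "unsloth/Llama-3.2-11B-Vision-Instruct",  # example 11B vision checkpoint
    load_in_4bit=True,  # 4-bit quantization so it fits on a single T4
)
FastVisionModel.for_inference(model)  # switch the model into inference mode

image = Image.open("example.jpg")  # any local test image
messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "Describe this image step by step."},
    ]},
]
input_text = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
inputs = tokenizer(image, input_text, add_special_tokens=False, return_tensors="pt").to("cuda")

output = model.generate(**inputs, max_new_tokens=256, use_cache=True)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```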
