Support for InternVL #6803
Comments
Would be great |
I am working on a few projects right now, but if I get a chance I will try to get support in (assuming it doesn't already work). I would also like to get moondream support in |
+1 |
fwiw moondream support was merged in #6899, haven't had a chance to look at/try internvl |
I would really like to get InternVL support in llama.cpp. I have tested the demo extensively and it is really good, so much so that I feel like it is a game changer in many ways. But running it on consumer hardware is not possible right now. As noted here: InternLM/lmdeploy#1501 (comment), the architecture is InternViT-6B-448px-V1-5 + MLP + InternLM2-Chat-20B. Is it possible to convert the weights to GGUF to allow for multi-GPU splitting, or for splitting layers between CPU RAM and VRAM? Adding support for InternVL 1.5 would also (probably) make it easier to support future versions when they come out. |
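For a rough sense of scale behind the comment above, here is a back-of-the-envelope sketch of the weight memory for the two large components. The parameter counts are assumptions read off the model names (about 6B for InternViT-6B, about 20B for InternLM2-Chat-20B), and the bit widths are illustrative rather than specific GGUF quantization formats. The MLP projector, KV cache, and activations are ignored.

```python
# Rough weight-memory estimate. Parameter counts are approximations taken
# from the model names, not measured values (assumption).
PARAMS = {"InternViT-6B": 6e9, "InternLM2-Chat-20B": 20e9}


def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Memory for the weights alone, in GiB (no KV cache or activations)."""
    return n_params * bits_per_weight / 8 / 2**30


def total_gib(bits_per_weight: float) -> float:
    """Combined weight memory of both components at one uniform bit width."""
    return sum(weight_gib(n, bits_per_weight) for n in PARAMS.values())


if __name__ == "__main__":
    # Illustrative uniform bit widths; real quantized files mix precisions.
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: ~{total_gib(bits):.1f} GiB")
```

Under these assumptions the weights alone come to roughly 48 GiB at 16 bits and roughly 12 GiB at 4 bits, which is why quantization plus splitting layers across devices matters for consumer hardware.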
@cjpais Hello, may I ask what is the progress of internvl support now? We are looking forward to using it on llama.cpp. |
Hey I am quite busy with a few projects, it's on my list but just not very high priority at the moment. It's really only something I can do in my spare/free time |
Thank you for your reply. Thank you for your hard work. Looking forward to your future work. |
Which one would be better to focus on: CogVLM or InternVL? I wish there were more resources and interest for language vision models in the llama.cpp community. llama.cpp is the only hope for running newer language vision models on Apple Silicon. Especially since the flash attention Python library is not available for Apple Silicon, you can't even run inference using Torch with MPS support. :( |
Please, InternVL. In my tests it works better than CogVLM, especially for stuff like receipts and documents. |
InternVL is quite good. Benchmarks, HF, Demo. |
how about now? any update? |
upvote for this |
This would be great! |
Any status on this? This is currently the highest-performing vision LLM according to users' tests on the LocalLLaMA subreddit. |
Any updates? |
I tested the now-available InternVL2 model and it is indeed a great choice. I hope it can be given a higher priority. Thank you for your hard work. |
InternVL2 would be great to have! Seems to be SOTA in open source vision LLMs |
Any thoughts on this? |
This would be a gamechanger! @cjpais |
I'm sorry I don't know when I can do this, I have a huge backlog of projects I'm currently working on! I am very curious to try it but unfortunately it's not very high priority for me right now |
+1 |
I think model builders should contribute their vision model work here. |
In an ideal situation, it's the model builder's work! So I really hope a llama.cpp contributor can support this model; it is really good! |
I think the devs can add their own branches to the llama.cpp repo or huggingface.co? |
If they want it to be popular and used by many, that would be the case. LMDeploy is full of buffer overflow crashes and is not recommended for any secure deployment. |
They have closed my issue for now. Guess this is never on their roadmap. |
It was reopened and they stated they are actively working towards "progressing the work." |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
So, llama.cpp is not gonna focus on VLMs, I guess. |
Now I have made some progress. Please consider reopening this issue. |
Any way we can extract a confidence score? |
New InternVL-Chat-V1.5 just came out, and the quality is really great, and the benchmark score is pretty high too. Possibly best open source vision language model yet?
Can we have llama.cpp support it? @cmp-nct, @cjpais, @danbev, @monatis, have any of you tried it?
Demo: https://internvl.opengvlab.com/