Support for InternVL #6803

Closed
chigkim opened this issue Apr 21, 2024 · 33 comments
Labels
enhancement New feature or request stale

Comments

@chigkim

chigkim commented Apr 21, 2024

The new InternVL-Chat-V1.5 just came out. The quality is really great, and its benchmark scores are pretty high too. Possibly the best open-source vision language model yet?

Can llama.cpp support it? @cmp-nct, @cjpais, @danbev, @monatis, have any of you tried it?

Demo: https://internvl.opengvlab.com/

@chigkim chigkim added the enhancement New feature or request label Apr 21, 2024
@paryska99

Would be great

@cjpais
Contributor

cjpais commented Apr 23, 2024

I am working on a few projects right now, but if I get a chance I will try to get support in (assuming it doesn't already work). I would also like to get moondream support in.

@2132660698

+1

@cjpais
Contributor

cjpais commented Apr 27, 2024

FWIW, moondream support was merged in #6899; I haven't had a chance to look at/try InternVL.

@sapere-aude-incipe

I would really like to get InternVL support in llama.cpp.

I have tested the demo extensively and it is really good, so much so that I feel like it is a game changer in many ways. But running it on consumer hardware is not possible right now.

As noted here: InternLM/lmdeploy#1501 (comment)

> architecture: InternViT-6B-448px-V1-5 + MLP + InternLM2-Chat-20B
> I am afraid it cannot fit into A10 (24G) even though LLM weights are quantized into 4 bits.

Is it possible to convert the weights to GGUF to allow for multi-GPU splitting, or for splitting layers between CPU RAM and VRAM? Adding support for InternVL 1.5 would also (probably) make it easier to support future versions when they come out.
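
(For context: both of those are standard for GGUF models in llama.cpp; partial offload is controlled with --n-gpu-layers and multi-GPU splitting with --tensor-split. A rough sketch of what that could look like, assuming hypothetical InternVL support; every file name below is a placeholder, not a real artifact:)

```sh
# Hypothetical invocation -- InternVL is not supported yet; the model and
# projector file names are placeholders for illustration only.

# Offload 20 transformer layers to VRAM and keep the rest in CPU RAM:
./llama-llava-cli -m internvl-chat-v1.5-q4_k_m.gguf \
    --mmproj internvl-mmproj-f16.gguf \
    --image receipt.jpg -p "Describe this image." \
    --n-gpu-layers 20

# Split fully offloaded layers across two GPUs in a 3:1 ratio:
./llama-llava-cli -m internvl-chat-v1.5-q4_k_m.gguf \
    --mmproj internvl-mmproj-f16.gguf \
    --image receipt.jpg -p "Describe this image." \
    --n-gpu-layers 99 --tensor-split 3,1
```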

@Single430

@cjpais Hello, may I ask what the progress of InternVL support is now? We are looking forward to using it in llama.cpp.

@cjpais
Contributor

cjpais commented May 22, 2024

Hey, I am quite busy with a few projects; it's on my list but just not very high priority at the moment. It's really only something I can do in my spare/free time.

@Single430

> Hey, I am quite busy with a few projects; it's on my list but just not very high priority at the moment. It's really only something I can do in my spare/free time.

Thank you for your reply, and thank you for your hard work. Looking forward to your future work.

@chigkim
Author

chigkim commented Jun 5, 2024

Which one would be better to focus on: CogVLM or InternVL?

I wish there were more resources/interest for vision language models in the llama.cpp community. Llama.cpp is the only hope for running newer vision language models on Apple Silicon. Especially since the flash attention Python library is not available for Apple Silicon, you can't even run inference using Torch with MPS support. :(

@opisaac9001

> Which one would be better to focus on: CogVLM or InternVL?
>
> I wish there were more resources/interest for vision language models in the llama.cpp community. Llama.cpp is the only hope for running newer vision language models on Apple Silicon. Especially since the flash attention Python library is not available for Apple Silicon, you can't even run inference using Torch with MPS support. :(

Please do InternVL. In my tests it works better than CogVLM, especially for stuff like receipts and documents.

@fzzylogic

fzzylogic commented Jun 9, 2024

InternVL is quite good. Benchmarks, HF, Demo.

@DoiiarX

DoiiarX commented Jun 17, 2024

How about now? Any update?

@James4Ever0

upvote for this

@fzzylogic

fzzylogic commented Jul 6, 2024

InternLM-XComposer-2.5-7b is out now, and having only tested the image capabilities, it seems great. HF, Demo.

@KOG-Nisse

This would be great!

@v3ss0n

v3ss0n commented Jul 9, 2024

Any status on this? This is currently the highest-performing vision LLM according to users' tests on the LocalLLaMA subreddit.

@suncloudsmoon

Any updates?

@CNEA-lw

CNEA-lw commented Jul 25, 2024

> Hey, I am quite busy with a few projects; it's on my list but just not very high priority at the moment. It's really only something I can do in my spare/free time.

I tested the now-available InternVL2 model and it is indeed a great choice. I hope you can give it a higher priority. Thank you for your hard work.

@goto-loop

InternVL2 would be great to have! Seems to be SOTA in open source vision LLMs

@v3ss0n

v3ss0n commented Jul 29, 2024

Any thoughts on this?
Since vision models vary a lot compared to LLM models, do the maintainers think llama.cpp should focus on supporting them? There are already a lot of LLM models coming out, and the core team is doing tremendous work on those already. Does the core team feel VLMs should be supported outside of the llama.cpp project?
Maybe an addon/extension architecture is viable?

@Backendmagier

This would be a gamechanger! @cjpais

@cjpais
Contributor

cjpais commented Aug 5, 2024

I'm sorry, I don't know when I can do this; I have a huge backlog of projects I'm currently working on! I am very curious to try it, but unfortunately it's not very high priority for me right now.

@nogifeet

> InternVL2 would be great to have! Seems to be SOTA in open source vision LLMs

+1

@v3ss0n

v3ss0n commented Aug 20, 2024

I think model builders should contribute their vision model work here.

@felixslu

> I think model builders should contribute their vision model work here.

In an ideal situation, it's the model builder's work! But sadly, their work may not focus on on-device inference, or they may have their own self-deployed server framework, such as LMDeploy.

So, I really hope a llama.cpp contributor can support this model; it is really good!

@ZhongQiyu

I think the devs can add their own branches to the llama.cpp repo or huggingface.co?
Version 2.5 of InternVL also got released. I can give the port a try as a helper if needed.

@v3ss0n

v3ss0n commented Sep 11, 2024

> I think model builders should contribute their vision model work here.

> In an ideal situation, it's the model builder's work! But sadly, their work may not focus on on-device inference, or they may have their own self-deployed server framework, such as LMDeploy.

If they want to be popular and used by many, that would be the case.

LMDeploy is full of buffer-overflow crashes; it's not recommended for any secure deployment.

@James4Ever0

James4Ever0 commented Sep 25, 2024

They have closed my issue for now. Guess this is never on their roadmap.

OpenGVLab/InternVL#522

@rampageservices

> They have closed my issue for now. Guess this is never on their roadmap.
>
> OpenGVLab/InternVL#522

It was reopened and they stated they are actively working towards "progressing the work."

@github-actions github-actions bot added the stale label Nov 2, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.

@v3ss0n

v3ss0n commented Nov 20, 2024

So, llama.cpp is not gonna focus on VLMs, I guess.

@James4Ever0

James4Ever0 commented Jan 14, 2025

Now I have made some progress.

#9403 qlylangyu#1

Please consider reopening this issue.

@rccrdmr

rccrdmr commented Feb 5, 2025

Is there any way we can extract a confidence score?
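
(For what it's worth, llama.cpp's server can already return per-token probabilities, which can be aggregated into a rough confidence signal; this is generic llama.cpp functionality, not InternVL-specific. A minimal sketch using the /completion endpoint's n_probs option; the model path, port, and prompt below are placeholders:)

```sh
# Start the server with any supported GGUF model (placeholder path):
#   ./llama-server -m model.gguf --port 8080

# Request the top-3 token probabilities alongside each generated token.
# The "completion_probabilities" field in the response can then be
# averaged (or min-pooled) into a rough confidence score.
curl http://localhost:8080/completion -d '{
  "prompt": "What is the total on this receipt?",
  "n_predict": 32,
  "n_probs": 3
}'
```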
