Consider bringing VLM into RagFlow project? #1725

simoncai519 · 2024-07-26T05:23:42Z

During generation, a VLM could answer questions directly from the pictures stored in the knowledge base, instead of from text parsed out of those images. Has this been considered?

yingfeng · 2024-07-26T06:02:29Z

Maintainer

Hi, VLM support has actually been integrated already. A range of multimodal models is available, including OpenAI, OpenRouter, Tongyi-Qianwen, Gemini, and StepFun, and you can choose which one to use. These models convert the images into text.
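
To make the image-to-text idea concrete, here is a minimal sketch. It is not RAGFlow's actual code: `Chunk`, `images_to_chunks`, and `fake_vlm` are hypothetical names used only for illustration, and the model call is a stub standing in for whichever multimodal provider you pick.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Chunk:
    """Indexable text plus a pointer back to the image it came from (hypothetical type)."""
    text: str
    source_image: str


def images_to_chunks(
    image_paths: Iterable[str],
    describe_image: Callable[[str], str],
) -> List[Chunk]:
    """Turn each image into a text chunk using a caller-supplied model wrapper."""
    chunks = []
    for path in image_paths:
        description = describe_image(path).strip()
        if description:  # skip images for which the model returned nothing useful
            chunks.append(Chunk(text=description, source_image=path))
    return chunks


def fake_vlm(path: str) -> str:
    # Stand-in for a real multimodal model call (assumption, for illustration only).
    return f"A description of {path}"


chunks = images_to_chunks(["chart.png", "diagram.png"], fake_vlm)
```

In this arrangement the model runs when the image is ingested, so retrieval and generation later work on the generated text rather than on the picture itself, which is the distinction the original question is asking about.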
