Consider bring VLM into RagFlow project? #1725
simoncai519
started this conversation in
Ideas
Replies: 0 comments 1 reply
-
Hi, actually VLM has already been integrated already. There are a bag of multi modal models including openai, openrouter, tongyi-qianwen, gemini, stepfun, and you can decide which one to use. These models will convert the images into text. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In generation, VLM can answer questions based on picture stored in knowledge base, instead of parsed text from image. Has this been considered?
Beta Was this translation helpful? Give feedback.
All reactions