Add support for arbitary image keys for multi modal queries #74

reinoldus · 2025-02-02T10:55:06Z

Currently the only way to add images is to put them in the "images" key of the InputSchema to send them to the model.
Further in the current implementation we can only have 1 text description for the image, but for my use case the image is just a small part of a larger analysis, so I think it would be better to be able to add the full payload like in the normal text queries instead of silently swallowing the rest.

Not sure if I misunderstand something w.r.t multi-modality, but I don't see any other way for my use case.

Caveats

I had to edit one test for all the tests to pass, because before just the image description was sent to the model not the full dumped json like in the "text only" queries.

TODO:
~~Found the dev-guide, will fix the issues~~

reinoldus added 3 commits February 2, 2025 12:47

Add support for arbitary image keys for multi modal queries

c26ea9e

Updating formatting

0956cf3

Fixing formatting for one more test

6b140e7

KennyVaneetvelde approved these changes Feb 2, 2025

View reviewed changes

KennyVaneetvelde merged commit 5697bdf into BrainBlend-AI:main Feb 2, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for arbitary image keys for multi modal queries #74

Add support for arbitary image keys for multi modal queries #74

reinoldus commented Feb 2, 2025 •

edited

Loading

Add support for arbitary image keys for multi modal queries #74

Add support for arbitary image keys for multi modal queries #74

Conversation

reinoldus commented Feb 2, 2025 • edited Loading

reinoldus commented Feb 2, 2025 •

edited

Loading