Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for arbitary image keys for multi modal queries #74

Conversation

reinoldus
Copy link
Contributor

@reinoldus reinoldus commented Feb 2, 2025

Currently the only way to add images is to put them in the "images" key of the InputSchema to send them to the model.
Further in the current implementation we can only have 1 text description for the image, but for my use case the image is just a small part of a larger analysis, so I think it would be better to be able to add the full payload like in the normal text queries instead of silently swallowing the rest.

Not sure if I misunderstand something w.r.t multi-modality, but I don't see any other way for my use case.

Caveats

I had to edit one test for all the tests to pass, because before just the image description was sent to the model not the full dumped json like in the "text only" queries.

TODO:
Found the dev-guide, will fix the issues

@KennyVaneetvelde KennyVaneetvelde merged commit 5697bdf into BrainBlend-AI:main Feb 2, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants