Add multi-modality features #6

jorgeantonio21 · 2024-10-15T07:49:26Z

Feature Request

Currently, we only support text based models for the chats. The following models are the most important to integrate atm:

Llama3.2 Vision models.
Flux (text-to-image) models.
Whisper (voice-to-text), or any alternative.

Moreover, each chat session should have a multi-modality feature. For example, if a user interacts with a specific character, it can request it to analyze and give feedback on images, etc.

jorgeantonio21 added the utopia-release-0.2 label Oct 15, 2024

jorgeantonio21 assigned francis2tm Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multi-modality features #6

Add multi-modality features #6

jorgeantonio21 commented Oct 15, 2024 •

edited by francis2tm

Loading

Add multi-modality features #6

Add multi-modality features #6

Comments

jorgeantonio21 commented Oct 15, 2024 • edited by francis2tm Loading

Feature Request

jorgeantonio21 commented Oct 15, 2024 •

edited by francis2tm

Loading