Multimodal Input TextBox #4668

taoari · 2023-06-25T04:43:16Z

I have searched to see if a similar issue already exists.

Is your feature request related to a problem? Please describe.

Multimodal LLMs have become popular. However, for multimodal input, a Gradio app currently has to use separate widgets for images, videos, audio, and files (attachments). The resulting UI is unintuitive; it would be good to have a single multimodal input textbox.

Describe the solution you'd like

Most modern chat apps, e.g. Slack and Teams, have a multimodal input textbox. The screenshot shows the Slack input box; it would be nice to have something similar.

Additional context

It would also be great if gr.Chatbot could be updated accordingly so that it can show text, images, videos, and attachments in a single message. The current version of gr.Chatbot only shows a single modality per message (text or image, but not both). PR467 #4667 is a bug that does show the file. It would also be great if the Chatbot could show 3D models, as there is a gr.Model3D component.

pngwn · 2023-06-25T14:19:16Z

I don't know if we'd want a full rich text input like slack but something like discord or WhatsApp might be nice. The ability to upload text along with various media (images, audio, video).

Cc @dawoodkhan82

dawoodkhan82 · 2023-06-25T14:28:07Z

@pngwn @taoari I've actually thought about this, and I think it's a good idea. Especially as more multimodal projects become popular, it would be good to have a component that supports them. I think making the chatbot a single component (input + output) also makes a lot of sense and would be easier for devs to use. We can explore this in 4.0.

taoari · 2023-06-25T19:07:10Z

@pngwn @dawoodkhan82 It's great to see this is on the wish list, looking forward to it.

dawoodkhan82 · 2023-07-10T14:57:34Z

@pngwn @abidlabs Do you think this should be a new component or a variant of gr.Textbox()?

abidlabs · 2023-07-10T15:34:10Z

The rich textbox should be a separate component, particularly if we want to support files, as that would involve changing the API (we could do a similar tuple format to support files).
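As a rough illustration of the "tuple format" idea, here is a minimal sketch in plain Python. It assumes a chat history made of `[user, bot]` pairs in which a file is represented by a one-element tuple wrapping its path; that convention and the helper name `add_user_turn` are assumptions for illustration, not a confirmed API of the proposed component.

```python
from typing import List, Optional, Tuple, Union

# Assumed message shape: plain text, or a 1-tuple wrapping a file path.
Message = Union[str, Tuple[str], None]
History = List[List[Message]]


def add_user_turn(history: History, text: Optional[str], file_paths: List[str]) -> History:
    """Return a new history with one row per file, then one row for the text.

    The bot side of each new row is left as None until a reply is produced.
    """
    new_history = [list(row) for row in history]  # copy so the input is not mutated
    for path in file_paths:
        new_history.append([(path,), None])  # file encoded as a tuple
    if text:
        new_history.append([text, None])  # text stays a plain string
    return new_history
```

Keeping files as tuples and text as strings lets a consumer tell the two apart with a simple `isinstance` check, which is one reason a separate component with its own value format may be easier than overloading gr.Textbox.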

abidlabs · 2023-07-10T15:34:19Z

Aside: it would be cool if the rich textbox could support text color so that we could address this feature request: #2303

dawoodkhan82 · 2023-07-10T15:51:06Z

@abidlabs we can allow file upload and the text styling features to be turned off for the rich textbox, in case a dev wants only one feature and not the other.

abidlabs · 2023-11-07T00:07:29Z

Hey! We've now made it possible for Gradio users to create their own custom components -- meaning that you can write some Python and JavaScript (Svelte), and publish it as a Gradio component. You can use it in your own Gradio apps, or share it so that anyone can use it in their Gradio apps. Here are some examples of custom Gradio components:

A "Rich Textbox" that allows you to write bold/italics/colored text
A "Folium Map Viewer" component that allows you to use interactive maps

You can see the source code for those components by clicking the "Files" icon and then clicking "src". The complete source code for the backend and frontend is visible. In particular, it's very fast if you want to build off an existing component. We've put together a Guide: https://www.gradio.app/guides/five-minute-guide, and we're happy to help. Hopefully this will help address this issue.

abidlabs · 2024-02-06T07:29:58Z

Closing this issue in favor of: #6976

abidlabs · 2024-03-19T20:21:45Z

Just FYI @taoari we now support a gr.MultimodalTextbox component in gradio
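For anyone landing here, a minimal sketch of how the component might be used. It assumes the submitted value is a dict with a `"text"` string and a `"files"` list, where each file entry is either a path string or a dict with a `"path"` key depending on the Gradio version; please check the docs for your installed release. The function names `summarize_message` and `build_demo` are hypothetical.

```python
from typing import Any, Dict


def summarize_message(message: Dict[str, Any]) -> str:
    """Describe a multimodal message of the assumed form {"text": str, "files": [...]}."""
    text = message.get("text") or ""
    files = message.get("files") or []
    # Assumption: entries are path strings, or dicts carrying a "path" key.
    paths = [f["path"] if isinstance(f, dict) else str(f) for f in files]
    if not paths:
        return f"Text only: {text!r}"
    return f"Text: {text!r} with {len(paths)} file(s): {', '.join(paths)}"


def build_demo():
    """Wire the handler to gr.MultimodalTextbox; requires gradio to be installed."""
    import gradio as gr  # imported lazily so the handler works without gradio

    return gr.Interface(
        fn=summarize_message,
        inputs=gr.MultimodalTextbox(file_types=["image", "audio", "video"]),
        outputs="text",
    )
```

Calling `build_demo().launch()` should start a small app with a single input box that accepts both text and attachments.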

dawoodkhan82 self-assigned this Jun 25, 2023

abidlabs added the enhancement (New feature or request) and new component (Involves creating a new component) labels Jun 26, 2023

dawoodkhan82 mentioned this issue Jul 5, 2023

Expand gr.Chatbot() functionality + Refactor Tracking Issue #4800

Closed

12 tasks

abidlabs mentioned this issue Jul 5, 2023

Simpler Chatbot Component / API #3510

Closed

abidlabs added this to the Component Cleanup milestone Jul 9, 2023

abidlabs mentioned this issue Jul 10, 2023

Create a custom component that adds color and formatting to the text field #2303

Closed

dawoodkhan82 mentioned this issue Jul 28, 2023

RichTextbox component #5032

Closed

hannahblair assigned dawoodkhan82 and unassigned dawoodkhan82 Jul 31, 2023

abidlabs removed this from the Component Cleanup milestone Sep 4, 2023

abidlabs removed the enhancement New feature or request label Dec 5, 2023

abidlabs closed this as completed Feb 6, 2024
