Implement img2text widget #290

mishig25 · 2022-09-02T08:19:50Z

Implement Text-to-Image widget

huggingface/transformers#18821 (comment)

input (identical to ImageClassificationWidget):

const requestBody = { file }; // img file

output (identical to TextGenerationWidget):

Array<{ generated_text: string; }>

example:

[
  {
    "generated_text": "Some text"
  }
]

Note:

In transformers implementation, I see that the pipeline is called image-to-text-generation, while in the hub, we already defined it as image-to-text without the generation part. Please let me know if it is an issue: @Narsil @osanseviero

todos:

test when api-inference is up @Narsil
document widget input sample

osanseviero · 2022-09-02T08:57:17Z

Related to

In transformers implementation, I see that the pipeline is called image-to-text-generation, while in the hub, we already defined it as image-to-text without the generation part. Please let me know if it is an issue:

This should take care of it huggingface/transformers#18864

Narsil · 2022-09-02T09:50:28Z

LGTM. I don't think we chose List[{"generated_text": str}] for the output but {"generated_text": str} only (I think).

I think being aligned with what we already have is better.

@OlivierDehaene In general, unfortunately since the pipelines were created at different times by different people originally, the exact output types are not super normalized, especially on the List, List of List stuff. This is something I would like to change on v5 (whenever we decide to go for it and I don't think we have plans) so that the pipeline code could be more regular.

For text-generation it can really generate multiple texts (it's controlled by num_return_sequences) and I think enabling the list here also enables more uses cases (even for captioning you might be interested to generate several at once and choose the best).

However, I don't think we should by highly strict in v4 since every pipeline is ever so slightly different than it's neighbor.

osanseviero · 2023-11-22T20:58:28Z

Closing as this now lives in https://github.com/huggingface/hub-docs.

Implement ing2text widget

aa88b2c

mishig25 requested review from Narsil, NielsRogge and OlivierDehaene September 2, 2022 08:40

mishig25 changed the title ~~Implement ing2text widget~~ Implement img2text widget Sep 2, 2022

osanseviero closed this Nov 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement img2text widget #290

Implement img2text widget #290

mishig25 commented Sep 2, 2022 •

edited

Loading

osanseviero commented Sep 2, 2022

Narsil commented Sep 2, 2022

osanseviero commented Nov 22, 2023

Implement img2text widget #290

Implement img2text widget #290

Conversation

mishig25 commented Sep 2, 2022 • edited Loading

Implement Text-to-Image widget

Note:

osanseviero commented Sep 2, 2022

Narsil commented Sep 2, 2022

osanseviero commented Nov 22, 2023

mishig25 commented Sep 2, 2022 •

edited

Loading