[Tasks] Mismatch between input specs and task pipelines expected generation #923

Closed
hanouticelina opened this issue Sep 23, 2024 · 12 comments · Fixed by #930
Labels: bug (Something isn't working), tasks (@huggingface/tasks related)

Comments

@hanouticelina
Contributor

Description

There's a mismatch between the task input specs and the generation parameters expected by the task pipelines for the automatic-speech-recognition and image-to-text tasks, and possibly others. Specifically, the input schema defines a parameter named generate for the generation parameters, but the _sanitize_parameters() method implemented in these task pipelines (e.g. AutomaticSpeechRecognitionPipeline._sanitize_parameters()) expects a parameter named generate_kwargs instead.

This mismatch causes errors when calling the Inference API with a payload that follows the published input specs.
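
To make the mismatch concrete, here is a simplified stand-in for the pipeline's keyword handling (a sketch only, not the actual transformers source): it accepts generate_kwargs but rejects the spec-compliant generate key.

# Simplified sketch of the pipeline's parameter contract (not transformers code):
# only `generate_kwargs` is a recognized keyword, so the spec's `generate` fails.
def _sanitize_parameters(return_timestamps=None, generate_kwargs=None):
    preprocess_params, forward_params = {}, {}
    if return_timestamps is not None:
        preprocess_params["return_timestamps"] = return_timestamps
    if generate_kwargs is not None:
        forward_params.update(generate_kwargs)
    return preprocess_params, forward_params, {}

try:
    # spec-compliant key, as declared in the task input schema
    _sanitize_parameters(return_timestamps=True, generate={"max_new_tokens": 100})
except TypeError as e:
    print(e)  # ... got an unexpected keyword argument 'generate'

# key the pipeline actually understands
print(_sanitize_parameters(return_timestamps=True, generate_kwargs={"max_new_tokens": 100}))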

Reproduction

If we follow the input specs defined here for automatic-speech-recognition:

import base64
import os

import requests

API_URL = "https://api-inference.huggingface.co/models/openai/whisper-large-v3"
headers = {"Authorization": f"Bearer {os.environ.get('HF_TOKEN')}"}

with open("sample-3.flac", "rb") as f:
    data = f.read()
payload = {
    "inputs": base64.b64encode(data).decode("utf-8"),
    "parameters": {
        "return_timestamps": True,
        # nested field name as defined in the task input spec
        "generate": {
            "max_new_tokens": 100,
            "temperature": 0.7,
        },
    },
}

response = requests.post(
    API_URL,
    headers=headers,
    json=payload,
)
print(response.json())

This returns an error:

{'error': 'unknown error', 'warnings': ["There was an inference error: unknown error: AutomaticSpeechRecognitionPipeline._sanitize_parameters() got an unexpected keyword argument 'generate'"]}

When replacing generate with generate_kwargs:

import base64
import os

import requests

API_URL = "https://api-inference.huggingface.co/models/openai/whisper-large-v3"
headers = {"Authorization": f"Bearer {os.environ.get('HF_TOKEN')}"}

with open("sample-3.flac", "rb") as f:
    data = f.read()
payload = {
    "inputs": base64.b64encode(data).decode("utf-8"),
    "parameters": {
        "return_timestamps": True,
        # keyword actually expected by the pipeline's _sanitize_parameters()
        "generate_kwargs": {
            "max_new_tokens": 100,
            "temperature": 0.7,
        },
    },
}

response = requests.post(
    API_URL,
    headers=headers,
    json=payload,
)
print(response.json())

No error in the output:

{'text': ' Thank you.', 'chunks': [{'timestamp': [0.0, 1.0], 'text': ' Thank you.'}]}

We have the same issue for the image-to-text task (see the sketch below).
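
A minimal sketch of the equivalent image-to-text request; the model name, input file, and payload shape here are illustrative assumptions rather than values taken from the spec. As with ASR, the spec names the nested field generate while the pipeline only recognizes generate_kwargs.

import base64
import os

import requests

# Example model and file, chosen for illustration only.
API_URL = "https://api-inference.huggingface.co/models/Salesforce/blip-image-captioning-base"
headers = {"Authorization": f"Bearer {os.environ.get('HF_TOKEN')}"}

with open("sample.jpg", "rb") as f:
    image = f.read()
payload = {
    "inputs": base64.b64encode(image).decode("utf-8"),
    "parameters": {
        # The spec declares this field as `generate`, but the pipeline
        # only accepts `generate_kwargs`, mirroring the ASR case above.
        "generate_kwargs": {
            "max_new_tokens": 20,
        },
    },
}

response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())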

I wanted to open this issue before proposing a PR, to get your opinion on how to better align the task input specs with the task pipelines.

hanouticelina added the bug (Something isn't working) label on Sep 23, 2024
coyotte508 added the tasks (@huggingface/tasks related) label on Sep 23, 2024
@coyotte508
Member

To be clear, this is a report about the JSON-schema definitions of tasks in @huggingface/tasks, i.e. the specs in https://github.com/huggingface/huggingface.js/tree/main/packages/tasks/src/tasks/automatic-speech-recognition/spec and the equivalent directories for other tasks.

cc @SBrandeis

@SBrandeis
Contributor

cc @Wauplin as well

@Wauplin
Contributor

Wauplin commented Sep 24, 2024

Thanks for flagging, @hanouticelina! This is quite annoying indeed. I personally don't like the idea of having an API parameter called generate_kwargs (which feels Python-centric).

@ArthurZucker @LysandreJik is there a world where transformers pipelines could accept either generate or generate_parameters as kwargs instead of generate_kwargs? Since these are used by the Inference API and Inference Endpoints, it would be nice to rename them.

@ArthurZucker

generate_parameters does not look bad, generate is meaningless so no!

@Wauplin
Contributor

Wauplin commented Sep 27, 2024

So generate_parameters it is then :)

@Wauplin
Contributor

Wauplin commented Sep 27, 2024

cc @Rocketknight1, who started working on officially integrating the specs into transformers pipelines in huggingface/transformers#33730.

@hanouticelina let's update the specs to generate_parameters and ship it in huggingface_hub. Then transformers will be able to make the change.

@hanouticelina
Contributor Author

Perfect! I will take care of updating the specs now :)

@pcuenca
Member

pcuenca commented Sep 27, 2024

generate_parameters does not look bad, generate is meaningless so no!

Big +1

Or generation_parameters, if we are moving away from the Python-centric generate_kwargs.

@LysandreJik
Member

Agree with @pcuenca that generation_parameters sounds better: generate_parameters makes it sound like these are the parameters passed to the generate() method in transformers (or to the /generate path of an API).

generation_parameters seems like the most agnostic option.

@pcuenca
Member

pcuenca commented Sep 27, 2024

Exactly my thinking, sorry for not explaining. Thanks @LysandreJik!

@hanouticelina
Contributor Author

+1 @pcuenca. It also aligns with the naming used in the schema here.

@Wauplin
Contributor

Wauplin commented Sep 27, 2024

Even better! Agree on generation_parameters :)

hanouticelina added a commit that referenced this issue Sep 30, 2024
Fixes #923 

This PR updates the task specs to rename the `generate` property to `generation_parameters`. This change aligns with the discussion in the issue.

Key changes:
- Renamed `generate` to `generation_parameters` in the specs for the `automatic-speech-recognition`, `image-to-text`, `text-to-audio` and `text-to-speech` tasks (an example payload with the new name is sketched below).
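
For illustration, a payload following the updated spec would look like the sketch below. This is not a tested request: as discussed above, transformers pipelines still expect generate_kwargs, so the Inference API will only accept the new name once transformers makes the corresponding change.

# `data` holds the raw audio bytes, as in the reproduction above.
payload = {
    "inputs": base64.b64encode(data).decode("utf-8"),
    "parameters": {
        "return_timestamps": True,
        # renamed from `generate` in the task input spec
        "generation_parameters": {
            "max_new_tokens": 100,
            "temperature": 0.7,
        },
    },
}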