[hf inference] ASR remote inference model parser impl #1020

Ankush-lastmile · 2024-01-25T00:45:39Z

[hf inference] ASR remote inference model parser impl

Implementation of the HuggingFaceAutomaticSpeechRecognition Model parser using the inference endpoint to run inference. Python API takes in bytes as well as path, skip binary for now.

Very similar to #1018

Testplan

Temporarily add model parser to Gradio Cookbook model parser registry.

    asr = HuggingFaceAutomaticSpeechRecognitionRemoteInference()
    AIConfigRuntime.register_model_parser(
        asr, asr.id()
    )

run AIConfig Edit on Gradio example

python3 -m 'aiconfig.scripts.aiconfig_cli' edit --aiconfig-path=cookbooks/Gradio/huggingface.aiconfig.json --parsers-module-path=cookbooks/Gradio/hf_model_parsers.py --server-mode=debug_servers

rholinshead

Some minor changes, mostly from copy/paste

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

rholinshead · 2024-01-25T15:39:52Z

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

+ if len(inputs) > 1:
+ raise ValueError(
+ f"Multiple audio inputs are not supported for the HF Automatic Speech Recognition Inference api. Please specify a single audio input attachment."
+ )


Ah, ok, that's what I was thinking. Instead of doing this, we should just make the validate_and_retrieve function return a single value, not array

updated. refactpred validate_and_retrieve returns a single value, not an array

rholinshead · 2024-01-25T15:41:09Z

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

+
+ # HuggingFace Automatic Speech Recognition outputs should only ever be string
+ # format so shouldn't get here, but just being safe
+ return json.dumps(output_data, indent=2)


Based on your comment on the image_2_text one, maybe this should raise a ValueError instead?

Yep, updated. Had previously copy pasted what you had for consistency.

Ankush-lastmile · 2024-01-25T17:38:47Z

Fixed a couple of nits on comments
Renamed any references of image to audio
Refactored validate_and_retrieve_audio_from_attachments to also validate only one audio attachment

Testplan,

Same as in original pr description, output had the same output so omitting the screenshot

rholinshead · 2024-01-25T17:47:27Z

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

+ f"Attachment has no mime type. Specify the audio mimetype in the aiconfig"
+ )
+
+ if not attachment.mime_type.startswith("audio/"):


Should be "audio" without trailing slash since we default to just "audio" and this would invalidate that. Alternatively, could default to "audio/*" but not sure if that will work the same

ah I see what you mean. Technically this doesn't break anything yet but will be needed.

Updated, thanks for catching

rholinshead

Accepting to unblock, but please update the mimetype validation before landing

Implementation of the HuggingFaceAutomaticSpeechRecognition Model parser using the inference endpoint to run inference. Python API takes in bytes as well as path, skip binary for now. Very similar to #1018 ## Testplan <img width="1000" alt="Screenshot 2024-01-24 at 10 37 05 PM" src="https://github.com/lastmile-ai/aiconfig/assets/141073967/808956ce-e3be-4528-9f34-c8d31d704ddb"> 1. Temporarily add model parser to Gradio Cookbook model parser registry. ``` asr = HuggingFaceAutomaticSpeechRecognitionRemoteInference() AIConfigRuntime.register_model_parser( asr, asr.id() ) ``` 2. run AIConfig Edit on Gradio example `python3 -m 'aiconfig.scripts.aiconfig_cli' edit --aiconfig-path=cookbooks/Gradio/huggingface.aiconfig.json --parsers-module-path=cookbooks/Gradio/hf_model_parsers.py --server-mode=debug_servers`

Ankush-lastmile · 2024-01-25T17:56:26Z

update the mimetype validation to check audio, not audio/

Ankush-lastmile mentioned this pull request Jan 25, 2024

[hf inference] Prompt Schema ASR #1021

Merged

Ankush-lastmile force-pushed the pr1020 branch from a91c116 to 4d6e47c Compare January 25, 2024 03:38

Ankush-lastmile changed the title ~~[hf inference] ASR~~ [hf inference] ASR remote inference model parser impl Jan 25, 2024

Ankush-lastmile force-pushed the pr1020 branch from 4d6e47c to 7574133 Compare January 25, 2024 03:49

Ankush-lastmile marked this pull request as ready for review January 25, 2024 03:56

Ankush-lastmile requested review from saqadri, rholinshead, suyoglastmileai, jonathanlastmileai and rossdanlm as code owners January 25, 2024 03:56

Ankush-lastmile force-pushed the pr1020 branch from 7574133 to 3a8d37c Compare January 25, 2024 03:57

rholinshead requested changes Jan 25, 2024

View reviewed changes

Ankush-lastmile force-pushed the pr1020 branch from 3a8d37c to a0f6f12 Compare January 25, 2024 17:38

Ankush-lastmile force-pushed the pr1020 branch from a0f6f12 to a6dfbdb Compare January 25, 2024 17:42

rholinshead reviewed Jan 25, 2024

View reviewed changes

rholinshead approved these changes Jan 25, 2024

View reviewed changes

Ankush-lastmile force-pushed the pr1020 branch from a6dfbdb to 02e43fd Compare January 25, 2024 17:55

Ankush-lastmile merged commit 50ac544 into main Jan 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[hf inference] ASR remote inference model parser impl #1020

[hf inference] ASR remote inference model parser impl #1020

Ankush-lastmile commented Jan 25, 2024 •

edited

Loading

rholinshead left a comment

rholinshead Jan 25, 2024

Ankush-lastmile Jan 25, 2024

rholinshead Jan 25, 2024

Ankush-lastmile Jan 25, 2024

Ankush-lastmile commented Jan 25, 2024

rholinshead Jan 25, 2024

Ankush-lastmile Jan 25, 2024

rholinshead left a comment

Ankush-lastmile commented Jan 25, 2024

[hf inference] ASR remote inference model parser impl #1020

[hf inference] ASR remote inference model parser impl #1020

Conversation

Ankush-lastmile commented Jan 25, 2024 • edited Loading

Testplan

rholinshead left a comment

Choose a reason for hiding this comment

rholinshead Jan 25, 2024

Choose a reason for hiding this comment

Ankush-lastmile Jan 25, 2024

Choose a reason for hiding this comment

rholinshead Jan 25, 2024

Choose a reason for hiding this comment

Ankush-lastmile Jan 25, 2024

Choose a reason for hiding this comment

Ankush-lastmile commented Jan 25, 2024

Testplan,

rholinshead Jan 25, 2024

Choose a reason for hiding this comment

Ankush-lastmile Jan 25, 2024

Choose a reason for hiding this comment

rholinshead left a comment

Choose a reason for hiding this comment

Ankush-lastmile commented Jan 25, 2024

Ankush-lastmile commented Jan 25, 2024 •

edited

Loading