Set Working Default Models in HF Parsers #1221

rholinshead · 2024-02-13T16:12:52Z

Set Working Default Models in HF Parsers

The default model parsers for AST and TTS don't work:

For ASR, openai/whisper-large-v3 just throws 500 errors (even on the model card in HF)
For TTS, suno/bark requires a HF PRO subscription to run

For now, we can just default to the next-best models for those tasks (based on like count) which support remote inference

Testing:

Build the hf extension following its README
Update aiconfig/cookbooks/Gradio/aiconfig_model_registry.py to have the tts and asr remote parsers added (should register all remote ones by default there?)

(aiconfig) ryanholinshead@Ryans-MBP aiconfig % parsers_path=/Users/ryanholinshead/Projects/aiconfig/cookbooks/Gradio/aiconfig_model_registry.py
(aiconfig) ryanholinshead@Ryans-MBP aiconfig % aiconfig edit --server-mode=debug_servers --parsers-module-path=$parsers_path

Ensure the model defaults work:

Note, was getting an error using the travel.aiconfig.json config

(aiconfig) ryanholinshead@Ryans-MBP aiconfig % aiconfig edit --aiconfig-path=$aiconfig_path --server-mode=debug_servers --parsers-module-path=$parsers_path

 File "/Users/ryanholinshead/Projects/aiconfig/python/src/aiconfig/Config.py", line 138, in load_json
    update_model_parser_registry_with_config_runtime(config_runtime)
  File "/Users/ryanholinshead/Projects/aiconfig/python/src/aiconfig/registry.py", line 134, in update_model_parser_registry_with_config_runtime
    retrieved_model_parser = ModelParserRegistry.get_model_parser(
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/ryanholinshead/Projects/aiconfig/python/src/aiconfig/registry.py", line 64, in get_model_parser
    return ModelParserRegistry._parsers[model_id]
           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^
KeyError: 'HuggingFaceImage2TextTransformer'

Stack created with Sapling. Best reviewed with ReviewStack.

saqadri · 2024-02-13T16:35:11Z

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

+ # The default model is openai/whisper-large-v3, which does not work as of
+ # 02/13/2024. Instead, default to a free model (which supports remote
+ # inference) with the next most "likes" in HF
+ # https://huggingface.co/models?pipeline_tag=automatic-speech-recognition&sort=likes


Thanks for documenting

rossdanlm · 2024-02-13T16:39:42Z

...gingFace/python/src/aiconfig_extension_hugging_face/remote_inference_client/text_2_speech.py

@@ -47,6 +47,13 @@ def refine_completion_params(model_settings: dict[Any, Any]) -> dict[str, Any]:
 if key.lower() in supported_keys:
 completion_data[key.lower()] = model_settings[key]

+ # The default model is suno/bark, which requires HF Pro subscription


NIt: I feel we should remove these as default from the input schema? Similar issue we've had where people can't use GPT-4 because that requires paid access

NIt: I feel we should remove these as default from the input schema? Similar issue we've had where people can't use GPT-4 because that requires paid access

Just so I understand, this comment is not relevant after the next PR, right? I just did them as separate PRs to separate concerns

Yea I guess my point is more than this comment "default model is suno/bark" won't apply anymore to this after next PR, but not a blocker

rossdanlm · 2024-02-13T16:40:00Z

.../src/aiconfig_extension_hugging_face/remote_inference_client/automatic_speech_recognition.py

@@ -43,6 +43,13 @@ def refine_completion_params(model_settings: dict[Any, Any]) -> dict[str, Any]:
 if key.lower() in supported_keys:
 completion_data[key.lower()] = model_settings[key]

+ # The default model is openai/whisper-large-v3, which does not work as of


Same comment as below

rossdanlm

couple of nits, but the input schema is independent of the package itself so not a blocker, just a bit of confusion if we decide to update prompt schema and this comment then becomes out of sync

# Update Default Models in HF Prompt Schemas Update the prompt schemas for ASR and TTS remote inference to match the defaults set in #1221 <img width="1479" alt="Screenshot 2024-02-13 at 11 25 38 AM" src="https://github.com/lastmile-ai/aiconfig/assets/5060851/28e48030-0fc3-49d2-8c6b-ad0ea142d4d8"> <img width="1465" alt="Screenshot 2024-02-13 at 11 26 42 AM" src="https://github.com/lastmile-ai/aiconfig/assets/5060851/b5d132da-8777-47e9-95f1-73cf4f25a906"> --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/lastmile-ai/aiconfig/pull/1222). * __->__ #1222 * #1221

# Update Default Models in HF Prompt Schemas Update the prompt schemas for ASR and TTS remote inference to match the defaults set in #1221 <img width="1479" alt="Screenshot 2024-02-13 at 11 25 38 AM" src="https://github.com/lastmile-ai/aiconfig/assets/5060851/28e48030-0fc3-49d2-8c6b-ad0ea142d4d8"> <img width="1465" alt="Screenshot 2024-02-13 at 11 26 42 AM" src="https://github.com/lastmile-ai/aiconfig/assets/5060851/b5d132da-8777-47e9-95f1-73cf4f25a906">

I messed up for 0.0.8 becuase I forgot to rebase. This includs Ryan's changes from #1221

# Add Remote Inference Parsers to Gradio Cookbook Parser Registry Was getting annoyed of manually updating this to test the remote parsers each time. Just copied registration from what we do in Gradio and did a quick check a few worked --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/lastmile-ai/aiconfig/pull/1223). * __->__ #1223 * #1222 * #1221

Update HF extension to 0.0.9 I messed up for 0.0.8 becuase I forgot to rebase. This includs Ryan's changes from #1221

Set Working Default Models in HF Parsers

7d28479

rholinshead mentioned this pull request Feb 13, 2024

Update Default Models in HF Prompt Schemas #1222

Merged

rholinshead marked this pull request as ready for review February 13, 2024 16:31

rholinshead requested review from saqadri, suyoglastmileai, Ankush-lastmile, jonathanlastmileai and rossdanlm as code owners February 13, 2024 16:31

saqadri approved these changes Feb 13, 2024

View reviewed changes

rossdanlm reviewed Feb 13, 2024

View reviewed changes

rossdanlm approved these changes Feb 13, 2024

View reviewed changes

rholinshead merged commit 7d28479 into main Feb 13, 2024

rholinshead mentioned this pull request Feb 13, 2024

Add Remote Inference Parsers to Gradio Cookbook Parser Registry #1223

Merged

rossdanlm pushed a commit that referenced this pull request Feb 13, 2024

Update HF extension to 0.0.9

7262793

I messed up for 0.0.8 becuase I forgot to rebase. This includs Ryan's changes from #1221

rossdanlm mentioned this pull request Feb 13, 2024

Update HF extension to 0.0.9 #1224

Merged

rossdanlm added a commit that referenced this pull request Feb 13, 2024

Update HF extension to 0.0.9 (#1224)

933a04e

Update HF extension to 0.0.9 I messed up for 0.0.8 becuase I forgot to rebase. This includs Ryan's changes from #1221

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set Working Default Models in HF Parsers #1221

Set Working Default Models in HF Parsers #1221

rholinshead commented Feb 13, 2024 •

edited

Loading

saqadri Feb 13, 2024

rossdanlm Feb 13, 2024

rholinshead Feb 13, 2024

rossdanlm Feb 13, 2024

rossdanlm Feb 13, 2024

rossdanlm left a comment

Set Working Default Models in HF Parsers #1221

Set Working Default Models in HF Parsers #1221

Conversation

rholinshead commented Feb 13, 2024 • edited Loading

Set Working Default Models in HF Parsers

Testing:

saqadri Feb 13, 2024

Choose a reason for hiding this comment

rossdanlm Feb 13, 2024

Choose a reason for hiding this comment

rholinshead Feb 13, 2024

Choose a reason for hiding this comment

rossdanlm Feb 13, 2024

Choose a reason for hiding this comment

rossdanlm Feb 13, 2024

Choose a reason for hiding this comment

rossdanlm left a comment

Choose a reason for hiding this comment

rholinshead commented Feb 13, 2024 •

edited

Loading