Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document that users must specify a model kind in the SpeechTestDataCI workflow #71

Open
KatieProchilo opened this issue Jun 12, 2020 · 0 comments
Labels
P2 Nice to have

Comments

@KatieProchilo
Copy link
Contributor

In v3 of Custom Speech, models can be comprised of:

  1. Acoustic models (audio + human-labeled transcript data)
  2. Language models (pronunciation and language data)
  3. Acoustic and Language models (all 3 types of data)

However, in v2 (current) they can only be:

  1. Acoustic models (audio + human-labeled transcript data)
  2. Language models (pronunciation and language data)

The pipeline is already configured to work with Language models, and users won't have to change anything for that to work.

However, if they wanted to switch and work with Acoustic models, within the SpeechTestDataCI workflow, users must set the environment variable CUSTOM_SPEECH_MODEL_KIND to be Acoustic. We already describe to the user how to set this in Advanced customizations.

They also need to set the proper paths to their data.

Solution

We should note this earlier in the solution, probably step 3 when they're looking at their data for the first time. Plus this will not impact their initial baseline run. The first mention of language vs. acoustic models is in the intro to step 3, so here would be a good spot to go even further.

@KatieProchilo KatieProchilo added the P2 Nice to have label Jun 12, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P2 Nice to have
Projects
None yet
Development

No branches or pull requests

1 participant