The accuracy tester assists with evaluating and optimizing the accuracy of ASR and NLU engines.
The utterances are defined in the records.csv file.
The key fields are:
- UTTERANCE: the text to be tested (usign text to speech)
- EXPECTED TRANSCRIPT: (optional) when using an audio URL you can specify here the expected text based on the audio
- VOICEID: (optional) when using an audio URL use this field to categorized the audio file voice
- UTTERANCE TYPE: (optional) you can use this field to categorize the utterance (QUERY, DESTINATION, MEMBER ID, etc)
The platforms to be tested are defined here:
https://github.com/bespoken/accuracy-tester/blob/main/input/accuracy-test.json#L3
This test is currently configured to run with azure
, dialogflow
, and lex
.
Contact Bespoken for a full list of supported platforms and assistance with setting up.
The voices use to generate sample audio are defined in the configuration file here:
https://github.com/bespoken/accuracy-tester/blob/main/input/accuracy-test.json#L35
-
Clone the entire repo
- Run
pnpm install
at the root level (needs keys for private repositories)
- Run
-
Run
pnpm run batch
The github action will run automatically after the records.csv file has been updated, additionally you can start manually by going to the actions page and select the Accuracy Test Runner workflow.
The results are created as CSV file for each GitHub action run.
Additionally, they can be configured to send data to:
- CloudWatch
- DataDog
- Metabase
- MySQL