Processing characters in wizard is very slow #446

roedoejet · 2024-05-30T23:17:05Z

I'm working with a large dataset (111K utterances) and it looks like it will currently take about 1.5 hours to process everything. We should get this quicker.

roedoejet · 2024-05-30T23:19:14Z

The issue is that g2p is slow, so if you say that the input is characters, it will process all the utterances into phones as well and that's what is taking the time. If I lie and say that the input is phones already, it takes 36 seconds

The cache is implemented on a token basis to keep the results identical but maximize reuse potential. Fixes: #446

roedoejet added enhancement New feature or request help wanted Extra attention is needed labels May 30, 2024

joanise self-assigned this Jun 13, 2024

joanise removed the help wanted Extra attention is needed label Jun 13, 2024

joanise added a commit that referenced this issue Jun 13, 2024

perf: speed up g2p processing by caching the results

32a4813

The cache is implemented on a token basis to keep the results identical but maximize reuse potential. Fixes: #446

joanise mentioned this issue Jun 13, 2024

perf: speed up g2p processing by caching the results #464

Merged

joanise closed this as completed in #464 Jun 13, 2024

joanise closed this as completed in 689f324 Jun 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Processing characters in wizard is very slow #446

Processing characters in wizard is very slow #446

roedoejet commented May 30, 2024

roedoejet commented May 30, 2024

Processing characters in wizard is very slow #446

Processing characters in wizard is very slow #446

Comments

roedoejet commented May 30, 2024

roedoejet commented May 30, 2024