Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to improve inference latency performance? #796

Open
twocode opened this issue Oct 22, 2024 · 1 comment
Open

How to improve inference latency performance? #796

twocode opened this issue Oct 22, 2024 · 1 comment
Labels
algorithm Algorithm improvements & issues documentation Improvements or additions to documentation help wanted Extra attention is needed performance Running speed & quality

Comments

@twocode
Copy link

twocode commented Oct 22, 2024

2024-10-22 03:26:36.033 | INFO     | app:generate_audio:73 - Refined text: ['but since [uv_break] 波 卡 [uv_break] like [uv_break] like 里 法, like pocari sweat, [uv_break] the drink. [uv_break], and [uv_break] 东 方 民 族, [uv_break] eastern cultures and peoples, are super different,']
2024-10-22 03:26:36.033 | INFO     | app:generate_audio:78 - Start voice inference.
text:  16%|█▌        | 62/384(max) [00:01, 56.04it/s]
code:  30%|██▉       | 606/2048(max) [00:10, 55.61it/s]
2024-10-22 03:26:48.069 | INFO     | app:generate_audio:91 - Inference completed.

This simple sentence took 12 seconds on Nvidia Tesla T4. Is it correct to assume ChatTTS is not suitable for situations that require low "Time To First Audio(TTFA)"?

@medemi68
Copy link

I second this. I think there is some major improvement needed to get the Time to first audio down pat. I was able to do some optimization personally by reducing the chunk size and setting the stream speed. I was running it on a 4090 which has 80 TFLOPS or so, and I was able to get the inference speed a lot faster. But definitely you would need to use streams to get a fast time to first byte, disable the refine text portion too.

@fumiama fumiama added documentation Improvements or additions to documentation help wanted Extra attention is needed algorithm Algorithm improvements & issues performance Running speed & quality labels Oct 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algorithm Algorithm improvements & issues documentation Improvements or additions to documentation help wanted Extra attention is needed performance Running speed & quality
Projects
None yet
Development

No branches or pull requests

3 participants