support batch processing #57

Merged: 1 commit merged into m-bain:main on Feb 1, 2023
Conversation

TengdaHan
Contributor

What it is: Parallelize the transcribe function. It gives at least a 2x speedup on transcription.
TODO: support temperature sweeping under batch processing.

@m-bain
Owner

m-bain commented Feb 1, 2023

epic thank you tengda

m-bain merged commit 29e95b7 into m-bain:main on Feb 1, 2023
@Barabazs
Contributor

Barabazs commented Feb 2, 2023

Hi @TengdaHan, can you clarify how we should choose an optimal batch size?

It seems that a batch_size of 16 would divide the audio into 16 equally sized chunks. Does that mean those 16 chunks are processed in parallel?

@TengdaHan
Contributor Author

TengdaHan commented Feb 2, 2023

@Barabazs It's the other way around: a batch_size of 16 splits the audio into X batches, each containing 16 audio segments, and each batch is transcribed together in a single forward pass: https://github.com/m-bain/whisperX/blob/main/whisperx/transcribe.py#L415
A general rule is to pick the largest batch_size that still fits in your GPU memory. I suggest starting with 16 or 32.
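
For illustration, here is a minimal sketch of the batching scheme described above. The function name and the `transcribe_batch` callable are hypothetical stand-ins, not whisperX's actual API; the real logic lives in `whisperx/transcribe.py`.

```python
# Minimal sketch (assumed names, not whisperX's actual code): group audio
# segments into consecutive batches of `batch_size` and transcribe each
# batch in one call, instead of one segment at a time.
from typing import Callable, List


def batched_transcribe(
    segments: List[str],
    transcribe_batch: Callable[[List[str]], List[str]],
    batch_size: int = 16,
) -> List[str]:
    """Split `segments` into batches of up to `batch_size` items and
    run each batch through `transcribe_batch` together."""
    results: List[str] = []
    for i in range(0, len(segments), batch_size):
        batch = segments[i:i + batch_size]       # up to `batch_size` segments
        results.extend(transcribe_batch(batch))  # one batched forward pass
    return results


# Toy usage: 40 segments with batch_size=16 -> 3 batches (16, 16, 8).
if __name__ == "__main__":
    dummy = lambda batch: [f"text for {s}" for s in batch]
    out = batched_transcribe([f"seg{i}" for i in range(40)], dummy, batch_size=16)
    print(len(out))  # 40
```

So increasing batch_size does not change how the audio is cut; it only changes how many segments share each forward pass, which is why GPU memory is the limiting factor.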
