Hey!
The HuggingFace text-generation-inference server can automatically batch concurrent HTTP requests: if one request is already being processed when another arrives, it adapts the in-progress batch to include the new one.
I want to build an inference solution based on faster-whisper.
Is manual batching supported? I am not confident enough to implement it safely on my own, but I would like to build on top of it, if possible.
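For reference, here is a minimal sketch of the delta-window pattern I mean, assuming an asyncio server loop; `transcribe_batch`, `MAX_BATCH`, and `DELTA_S` are hypothetical placeholders, not faster-whisper or TGI APIs:

```python
import asyncio

MAX_BATCH = 8    # hypothetical cap on how many requests share a batch
DELTA_S = 0.05   # hypothetical window to wait for additional requests

def transcribe_batch(audios):
    # Hypothetical placeholder: a real implementation would call a batched
    # faster-whisper pipeline here.
    return [f"transcript {i}" for i, _ in enumerate(audios)]

async def submit(queue: asyncio.Queue, audio) -> str:
    # Callers enqueue their audio together with a future and await the result.
    fut = asyncio.get_running_loop().create_future()
    await queue.put({"audio": audio, "future": fut})
    return await fut

async def batching_loop(queue: asyncio.Queue):
    while True:
        # Block until at least one request arrives, then open the delta window.
        batch = [await queue.get()]
        deadline = asyncio.get_running_loop().time() + DELTA_S
        while len(batch) < MAX_BATCH:
            timeout = deadline - asyncio.get_running_loop().time()
            if timeout <= 0:
                break
            try:
                batch.append(await asyncio.wait_for(queue.get(), timeout))
            except asyncio.TimeoutError:
                break
        # One batched inference call; since it blocks, a real server would
        # run it in a thread executor so the event loop stays responsive.
        results = transcribe_batch([r["audio"] for r in batch])
        for req, result in zip(batch, results):
            req["future"].set_result(result)
```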
Faster-whisper performs batching internally across the vad_segments of a single audio file. I would be curious whether batching multiple audio files together yields further speedups.
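If your faster-whisper version ships the `BatchedInferencePipeline` wrapper, that internal batching is exposed directly; a sketch (model name and `batch_size` are example values only):

```python
from faster_whisper import WhisperModel, BatchedInferencePipeline

# Wrap the model in the batched pipeline: it runs VAD, cuts the audio into
# segments, and decodes several segments per forward pass.
model = WhisperModel("large-v3", device="cuda", compute_type="float16")
batched = BatchedInferencePipeline(model=model)

# batch_size controls how many VAD segments are decoded together.
segments, info = batched.transcribe("audio.mp3", batch_size=16)
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
```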
AFAIK, the HuggingFace server uses a delta window: if multiple requests arrive within that window, they are batched and run together; otherwise each request runs on its own.