
support for BatchedInferencePipeline #169

Closed
Eve-146T opened this issue Dec 2, 2024 · 1 comment

Eve-146T commented Dec 2, 2024

faster-whisper recently added a BatchedInferencePipeline in version 1.1.0, allowing up to 4x faster transcription of large files.

Is there any plan to add this to this server?
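For reference, a minimal sketch of how the batched pipeline is used in faster-whisper 1.1.0 (the model size, audio path, and batch size below are just placeholders):

```python
from faster_whisper import WhisperModel, BatchedInferencePipeline

# Load the regular model first, then wrap it in the batched pipeline.
model = WhisperModel("large-v2", device="cuda", compute_type="float16")
batched_model = BatchedInferencePipeline(model=model)

# batch_size controls how many audio chunks are decoded at once;
# "audio.mp3" is a placeholder path.
segments, info = batched_model.transcribe("audio.mp3", batch_size=16)
for segment in segments:
    print(f"[{segment.start:.2f} -> {segment.end:.2f}] {segment.text}")
```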

Here are my test results, on my RTX 3090:

| Model | Batch Size | Processing Time (s) | Total Time (s) | Speedup vs. No Batch |
|---|---|---|---|---|
| Large v2 | 16 | 51.59 | 53.39 | 3.29x |
| Large v2 | 9 | 54.34 | 56.10 | 3.12x |
| Large v2 | 8 | 55.21 | 56.99 | 3.07x |
| Large v2 | 6 | 61.41 | 63.35 | 2.76x |
| Large v2 | No batch | 169.75 | 171.50 | 1.00x |
| Turbo | 16 | 33.37 | 34.65 | 2.48x |
| Turbo | 8 | 33.39 | 34.52 | 2.48x |
| Turbo | No batch | 82.75 | 83.93 | 1.00x |

(This was for a 1-hour audio file; Total Time includes the time it takes to load the model.)

I did some testing with faster-whisper-server, and its speed matches Large v2 with no batching.
With smaller audio files (30 seconds), the speedup was around 24% at batch sizes 8 and 16.

Additionally: could requests from multiple different users be treated as one batch?

fedirz (Owner) commented Dec 4, 2024

  1. Yep, I'll definitely be bumping the faster-whisper version and adding support for batched inference
  2. I like the idea. I'll take a look at how to implement this.
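
As a rough illustration of the second point (not the server's actual implementation), one possible approach is a micro-batching queue that collects requests arriving within a short window and hands them to the model together; the window length, `Request` class, and `process_batch` handler below are all hypothetical names:

```python
import asyncio
from dataclasses import dataclass, field

BATCH_WINDOW_S = 0.1  # hypothetical: how long to wait for more requests to arrive

@dataclass
class Request:
    audio: str  # path or buffer of the user's audio; created inside a running event loop
    future: asyncio.Future = field(default_factory=asyncio.Future)

request_queue: "asyncio.Queue[Request]" = asyncio.Queue()

async def batch_worker(process_batch):
    """Collect requests that arrive close together and run them as one batch."""
    loop = asyncio.get_running_loop()
    while True:
        first = await request_queue.get()
        batch = [first]
        deadline = loop.time() + BATCH_WINDOW_S
        # Keep pulling requests until the batching window closes.
        while (timeout := deadline - loop.time()) > 0:
            try:
                batch.append(await asyncio.wait_for(request_queue.get(), timeout))
            except asyncio.TimeoutError:
                break
        # process_batch is a placeholder for whatever runs the batched model call.
        results = process_batch([req.audio for req in batch])
        for req, result in zip(batch, results):
            req.future.set_result(result)
```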
