Summary
Attaching the results of preliminary performance tests. I selected the same instance type we use in production for recordings (`c6i.2xlarge`) and ran the tests on all the models we include (`tiny`, `base`, `small`) with a base sample of 10 minutes. For the default thread configuration (`NumCPU / 2`) I also ran tests on a full one-hour meeting sample.
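For reference, the default thread count mentioned above could be computed roughly as follows. This is a minimal sketch, not the code from this PR: the `defaultThreads` helper and its floor of one thread are assumptions added for illustration. Only the `NumCPU / 2` rule comes from the description.

```go
package main

import (
	"fmt"
	"runtime"
)

// defaultThreads is a hypothetical helper that halves the CPU count.
// The floor of 1 is an assumption so single-core hosts still get a worker.
func defaultThreads(numCPU int) int {
	if n := numCPU / 2; n > 1 {
		return n
	}
	return 1
}

func main() {
	// runtime.NumCPU reports the logical CPUs usable by the current process.
	fmt.Println(defaultThreads(runtime.NumCPU()))
}
```

A `c6i.2xlarge` exposes 8 vCPUs, so under this rule the default would be 4 threads.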
The call samples were extracted from real developer meetings, so the test should be as close as possible to a real use case, with the caveat that it was a single track. In general, though, I wouldn't expect multiple tracks to cause significant overhead, since speech from different tracks is unlikely to overlap for long periods.
What's likely causing some overhead is the number of speech segments that we get out of these tracks (due to the speech detection process). We can probably tune this further to try and minimize the number of contiguous samples. Right now we are using a value of 2 seconds of silence after which we split.
Overall, the results show that performance scales almost linearly with the number of execution threads.
Please let me know if you have any questions or concerns.
Ticket Link
https://mattermost.atlassian.net/browse/MM-55475