cache background_noise rms data #145

fantasyRqg · 2022-06-17T09:24:44Z

Boost background_noise performance.

Reduce audio decoding and file I/O.
Reduce RMS computation. Note that there may be a difference between rms(partial audio) and rms(full audio).
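
To illustrate the two points above, here is a minimal sketch, not the code from this PR. It assumes a hypothetical `NoiseCache` helper and a made-up `fake_loader` standing in for real audio decoding. It keeps decoded noise and its full-file RMS in memory, and shows that the RMS of a partial slice can differ from the RMS of the full file.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple


def rms(samples: Sequence[float]) -> float:
    """Root mean square of a sequence of samples (0.0 for empty input)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


class NoiseCache:
    """Hypothetical helper: caches decoded samples and their full-file RMS."""

    def __init__(self, loader: Callable[[str], List[float]]):
        self._loader = loader
        self._cache: Dict[str, Tuple[List[float], float]] = {}
        self.load_count = 0  # how many times the loader actually ran

    def get(self, path: str) -> Tuple[List[float], float]:
        if path not in self._cache:
            samples = self._loader(path)  # decode + file I/O happen only here
            self.load_count += 1
            self._cache[path] = (samples, rms(samples))
        return self._cache[path]


def fake_loader(path: str) -> List[float]:
    # Stand-in for real decoding: a quiet first half and a loud second half.
    return [0.1] * 4 + [0.5] * 4


cache = NoiseCache(fake_loader)
samples, full_rms = cache.get("noise.wav")
cache.get("noise.wav")  # second call is served from memory

# RMS of only the first half, as if a partial segment had been cropped out.
partial_rms = rms(samples[:4])
```

In this toy signal the cropped segment is much quieter than the file as a whole, so scaling by the cached full-file RMS would give a different noise level than scaling by the RMS of the segment that is actually mixed in.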

iver56 · 2022-06-20T07:37:47Z

Hi fantasyRqg, and thanks for your PR 😃

Just for context, so I understand the problem you're proposing to solve, I want to ask some questions:

How large is your background noise dataset?
If you are training a model, how many workers do you use for preparing the audio examples that go into the training batches?
How much memory (RAM) is there on the computer where you are doing the training?
What audio file format are your background noise files? And do they have the same sample rate as the "clean" input audios that the noises get added to?
Are you using an SSD or a HDD?

Ideally, a good solution would work well in all kinds of combinations of answers to those questions

fantasyRqg · 2022-06-22T10:25:12Z

How large is your background noise dataset?

About 2k records.

If you are training a model, how many workers do you use for preparing the audio examples that go into the training batches?

Only one worker. I tried multiple workers, but it was not fast enough.

How much memory (RAM) is there on the computer where you are doing the training?

I cached the samples and the noises. The samples took 7 GB and the noises took 1.5 GB.

What audio file format are your background noise files? And do they have the same sample rate as the "clean" input audios that the noises get added to?

I don't think the audio format and sample rate are a problem. The audio: Audio parameter will take care of all of that.

Are you using an SSD or a HDD?

HDD

iver56 · 2022-06-29T07:54:33Z

Thanks for the insight :) Indeed, in your case it makes sense to apply caching like this.

HDD
Not very large dataset - fits in RAM
Single worker

My own use case is quite different, and would actually be best without caching:

SSD
Very large dataset, cannot fit in RAM
Many workers
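
One way to think about serving both use cases is an optional, size-bounded cache. The sketch below is only an illustration under assumptions: `BoundedCache`, `loader`, and `max_items` are hypothetical names, not part of the library. A limit of 0 disables caching entirely, and a positive limit keeps only the most recently used items. Keep in mind that with multiple worker processes, each process would typically hold its own copy of such a cache.

```python
from collections import OrderedDict
from typing import Any, Callable, Hashable


class BoundedCache:
    """Hypothetical LRU cache; max_items <= 0 means 'do not cache at all'."""

    def __init__(self, loader: Callable[[Hashable], Any], max_items: int):
        self._loader = loader
        self.max_items = max_items
        self._items: "OrderedDict[Hashable, Any]" = OrderedDict()

    def get(self, key: Hashable) -> Any:
        if self.max_items <= 0:
            # Caching disabled: always go to the loader (e.g. read from SSD).
            return self._loader(key)
        if key in self._items:
            # Cache hit: mark the item as most recently used.
            self._items.move_to_end(key)
            return self._items[key]
        value = self._loader(key)
        self._items[key] = value
        if len(self._items) > self.max_items:
            # Evict the least recently used item to respect the limit.
            self._items.popitem(last=False)
        return value
```

With a small dataset on an HDD, `max_items` could be set high enough to hold everything. With a very large dataset on an SSD, it could be set to 0 so memory use stays flat.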

I don't think the audio format and sample rate are a problem. The audio: Audio parameter will take care of all of that.

The reason why I asked is that resampling (in case of mismatch) may take a significant amount of CPU time, slowing down the model training.
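
As a rough way to check whether this applies to a given noise folder, the sketch below lists files whose sample rate differs from the target, so they could be resampled once offline instead of on every training step. It is an assumption-laden illustration: `find_sample_rate_mismatches` is a hypothetical helper, and it uses the standard-library `wave` module, so it only handles uncompressed PCM WAV files.

```python
import wave
from pathlib import Path
from typing import List


def find_sample_rate_mismatches(noise_dir: str, target_sample_rate: int) -> List[Path]:
    """Return WAV files under noise_dir whose sample rate differs from the target."""
    mismatched = []
    for path in sorted(Path(noise_dir).rglob("*.wav")):
        # Only the header is needed to read the sample rate.
        with wave.open(str(path), "rb") as wav_file:
            if wav_file.getframerate() != target_sample_rate:
                mismatched.append(path)
    return mismatched
```

An empty result would suggest that no resampling is triggered for those files when they are mixed with input audio at the target sample rate.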

I'm currently wrapping up the 0.11 release, and then I'll have some work preparing a few new transforms. After that, I'll hopefully have more time to consider this caching feature. In the meantime, thanks for your patience, and I hope you're okay with using your own fork for now.

cache background_noise rms data

eaaa855

iver56 force-pushed the master branch from 57fd377 to 643f320 Compare June 29, 2022 09:15
