Is there other advice for preprocessing audio files? #2

attitudechunfeng · 2017-08-09T08:24:51Z

Hi, twerkmeister. I've recently been following your excellent work 'iLID'. The approach performs well when tested on a dataset consisting of lots of clean audio. However, when tested on audio recorded in natural scenes, it doesn't perform as well. In your project, I've seen the loudness normalization operation. Is there any other advice for preprocessing the audio to make it cleaner?

Many thanks.

hotzenklotz · 2017-08-09T11:37:40Z

You could augment your training data to include more "natural scenes". Adding some noise and/or background music could help. Alternatively, consider using more natural training data to begin with.
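As a rough illustration of the noise augmentation idea, here is a minimal sketch that mixes a noise (or background music) clip into a speech clip at a chosen signal-to-noise ratio. It is not taken from the iLID code base: the function name `mix_at_snr`, its parameters, and the assumption that both clips are mono NumPy arrays at the same sample rate are hypothetical choices made for this example.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db, rng=None):
    """Return `speech` with `noise` added at roughly `snr_db` dB SNR.

    Hypothetical helper: both inputs are assumed to be 1-D mono arrays
    sampled at the same rate.
    """
    rng = np.random.default_rng() if rng is None else rng
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)

    # Loop the noise clip if it is shorter than the speech clip.
    if len(noise) < len(speech):
        reps = int(np.ceil(len(speech) / len(noise)))
        noise = np.tile(noise, reps)

    # Take a random crop of the noise with the same length as the speech.
    start = rng.integers(0, len(noise) - len(speech) + 1)
    noise = noise[start:start + len(speech)]

    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    if speech_power == 0 or noise_power == 0:
        # Nothing meaningful to scale against; return the speech unchanged.
        return speech.copy()

    # Choose the scale so that
    # speech_power / (scale**2 * noise_power) == 10 ** (snr_db / 10).
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise
```

One way to use it would be to draw a random SNR per training sample (for instance somewhere between 0 and 20 dB, which is an assumption rather than a tuned value) before computing spectrograms. The mixed signal can exceed the original amplitude range, so it may need to be rescaled or loudness-normalized again afterwards.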

We also found that the neural network architecture published here is not deep enough for noisy environments. Given enough data, I had good results using the Inception v3 network for LID.

attitudechunfeng · 2017-08-09T11:51:34Z

Okay, I'll try that. Thank you for your advice.

attitudechunfeng · 2017-08-10T02:38:32Z

By the way, could you show me your prototxt files? I want to check the hyper-parameters.
