Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Are there other advices to preprocess audio files? #2

Open
attitudechunfeng opened this issue Aug 9, 2017 · 3 comments
Open

Are there other advices to preprocess audio files? #2

attitudechunfeng opened this issue Aug 9, 2017 · 3 comments

Comments

@attitudechunfeng
Copy link

Hi, twerkmeister. I'm following your excellent work 'iLID' recently. The approach shows good performance when tested on the dataset consisting of lots of clean audios. However, when tested on the audios recorded in natural scenes, it doesn't perform as well as before. In your project, I've seen the loudness normalization operation. Are there other advices to preprocess the audio to make it more clean?

many thanks.

@hotzenklotz
Copy link
Collaborator

You could augment your training data to include more "natural scenes". Adding some noise and or background music could help. Alternatively, consider using more natural training data to begin with.

We also found that the neural network architecture published here is not deep / sufficient enough for noisy environments. Given enough data, I had good results using the Inceptionv3 network for LID.

@attitudechunfeng
Copy link
Author

okay, I'll try that and thank u for your advices.

@attitudechunfeng
Copy link
Author

By the way, could you show me your prototxt files? I want to check the hyper-parameters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants