Blank zeroed rows in Mel spectrograms #851
Comments
Hi @duvtedudug, thanks for reporting the issue. Can you show us how you load the audio file? Also, do you know of any publicly accessible audio data that causes the same issue, or can you share your data?
After more exploration the cause was having not enough I was following the code in the tutorial Here's my code...
Just adding more resolution to the spectrogram seems to give the Mel conversion what it needs! My only suggestion is to change the default Thanks for your time!
Thanks for the suggestion :) There was also a discussion in #384 about using powers of 2 instead. (I'm also noting that the default was changed in #83.) If you use master, do you now get a warning about zero Mel filters if you don't specify
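For anyone landing here later, here is a rough sketch of why too little frequency resolution can produce all-zero Mel rows. It is plain Python, not torchaudio code: it assumes an HTK-style Mel scale with triangular filters, and the helper name `count_empty_mel_filters` and the parameter values (16 kHz sample rate, 128 Mel bands, FFT sizes of 400 and 2048) are illustrative assumptions rather than anything quoted in this thread.

```python
import math


def hz_to_mel(freq_hz):
    # HTK-style Hz -> Mel conversion (assumed here for illustration).
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def count_empty_mel_filters(n_fft, n_mels, sample_rate):
    """Count triangular Mel filters that cover no STFT bin at all."""
    nyquist = sample_rate / 2.0
    n_freqs = n_fft // 2 + 1
    # Centre frequencies of the STFT bins, linearly spaced from 0 to Nyquist.
    bin_freqs = [k * nyquist / (n_freqs - 1) for k in range(n_freqs)]

    # n_mels + 2 edge points, equally spaced on the Mel scale.
    mel_max = hz_to_mel(nyquist)
    edges = [mel_to_hz(i * mel_max / (n_mels + 1)) for i in range(n_mels + 2)]

    empty = 0
    for i in range(n_mels):
        low, high = edges[i], edges[i + 2]
        # A triangular filter has non-zero weight only for bins that fall
        # strictly between its lower and upper edge.
        if not any(low < f < high for f in bin_freqs):
            empty += 1
    return empty


if __name__ == "__main__":
    for n_fft in (400, 2048):
        empty = count_empty_mel_filters(n_fft=n_fft, n_mels=128, sample_rate=16000)
        print(f"n_fft={n_fft}: {empty} empty Mel filters")
```

With the small FFT the lowest Mel filters are narrower than the spacing between STFT bins, so some of them catch no bin and their output row is always zero. Raising the FFT size (or lowering the number of Mel bands) closes those gaps.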
🐛 Bug
Mel spectrograms have some seemingly random rows that are all zeros. Normal spectrograms are fine.
To Reproduce
Steps to reproduce the behavior:
mel = torchaudio.transforms.MelSpectrogram()(waveform)
or
spec = torchaudio.transforms.Spectrogram()(waveform)
mel = torchaudio.transforms.MelScale()(spec)
Both give something like this (the white lines are zero-valued rows):
Expected behavior
All rows to have data!
Environment