torchaudio.compliance.kaldi.fbank #1245

Open
qmpzzpmq opened this issue Feb 7, 2021 · 6 comments

@qmpzzpmq

qmpzzpmq commented Feb 7, 2021

Please support batched Kaldi fbank computation.
The documentation says: "waveform (Tensor) – Tensor of audio of size (c, n) where c is in the range [0,2)"
Right now only a single utterance can be computed per call.

@mthrok
Collaborator

mthrok commented Feb 8, 2021

Thanks for the feedback. This is certainly important, and we will try to address it.
We are thinking of making some tweaks to torchaudio.compliance.kaldi.
We do not have an immediate action plan at the moment, but we will try to come back to this as soon as possible.

@Oktai15
Contributor

Oktai15 commented Feb 17, 2021

@qmpzzpmq you can use torchaudio.transforms.MelSpectrogram as an alternative.

@qmpzzpmq
Author

@Oktai15
Hi, I am just wondering whether the results of the two are the same.
From the descriptions, the results look different.

@Oktai15
Contributor

Oktai15 commented Feb 18, 2021

@qmpzzpmq for example, check this issue: #157 (comment)

@qmpzzpmq
Author

@Oktai15 thanks for your example, I will test whether they give the same result. It looks like there are still some parameters to check, though.

mthrok pushed a commit to mthrok/audio that referenced this issue Feb 26, 2021

@haha010508

I found that FBank cannot run in async mode. Can anyone fix this? Thanks!
