Add RTFMVDR module #2368

nateanl · 2022-05-06T14:12:49Z

Add a new design of MVDR module.
The RTFMVDR module supports the method based on the relative transfer function (RTF) and power spectral density (PSD) matrix of noise.
The input arguments are:

multi-channel spectrum.
RTF vector of the target speech
PSD matrix of noise.
reference channel in the microphone array.
diagonal_loading option to enable or disable diagonal loading in matrix inverse computation.
diag_eps for computing the inverse of the matrix.
eps for computing the beamforming weight.
The output of the module is the single-channel complex-valued spectrum for the enhanced speech.

nateanl · 2022-05-06T17:09:33Z

Docs: https://output.circle-artifacts.com/output/job/1fef7f01-57e0-4220-a503-5dc5bd5ab6ee/artifacts/0/docs/transforms.html#rtfmvdr

torchaudio/transforms/_transforms.py

facebook-github-bot · 2022-05-06T21:30:54Z

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

torchaudio/functional/functional.py

facebook-github-bot · 2022-05-07T08:39:37Z

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary: Add a new design of MVDR module. The RTFMVDR module supports the method based on the relative transfer function (RTF) and power spectral density (PSD) matrix of noise. The input arguments are: - multi-channel spectrum. - RTF vector of the target speech - PSD matrix of noise. - reference channel in the microphone array. - diagonal_loading option to enable or disable diagonal loading in matrix inverse computation. - diag_eps for computing the inverse of the matrix. - eps for computing the beamforming weight. The output of the module is the single-channel complex-valued spectrum for the enhanced speech. Pull Request resolved: pytorch#2368 Reviewed By: carolineechen Differential Revision: D36214940 Pulled By: nateanl fbshipit-source-id: 6c6606209db677fd3c4c6d7e049b3ac5a4affbfc

facebook-github-bot · 2022-05-10T08:51:31Z

This pull request was exported from Phabricator. Differential Revision: D36214940

nateanl added new feature module: ops labels May 6, 2022

facebook-github-bot added the CLA Signed label May 6, 2022

nateanl mentioned this pull request May 6, 2022

[Migration] TorchAudio Beamforming Module Migration #2280

Closed

11 tasks

carolineechen mentioned this pull request May 6, 2022

Add diagonal_loading optional to rtf_power #2369

Closed

carolineechen approved these changes May 6, 2022

View reviewed changes

torchaudio/transforms/_transforms.py Outdated Show resolved Hide resolved

torchaudio/transforms/_transforms.py Outdated Show resolved Hide resolved

carolineechen reviewed May 6, 2022

View reviewed changes

torchaudio/functional/functional.py Show resolved Hide resolved

carolineechen approved these changes May 6, 2022

View reviewed changes

nateanl force-pushed the mvdr_rtf branch from f820063 to d068bda Compare May 10, 2022 08:51

facebook-github-bot closed this in 4b021ae May 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add RTFMVDR module #2368

Add RTFMVDR module #2368

nateanl commented May 6, 2022

nateanl commented May 6, 2022

facebook-github-bot commented May 6, 2022

facebook-github-bot commented May 7, 2022

facebook-github-bot commented May 10, 2022

Add RTFMVDR module #2368

Add RTFMVDR module #2368

Conversation

nateanl commented May 6, 2022

nateanl commented May 6, 2022

facebook-github-bot commented May 6, 2022

facebook-github-bot commented May 7, 2022

facebook-github-bot commented May 10, 2022