Add RTFMVDR module #2368

Closed
wants to merge 1 commit into from

Conversation

nateanl (Member) commented May 6, 2022

Add a new design of the MVDR module.
The RTFMVDR module implements the MVDR method based on the relative transfer function (RTF) of the target speech and the power spectral density (PSD) matrix of the noise.
The input arguments are:

  • the multi-channel spectrum.
  • the RTF vector of the target speech.
  • the PSD matrix of the noise.
  • the reference channel in the microphone array.
  • diagonal_loading, an option to enable or disable diagonal loading in the matrix inverse computation.
  • diag_eps, used when computing the inverse of the matrix.
  • eps, used when computing the beamforming weight.

The output of the module is the single-channel complex-valued spectrum of the enhanced speech.
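
For readers unfamiliar with the RTF-based formulation, here is a minimal NumPy sketch of the computation the arguments above describe. It is not the code in this PR: the function name `rtf_mvdr`, the tensor layouts, the default values, and the trace-scaled form of diagonal loading are all assumptions made for illustration. The weight is taken as w = (Φ_NN⁻¹ v) / (vᴴ Φ_NN⁻¹ v + eps), scaled by the conjugate of the RTF at the reference channel, and the output is wᴴ X per frequency bin.

```python
import numpy as np


def rtf_mvdr(specgram, rtf, psd_n, reference_channel=0,
             diagonal_loading=True, diag_eps=1e-7, eps=1e-8):
    """Hypothetical RTF-based MVDR sketch (layouts are assumptions).

    specgram: complex array, shape (channel, freq, time)
    rtf:      complex array, shape (freq, channel)
    psd_n:    complex Hermitian array, shape (freq, channel, channel)
    Returns a complex array of shape (freq, time).
    """
    n_channels = psd_n.shape[-1]
    if diagonal_loading:
        # Assumed loading scheme: add a small trace-scaled multiple of the
        # identity so the noise PSD matrix is safely invertible.
        trace = np.trace(psd_n, axis1=-2, axis2=-1).real  # (freq,)
        loading = (trace * diag_eps + eps)[:, None, None]
        psd_n = psd_n + loading * np.eye(n_channels)

    # Numerator: solve psd_n @ x = rtf per frequency instead of inverting.
    numerator = np.linalg.solve(psd_n, rtf[..., None])[..., 0]  # (freq, channel)
    # Denominator: v^H Phi^{-1} v, real for a Hermitian PSD matrix.
    denominator = np.einsum("fc,fc->f", rtf.conj(), numerator).real
    weights = numerator / (denominator[:, None] + eps)
    # Scale by the conjugate RTF entry of the reference channel.
    weights = weights * rtf[:, reference_channel].conj()[:, None]

    # Apply the beamformer: w^H X for every frequency bin and frame.
    return np.einsum("fc,cft->ft", weights.conj(), specgram)
```

A quick sanity check of the sketch is the distortionless property: when the input contains only the target (each channel equals the RTF times a source spectrum), the output should match the reference channel of the input up to the small `eps` terms.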

@facebook-github-bot (Contributor)

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary:
Add a new design of MVDR module.
The RTFMVDR module supports the method based on the relative transfer function (RTF) and power spectral density (PSD) matrix of noise.
The input arguments are:
- multi-channel spectrum.
- RTF vector of the target speech
- PSD matrix of noise.
- reference channel in the microphone array.
- diagonal_loading option to enable or disable diagonal loading in matrix inverse computation.
- diag_eps for computing the inverse of the matrix.
- eps for computing the beamforming weight.
The output of the module is the single-channel complex-valued spectrum for the enhanced speech.

Pull Request resolved: pytorch#2368

Reviewed By: carolineechen

Differential Revision: D36214940

Pulled By: nateanl

fbshipit-source-id: 6c6606209db677fd3c4c6d7e049b3ac5a4affbfc
@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D36214940

3 participants