Add diagonal_loading optional to rtf_power #2369

nateanl · 2022-05-06T16:23:50Z

When computing MVDR beamforming weights with the power iteration method, diagonal loading can be applied to the noise PSD matrix to improve robustness. The same applies when computing the RTF matrix (see https://github.com/espnet/espnet/blob/master/espnet2/enh/layers/beamformer.py#L614 for an example). It also keeps the function consistent with the current torchaudio.transforms.MVDR module.

This PR adds a diagonal_loading argument, with True as the default value, to torchaudio.functional.rtf_power.
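
For readers unfamiliar with the term, diagonal loading adds a small multiple of the identity matrix to the noise PSD matrix before it is inverted. The following is a minimal NumPy sketch of a trace-scaled variant, not the torchaudio code: the name `diagonal_load` and its `reg`/`eps` parameters are assumptions made for illustration, with defaults borrowed from the values mentioned in the review (1e-7 and 1e-8).

```python
import numpy as np


def diagonal_load(psd, reg=1e-7, eps=1e-8):
    """Add a small trace-scaled multiple of the identity to a PSD matrix.

    Hypothetical helper for illustration only.
    psd: complex array of shape (..., channel, channel).
    """
    n_channels = psd.shape[-1]
    # Real part of the trace over the last two axes, one value per matrix.
    trace = np.real(np.trace(psd, axis1=-2, axis2=-1))
    # Make the loading broadcastable against (..., channel, channel).
    loading = np.asarray(trace * reg + eps)[..., None, None]
    return psd + loading * np.eye(n_channels, dtype=psd.dtype)


# A rank-1 noise PSD is singular, so solving against it directly is unreliable.
v = np.array([1.0 + 1.0j, 2.0 - 1.0j, 0.5 + 0.0j])
psd_n = np.outer(v, v.conj())
loaded = diagonal_load(psd_n)
```

Scaling the loading by the trace keeps the regularization proportional to the signal level, while the constant `eps` covers the case of an all-zero matrix.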

facebook-github-bot · 2022-05-06T16:27:11Z

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-05-06T18:33:43Z

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

carolineechen

Was rtf_power included in the last release? The implementation looks good, but it changes the default behavior of the function so that it produces different values, so it may need to be labeled as BC-breaking.

nateanl · 2022-05-06T19:12:00Z

> Was rtf_power included in the last release? The implementation looks good, but it changes the default behavior of the function so that it produces different values, so it may need to be labeled as BC-breaking.

No, it was not in the previous release, so I think we can regard the changes as backward compatible (BC). What do you think?

carolineechen · 2022-05-06T19:20:55Z

torchaudio/functional/functional.py

+            (Default: ``True``)
+        diag_eps (float, optional): The coefficient multiplied to the identity matrix for diagonal loading
+            (Default: ``1e-7``)
+        eps (float, optional): a value to avoid the correlation matrix is all-zero (Default: ``1e-8``)


Can this docstring be more helpful? Does "a value added to the denominator in the beamforming weight computation." from #2368 make sense here, and is it worth adding that this is only used when diagonal_loading=True?

nateanl

Makes sense. I will align the eps docstring across the functions and modules.

The eps here is for diagonal loading, which is easily confused with the eps used in computing the beamforming weights. I decided to exclude it from the API and use the default value in _tik_reg.
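
To illustrate the API shape described above, here is a minimal NumPy sketch of a power-iteration RTF estimate in which only `diagonal_loading` and `diag_eps` are exposed and the loading `eps` stays at an internal default. This is a simplified illustration, not the torchaudio implementation: `rtf_power_sketch` and `_tik_reg_sketch` are hypothetical names, and the way iterations are counted here is an assumption.

```python
import numpy as np


def _tik_reg_sketch(mat, reg=1e-7, eps=1e-8):
    # Hypothetical stand-in for a regularization helper: trace-scaled
    # diagonal loading, with eps kept internal rather than user-facing.
    trace = np.real(np.trace(mat, axis1=-2, axis2=-1))
    loading = np.asarray(trace * reg + eps)[..., None, None]
    return mat + loading * np.eye(mat.shape[-1], dtype=mat.dtype)


def rtf_power_sketch(psd_s, psd_n, reference_channel, n_iter=3,
                     diagonal_loading=True, diag_eps=1e-7):
    """Estimate the relative transfer function (RTF) by power iteration.

    psd_s, psd_n: complex arrays of shape (..., channel, channel).
    Returns a complex array of shape (..., channel).
    """
    if diagonal_loading:
        psd_n = _tik_reg_sketch(psd_n, reg=diag_eps)
    # phi = psd_n^{-1} @ psd_s, computed without forming an explicit inverse.
    phi = np.linalg.solve(psd_n, psd_s)
    # Start from the reference-channel column of phi, as a column vector.
    rtf = phi[..., reference_channel][..., None]
    # Each multiplication by phi pulls the vector toward its dominant eigenvector.
    for _ in range(n_iter - 1):
        rtf = phi @ rtf
    # Map back through the (possibly loaded) noise PSD.
    rtf = psd_n @ rtf
    return rtf[..., 0]


# Rank-1 speech PSD built from a known steering vector `a`.
a = np.array([1.0 + 0.5j, 0.3 - 0.2j, -0.7 + 1.0j])
psd_s = np.outer(a, a.conj())
psd_n = np.eye(3, dtype=complex) * 0.1
rtf = rtf_power_sketch(psd_s, psd_n, reference_channel=0)
```

For a rank-1 speech PSD the estimate is a scalar multiple of `a`, so dividing by the reference-channel entry recovers `a / a[0]` whether or not loading is enabled.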

carolineechen

Oops, I think I was looking at the wrong branch and thought it was part of the last release. That sounds good to me! Just the docstring change then.

facebook-github-bot · 2022-05-06T21:28:57Z

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

add diag_loading to rtf_power

e5b1a41

facebook-github-bot added the CLA Signed label May 6, 2022

nateanl added improvement module: ops labels May 6, 2022

nateanl requested review from hwangjeff, xiaohui-zhang, mthrok and carolineechen May 6, 2022 16:26

nateanl mentioned this pull request May 6, 2022

[Migration] TorchAudio Beamforming Module Migration #2280

Closed

11 tasks

fix torchscript test

8c7fc8d

carolineechen reviewed May 6, 2022

carolineechen approved these changes May 6, 2022

fix docstring, update unit tests

58d94a1

facebook-github-bot closed this in da1e83c May 10, 2022
