Skip to content

Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement

Notifications You must be signed in to change notification settings

yuguochencuc/CinCGAN-SE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

This is the repo of the manuscript "Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement", which is accepted by ICASSP2022. Some audio samples are provided here and the code for network is released soon.

Abstract: For the lack of adequate paired noisy-clean speech corpus in many real scenarios, non-parallel training is a promising task for DNN-based speech enhancement methods. However, because of the severe mismatch between input and target speech, many previous studies only focus on magnitude spectrum estimation and remain the phase unaltered, resulting in the degraded speech quality under low signal-to-noise ratio conditions. To tackle this problem, we decouple the difficult target w.r.t. original spectrum optimization into spectral magnitude and phase, and a novel Cycle-in-ycle generative adversarial network (dubbed CinCGAN) is proposed to jointly estimate the spectral magnitude and phase information stage by stage. In the first stage, we pretrain a magnitude CycleGAN to coarsely denoise the spectral magnitude spectrum. In the second stage, we couple the pretrained CycleGAN with a complex-valued CycleGAN as a cycle-in-cycle structure to recover phase information and refine the spectral magnitude simultaneously. Experimental results on Voice Bank + DEMAND show that the proposed approach significantly outperforms previous baselines under non-parallel training. The evaluation on training the models with standard paired data also shows that the proposed method can achieve remarkable performance.

System flowchart of CinCGAN

image

The architecture of generators: image

Results:

image

image

Samples under non-parallel training:

image

Citation

If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry.

@article{yu2021joint,
title={Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement},
author={Yu, Guochen and Li, Andong and Wang, Yutian and Guo, Yinuo and Zheng, Chengshi and Wang, Hui},
journal={arXiv preprint arXiv:2109.12591},
year={2021}
}

About

Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published