VAE and STCN implementations for single-channel speech enhancement

This repository contains the non-sequential VAE and STCN speech models and the NMF noise model for single-channel speech enhancement.

3/18/2021: VAE is now instance of STCN when the parameters are set to:

kernel_size = 1
tcn_channels = [128]
latent_channels = [16]
dec_channels = [16, 128, 128, 513]
concat_z = False

3/24/2021: STFT is being calculated on GPU

Whenever you use this code for any experiments and/or publications you need to cite our original paper [1].

[1] Julius Richter, Guillaume Carbajal, Timo Gerkmann, "Speech Enhancement with Stochastic Temporal Convolutional Networks", Proc. Interspeech 2020, DOI: 10.21437/Interspeech.2020-2588.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
loss		loss
models		models
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_creation.py		data_creation.py
dataset.py		dataset.py
enhancement.py		enhancement.py
mcem.py		mcem.py
stcn.py		stcn.py
training_stcn.py		training_stcn.py
training_vae.py		training_vae.py
utils.py		utils.py
vae.py		vae.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VAE and STCN implementations for single-channel speech enhancement

About

Releases

Packages

Languages

License

sp-uhh/stcn-nmf

Folders and files

Latest commit

History

Repository files navigation

VAE and STCN implementations for single-channel speech enhancement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages