
About the application #1

Open
Lerry123 opened this issue Sep 20, 2020 · 3 comments
Comments

@Lerry123

Hello, I have used the Seq-U-Net with residual connections for speech enhancement. When I trained and tested on a large dataset, the results were very poor. Could you give some suggestions about this issue?

@f90
Owner

f90 commented Sep 22, 2020

Hey, can you clarify more about how you applied the Seq-U-Net? I need some more info about the setting so I can give you some clues as to what the problem might be!

@Lerry123
Author

I sampled the speech with a sliding window of about 1 s, i.e. roughly 16,384 sample points. These windows were fed into the Seq-U-Net used for speech waveform generation in the original paper, with the number of channels changed to 1.
The VCTK database was used for training, with 11,578 training samples and 874 test samples. When all samples were used for training, the enhanced speech was distorted. But when I trained with only 50 samples and tested with 30, the enhanced speech showed no distortion.
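
For reference, here is a minimal sketch of the kind of sliding-window segmentation I mean, assuming 16 kHz mono audio loaded with soundfile; the window length of 16384 and the non-overlapping hop are assumptions, not necessarily the exact values used in the original code:

```python
import numpy as np
import soundfile as sf  # assumed audio-loading library

WINDOW = 16384  # ~1 s at 16 kHz, as described above
HOP = 16384     # assumed non-overlapping windows; the actual hop may differ

def segment_waveform(path):
    """Load a mono waveform and split it into fixed-length windows."""
    audio, sr = sf.read(path, dtype="float32")
    if audio.ndim > 1:            # down-mix to a single channel
        audio = audio.mean(axis=1)
    # pad the tail so the last window is full length
    pad = (-len(audio)) % HOP
    audio = np.pad(audio, (0, pad))
    # shape: (num_windows, WINDOW), one training example per row
    starts = range(0, len(audio) - WINDOW + 1, HOP)
    return np.stack([audio[s:s + WINDOW] for s in starts])

# Example: build (noisy, clean) training pairs for speech enhancement
# noisy = segment_waveform("noisy/p225_001.wav")
# clean = segment_waveform("clean/p225_001.wav")
```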

@Lerry123
Author

I used raw_audio from the original code, and set both the input size and the output size to 16384.
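
To make that concrete, here is a minimal sketch of a dataset yielding matching 16384-sample input/output pairs, assuming windows produced as in the earlier sketch; `EnhancementPairs` is a hypothetical name for illustration, not part of the original raw_audio code:

```python
import torch
from torch.utils.data import Dataset

class EnhancementPairs(Dataset):
    """Pairs of (noisy, clean) windows, each 16384 samples long,
    so the model's input and output sizes match as described above."""
    def __init__(self, noisy_windows, clean_windows):
        # both arrays have shape (num_windows, 16384)
        assert noisy_windows.shape == clean_windows.shape
        self.noisy = torch.as_tensor(noisy_windows).unsqueeze(1)  # (N, 1, 16384)
        self.clean = torch.as_tensor(clean_windows).unsqueeze(1)

    def __len__(self):
        return len(self.noisy)

    def __getitem__(self, idx):
        return self.noisy[idx], self.clean[idx]
```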
