To Much Noise on Mandarin #498

Johnzxf · 2020-08-20T02:12:57Z

I have synthesize on Mandarin.But it seems too much noise,listen to the audio below. How can I do to decrease the noise. Also, it seems synthesize femal audio is better than male.
BTW: the audio was synthesizer from texts.

链接：https://pan.baidu.com/s/1jXy9-6KnBKVciapcbGZmFw
提取码：knjb

ghost · 2020-08-20T05:49:07Z

Not enough info provided. I can't really help with this but if you answer the following questions it will improve your chances of getting a helpful response.

Did you use the code from this repo? If so what modifications did you make?
Which dataset did you train on?
What are your settings? Attach the synthesizer/hparams.py file
Which vocoder are you using?

Also you should make a zip file of your audio samples and attach them here, many of us here can't download from pan.baidu.com.

Johnzxf · 2020-08-20T07:42:44Z

Not enough info provided. I can't really help with this but if you answer the following questions it will improve your chances of getting a helpful response.

Did you use the code from this repo? If so what modifications did you make?

Which dataset did you train on?

What are your settings? Attach the synthesizer/hparams.py file

Which vocoder are you using?

Also you should make a zip file of your audio samples and attach them here, many of us here can't download from pan.baidu.com.

1: Yes, I use this repo. I did not change anything on the model. The most modifications is the preprocess of the dataset.
2: The dataset I have trained on is internal, and the audio is clean. The audio in my dataset is not as longer as Libirspeech.Most of the audio is between 3-5s.
3：Attach is setting.
4: The vocoder is WaveRNN.

During training synthesizer, I found the aligments has some gap,bteween encoder and decoder. Also see the attachment. Is it has inflence?

SV2TTS.zip
hparams.txt

ghost · 2020-08-25T02:04:43Z

@Johnzxf I'm sorry you didn't get a response. Did you figure out what was causing the noise?

Johnzxf · 2020-08-28T13:46:19Z

Thank you for your replay. I have not got the reason yet. I will check the vocoder first then the acoustic model. If I found the reason I will share with you. thank you------------------ 原始邮件 ------------------ 发件人: "blue-fish"<notifications@github.com> 发送时间: 2020年8月25日(星期二) 上午10:04 收件人: "CorentinJ/Real-Time-Voice-Cloning"<Real-Time-Voice-Cloning@noreply.github.com>; 抄送: "Johnzxf"<1240721730@qq.com>;"Mention"<mention@noreply.github.com>; 主题: Re: [CorentinJ/Real-Time-Voice-Cloning] To Much Noise on Mandarin (#498)

ghost · 2020-08-28T21:00:14Z

Let's reopen and see if anyone knows why this is the case. Are you using RTVC (this repo) or zhrtvc?

lawrence124 · 2020-09-12T15:14:31Z

@blue-fish
i have not tried the waveRNN vocoder on RTVC, but the pretrained waveRNN vocoder can't give me anything useful. The melgan multi speaker (or other melgan vocoder) yield far better result. I'm not too sure why though. (do we need to train vocoder for a specific language ??)

ghost · 2020-09-12T22:22:07Z

@lawrence124 A vocoder trained on enough speakers can generalize to unseen speakers and even other languages. mozilla/TTS#221 (comment)

The vocoders work within very narrow parameters and will fail if input does not meet the specification. To avoid incompatibility, the vocoder in this repo gets some of its hparams from the synthesizer. The relevant parameters are:

sample_rate
num_mels
n_fft, hop_length, window_length
fmin, fmax
pre-emphasize

With default settings, I got very mediocre results on zhrtvc and I suspect that some of the vocoder parameters are not set correctly. Griffin-Lim performed the best. You should try this repo with the English models, the pretrained WaveRNN works quite well.

ghost · 2020-10-06T19:43:07Z

Closing this issue due to inactivity. Would like to know what is causing the noise if you find out.

Johnzxf closed this as completed Aug 25, 2020

ghost reopened this Aug 28, 2020

ghost closed this as completed Oct 6, 2020

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

To Much Noise on Mandarin #498

To Much Noise on Mandarin #498

Johnzxf commented Aug 20, 2020

ghost commented Aug 20, 2020

Johnzxf commented Aug 20, 2020

ghost commented Aug 25, 2020

Johnzxf commented Aug 28, 2020 via email

ghost commented Aug 28, 2020

lawrence124 commented Sep 12, 2020

ghost commented Sep 12, 2020

ghost commented Oct 6, 2020

To Much Noise on Mandarin #498

To Much Noise on Mandarin #498

Comments

Johnzxf commented Aug 20, 2020

ghost commented Aug 20, 2020

Johnzxf commented Aug 20, 2020

ghost commented Aug 25, 2020

Johnzxf commented Aug 28, 2020 via email

ghost commented Aug 28, 2020

lawrence124 commented Sep 12, 2020

ghost commented Sep 12, 2020

ghost commented Oct 6, 2020