-
Notifications
You must be signed in to change notification settings - Fork 8.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
To Much Noise on Mandarin #498
Comments
Not enough info provided. I can't really help with this but if you answer the following questions it will improve your chances of getting a helpful response.
Also you should make a zip file of your audio samples and attach them here, many of us here can't download from pan.baidu.com. |
1: Yes, I use this repo. I did not change anything on the model. The most modifications is the preprocess of the dataset. During training synthesizer, I found the aligments has some gap,bteween encoder and decoder. Also see the attachment. Is it has inflence? |
@Johnzxf I'm sorry you didn't get a response. Did you figure out what was causing the noise? |
Thank you for your replay. I have not got the reason yet. I will check the vocoder first then the acoustic model. If I found the reason I will share with you. thank you------------------ 原始邮件 ------------------
发件人: "blue-fish"<notifications@github.com>
发送时间: 2020年8月25日(星期二) 上午10:04
收件人: "CorentinJ/Real-Time-Voice-Cloning"<Real-Time-Voice-Cloning@noreply.github.com>;
抄送: "Johnzxf"<1240721730@qq.com>;"Mention"<mention@noreply.github.com>;
主题: Re: [CorentinJ/Real-Time-Voice-Cloning] To Much Noise on Mandarin (#498)
|
Let's reopen and see if anyone knows why this is the case. Are you using RTVC (this repo) or zhrtvc? |
@blue-fish |
@lawrence124 A vocoder trained on enough speakers can generalize to unseen speakers and even other languages. mozilla/TTS#221 (comment) The vocoders work within very narrow parameters and will fail if input does not meet the specification. To avoid incompatibility, the vocoder in this repo gets some of its hparams from the synthesizer. The relevant parameters are:
With default settings, I got very mediocre results on zhrtvc and I suspect that some of the vocoder parameters are not set correctly. Griffin-Lim performed the best. You should try this repo with the English models, the pretrained WaveRNN works quite well. |
Closing this issue due to inactivity. Would like to know what is causing the noise if you find out. |
I have synthesize on Mandarin.But it seems too much noise,listen to the audio below. How can I do to decrease the noise. Also, it seems synthesize femal audio is better than male.
BTW: the audio was synthesizer from texts.
链接:https://pan.baidu.com/s/1jXy9-6KnBKVciapcbGZmFw
提取码:knjb
The text was updated successfully, but these errors were encountered: