Dev pr2: handle multi-speaker and GST in synthesizer class #5
Conversation
Sorry for being slow. I'll definitely check the PR by tomorrow at the latest.
No problem! Just let me know if you have any requests for modification :)
I think the only immediate requirement is writing some testing code for the synthesizer. I'll write one for the current synthesizer in the dev branch, then you can rebase and add more for the multi-speaker and GST changes you made.
Can you implement test cases for your changes - GST and multi-speaker?
Yes! I will implement this and push the changes at the beginning of next week :)
@kirianguiller any updates?
Yes, sorry, I've had some quite busy weeks here. Thanks for reminding me, though. I will implement the tests for the code I added and work on the new conflicts.
@kirianguiller I am also on this PR. It might be better if you wait for me to push my updates. I also rebased onto the latest dev. I'll ping you.
Oh cool! Thank you. I'll wait for your changes then :)
I'm closing this in favor of #441.
Hi guys,
Here is the second split of the PR I did earlier this week.
This new content handles multi-speaker and GST inference in the Synthesizer class (which is used in server.py and in the Google Colab for Chinese that I mentioned in the first point). You can now pass the following two optional parameters to the Synthesizer.tts() method:
speaker_json_key and style_wav. speaker_json_key is the key of one of the speakers in the provided speakers.json. style_wav is either a path to a wav file for GST style transfer, or a dict of style-token weights such as {"token1": 0.25, "token2": -0.1, ...}. The next step is to also give the user the possibility to directly provide an optional speaker_embedding parameter (a speaker embedding, as a numpy array or a list?) that will be passed to Tacotron at inference time.
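Here is a minimal usage sketch of the two new parameters. The import path, constructor arguments, file paths, and the speaker key below are placeholders for illustration, not taken verbatim from this PR; adapt them to your checkout.

```python
from TTS.utils.synthesizer import Synthesizer  # import path may differ by repo version

# Placeholder checkpoint/config paths; adapt to your setup.
synthesizer = Synthesizer(
    tts_checkpoint="tts_model.pth.tar",
    tts_config="config.json",
)

# Multi-speaker inference: select a speaker by its key in speakers.json.
wav = synthesizer.tts("Hello world.", speaker_json_key="speaker_0")

# GST style transfer from a reference wav file...
wav = synthesizer.tts("Hello world.", style_wav="reference.wav")

# ...or from an explicit dict of style-token weights.
wav = synthesizer.tts(
    "Hello world.",
    style_wav={"token1": 0.25, "token2": -0.1},
)
```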
The Synthesizer class is now simpler to use, and we can see in this Google Colab that it reduces the number of lines required to get working generation samples.
Thanks :)