Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update pretrained tts models #29

Merged
merged 2 commits into from
Dec 13, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion egs/datasets/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -163,7 +163,7 @@ The official LibriTTS dataset can be download [here](https://www.openslr.org/60/

## LJSpeech

The official LibriTTS dataset can be download [here](https://keithito.com/LJ-Speech-Dataset/). The file structure tree is like:
The official LJSpeech dataset can be download [here](https://keithito.com/LJ-Speech-Dataset/). The file structure tree is like:

```plaintext
[LJSpeech dataset path]
Expand Down
2 changes: 1 addition & 1 deletion egs/tts/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,4 +14,4 @@ Until now, Amphion TTS supports the following models or architectures,
- **[NaturalSpeech2](NaturalSpeech2)** (👨‍💻 developing): An architecture for TTS that utilizes a latent diffusion model to generate natural-sounding voices.

## Amphion TTS Demo
Here are some [TTS samples](https://openhlt.github.io/Amphion_TTS_Demo/) from Amphion (👨‍💻 developing).
Here are some [TTS samples](https://openhlt.github.io/Amphion_TTS_Demo/) from Amphion.
2 changes: 1 addition & 1 deletion egs/tts/VALLE/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -127,7 +127,7 @@ sh egs/tts/VALLE/run.sh --stage 3 --gpu "0" \
```


We will release a pre-trained VALL-E. So you can download the pre-trained model and generate speech following the above inference instruction.
We released a pre-trained Amphion VALL-E model. So you can download the pre-trained model [here](https://huggingface.co/amphion/valle-libritts) and generate speech following the above inference instruction.

```bibtex
@article{wang2023neural,
Expand Down
2 changes: 1 addition & 1 deletion egs/tts/VITS/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -121,7 +121,7 @@ sh egs/tts/VITS/run.sh --stage 3 --gpu "0" \
--infer_text "This is a clip of generated speech with the given text from a TTS model."
```

We will release a pre-trained VITS model trained on LJSpeech. So you can download the pre-trained model and generate speech following the above inference instruction.
We released a pre-trained Amphion VITS model trained on LJSpeech. So you can download the pre-trained model [here](https://huggingface.co/amphion/vits-ljspeech) and generate speech following the above inference instruction.


```bibtex
Expand Down