Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weโ€™ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dev v0.0.10 #2

Merged
merged 254 commits into from
Mar 10, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
254 commits
Select commit Hold shift + click to select a range
131a163
Merge pull request #628 from thorstenMueller/dev
Jan 28, 2021
8a6eee7
distill import statement, check python version in setup.py
erogol Jan 28, 2021
a926aa1
reorder imports
erogol Jan 29, 2021
094b399
pyaml
erogol Jan 29, 2021
5a6abe7
setup import reset
erogol Jan 29, 2021
e81ebec
fix device mismatch wavegrad training
sanjaesc Jan 29, 2021
aa5f246
hubconf.py and load .models.json from the defualt location by mange.py
erogol Jan 29, 2021
0354b6f
move hubconf
erogol Jan 29, 2021
66c2a61
docstring hubconf
erogol Jan 29, 2021
536366d
Merge pull request #635 from SanjaESC/patch-1
Jan 29, 2021
44c4a49
Set out_path to be required param.
thorstenMueller Jan 29, 2021
dfdac1d
Merge pull request #636 from thorstenMueller/dev
Jan 29, 2021
879d946
Ups. Added missing ,
thorstenMueller Jan 30, 2021
c740757
fix #638
erogol Feb 1, 2021
d003e59
readme update for espeak install
erogol Feb 1, 2021
699d2aa
pin cython verions 0.29.20
erogol Feb 1, 2021
5beed0d
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
erogol Feb 1, 2021
8774e37
unpin cython version and commentout pyworld in audio.py causing dep
erogol Feb 1, 2021
5c46543
linter fixes and version updates for deps
erogol Feb 1, 2021
41f6579
push numpy version up to 1.17.5
erogol Feb 1, 2021
167bbc6
update version number to 0.0.9.1
erogol Feb 1, 2021
c75ea74
Added info if model already downloaded in --list_models
thorstenMueller Jan 27, 2021
4cb4fcf
Set out_path to be required param.
thorstenMueller Jan 29, 2021
a82152e
Ups. Added missing ,
thorstenMueller Jan 30, 2021
23c9009
Merge branch 'dev' of https://github.com/thorstenMueller/TTS into dev
thorstenMueller Feb 2, 2021
d74866c
Merge remote-tracking branch 'upstream/dev' into dev
thorstenMueller Feb 2, 2021
7bc9862
Merge pull request #637 from thorstenMueller/dev
Feb 3, 2021
cb77aef
waveRNN fix
bgerazov Feb 4, 2021
8871c11
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
erogol Feb 5, 2021
24ffa9e
update wavernn test config, delete cap=True
bgerazov Feb 5, 2021
f063545
improve robustness of defining wavernn in config file
bgerazov Feb 5, 2021
d49757f
linter updates
erogol Feb 5, 2021
f4f6290
Merge branch 'pr/gerazov/641' into dev
erogol Feb 5, 2021
5b06c74
hubconf dependency
erogol Feb 5, 2021
e7e880f
fix gdown
erogol Feb 5, 2021
39f65f1
hubconf update
erogol Feb 5, 2021
4f8f274
restructured arg parsing and processing to utils
gerazov Feb 6, 2021
2705d27
changed train scripts
gerazov Feb 6, 2021
8fdd08e
updated to current dev
gerazov Feb 6, 2021
ad17dc9
final fixes
gerazov Feb 6, 2021
e507373
final final fixes
gerazov Feb 6, 2021
acc0af7
update nose command for circle CI
erogol Feb 8, 2021
b22d99c
reduce glowtts size for testing
erogol Feb 8, 2021
7aa873b
use single process dataloder in tests
erogol Feb 8, 2021
c619859
linter fixes
erogol Feb 9, 2021
1f0385a
Merge branch 'pr/gerazov/642' into dev
erogol Feb 9, 2021
cea5e51
download github model releases by model manager
erogol Feb 9, 2021
0c6519c
test_compute_statistics.sh
erogol Feb 9, 2021
9cad435
css10 dataset preprocessor
erogol Feb 9, 2021
b08b8ca
add russian phoneme char
erogol Feb 10, 2021
6c824a6
spelling error. should be multiband not mulitband
adonispujols Feb 11, 2021
b29a7e9
spelling error. should be multiband not mulitband
adonispujols Feb 11, 2021
48011a8
add encoding="utf-8"
adonispujols Feb 11, 2021
3c2e13c
fix the default vocoder name
erogol Feb 11, 2021
a3d1e65
Merge branch 'pr/adonispujols/646' into dev
erogol Feb 11, 2021
3baec4e
add missing phonemes to test_config.json
erogol Feb 11, 2021
f1799db
docstring update
erogol Feb 11, 2021
bc13120
fix spelling of a def argument and parse phonemes from config.json if
erogol Feb 11, 2021
ff27690
bug fix
erogol Feb 11, 2021
2043a9b
define default vocoders
erogol Feb 11, 2021
0657b38
use default vocoder in synthesize.py
erogol Feb 11, 2021
43f54d2
fix make_symbols
erogol Feb 11, 2021
1649ad3
save_wav with a custom sampling rate
erogol Feb 11, 2021
0c52d27
return the json entry of the downloaded model
erogol Feb 11, 2021
8aa6a0d
set an output_sample_rate in synthesizer and use it for writing the wav
erogol Feb 11, 2021
a1e5957
use default vocoders in server.pu
erogol Feb 11, 2021
af46727
loading last checkpoint/best_model works, deleting last best models oโ€ฆ
gerazov Feb 12, 2021
8b6fd76
find unique characters in a dataset
erogol Feb 12, 2021
310d183
brushed up printing model load path and best loss path
gerazov Feb 12, 2021
702dff3
added keep_best and keep_after to test configs.
gerazov Feb 12, 2021
0e78e31
reformated docstrings in arguments.py
gerazov Feb 12, 2021
e774f68
save used model characters to the checkpoints
erogol Feb 12, 2021
918f007
docstring update
erogol Feb 12, 2021
2abfff1
enable saving model characters in io.py
erogol Feb 12, 2021
593cede
parse_characters function
erogol Feb 12, 2021
7ab527d
save default model chars to the training config file
erogol Feb 12, 2021
b28c724
remove _phoneme_punctuations
erogol Feb 12, 2021
4244096
update test_text_processing for espeak-ng
erogol Feb 12, 2021
420901f
linter fixes
erogol Feb 12, 2021
c613e01
update ci to espeak-ng
erogol Feb 12, 2021
33bcdc6
Updating models list to include EK1 TTS/vocoder
nmstoker Feb 14, 2021
e1bc823
Merge branch 'pr/nmstoker/652' into dev
erogol Feb 15, 2021
77e6303
author , license and contact info in .model.json
erogol Feb 15, 2021
dc3596d
model_manager tests
erogol Feb 15, 2021
706e841
linterup date
erogol Feb 15, 2021
3b6ce04
Update TTS/bin/find_unique_chars.py
Feb 15, 2021
80af8ca
Update TTS/utils/arguments.py
Feb 15, 2021
ff218e2
remove redundancy
erogol Feb 15, 2021
7f58fa3
Merge branch 'save_characters' into dev
erogol Feb 15, 2021
06a3ba2
linter update
erogol Feb 15, 2021
216945e
Merge pull request #647 from adonispujols/patch-1
Feb 15, 2021
40f4475
add encoding="utf-8"
adonispujols Feb 11, 2021
9cb02ae
<add> Chinese mandarin implementation (tacotron2)
kirianguiller Feb 15, 2021
8a106e0
fix #655
erogol Feb 15, 2021
eb543c0
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
erogol Feb 15, 2021
61c88be
refactored keep_all_best
gerazov Feb 15, 2021
3e59d3c
modify according to PR reviews
kirianguiller Feb 16, 2021
a9ea71c
remove re.Match typing in '_number_replace()'
kirianguiller Feb 16, 2021
fe049cb
add pypinyin and jieba to requierements.txt (chinese implementation)
kirianguiller Feb 16, 2021
4543562
remove gst handling in synthetizer.py class
kirianguiller Feb 16, 2021
ce0c5ec
do not test server and modelManager until fixing #657
erogol Feb 17, 2021
f6e6314
add RUSLAN dataset preprocessor
erogol Feb 17, 2021
a8ea0ea
Docstrings for audioprocessor
erogol Feb 17, 2021
d045446
Merge branch 'pr/gerazov/650-2' into dev
erogol Feb 17, 2021
c4c7bc1
<add> Chinese mandarin implementation (tacotron2)
kirianguiller Feb 15, 2021
fb0655d
modify according to PR reviews
kirianguiller Feb 16, 2021
3911b87
remove re.Match typing in '_number_replace()'
kirianguiller Feb 16, 2021
184ce07
add pypinyin and jieba to requierements.txt (chinese implementation)
kirianguiller Feb 16, 2021
22a6bbf
remove gst handling in synthetizer.py class
kirianguiller Feb 16, 2021
35a781d
Merge branch 'chinese-implementation' of https://github.com/kirianguiโ€ฆ
kirianguiller Feb 17, 2021
e4f81d6
Merge pull request #654 from kirianguiller/chinese-implementation
Feb 18, 2021
5b70c8b
enable backward compat for loading the best model
erogol Feb 18, 2021
adaeec5
Merge branch 'dev' of https://github.com/mozilla/TTS into dev
erogol Feb 18, 2021
547bfc4
bug fix
erogol Feb 18, 2021
7f36d91
update chinese model
kirianguiller Mar 1, 2021
c0ed527
update mandarin example notebook with new model gdrive ids
kirianguiller Mar 1, 2021
d706d4c
Merge pull request #6 from kirianguiller/dev-pr3
erogol Mar 2, 2021
977cb51
Fix README.md
erogol Mar 6, 2021
e663d8f
Update CODE_OF_CONDUCT.md
erogol Mar 7, 2021
1e7472b
Update README.md
erogol Mar 7, 2021
16f9e6a
Update README.md
erogol Mar 7, 2021
ea60b48
Update README.md
erogol Mar 7, 2021
0914c95
Update README.md
erogol Mar 7, 2021
3d46d54
Update CODE_OF_CONDUCT.md
erogol Mar 7, 2021
534e3c6
README update, set default models for synthesize.py and server.py. Diโ€ฆ
erogol Jan 27, 2021
b4bc5f6
update fixed stopnet_pos_weight parameter
sanjaesc Jan 27, 2021
915ec1f
Added info if model already downloaded in --list_models
thorstenMueller Jan 27, 2021
db231c8
distill import statement, check python version in setup.py
erogol Jan 28, 2021
90d4f08
reorder imports
erogol Jan 29, 2021
e9364a4
setup import reset
erogol Jan 29, 2021
bbea6a0
hubconf.py and load .models.json from the defualt location by mange.py
erogol Jan 29, 2021
30c3bef
move hubconf
erogol Jan 29, 2021
9646624
docstring hubconf
erogol Jan 29, 2021
83143fb
fix #638
erogol Feb 1, 2021
b2f1c97
readme update for espeak install
erogol Feb 1, 2021
bfb1273
pin cython verions 0.29.20
erogol Feb 1, 2021
ace430d
fix device mismatch wavegrad training
sanjaesc Jan 29, 2021
3eb00e8
Set out_path to be required param.
thorstenMueller Jan 29, 2021
a30a231
unpin cython version and commentout pyworld in audio.py causing dep
erogol Feb 1, 2021
93a6bdf
linter fixes and version updates for deps
erogol Feb 1, 2021
1679018
Ups. Added missing ,
thorstenMueller Jan 30, 2021
08581de
linter updates
erogol Feb 5, 2021
b1e3160
waveRNN fix
bgerazov Feb 4, 2021
5e2bc8c
update wavernn test config, delete cap=True
bgerazov Feb 5, 2021
ed56944
improve robustness of defining wavernn in config file
bgerazov Feb 5, 2021
fa44e32
hubconf dependency
erogol Feb 5, 2021
2fbe4a1
fix gdown
erogol Feb 5, 2021
755872f
hubconf update
erogol Feb 5, 2021
d684375
update nose command for circle CI
erogol Feb 8, 2021
d57560d
reduce glowtts size for testing
erogol Feb 8, 2021
83e5075
use single process dataloder in tests
erogol Feb 8, 2021
2daca15
restructured arg parsing and processing to utils
gerazov Feb 6, 2021
6f06e31
changed train scripts
gerazov Feb 6, 2021
10d5a63
updated to current dev
gerazov Feb 6, 2021
b3c5cc2
final fixes
gerazov Feb 6, 2021
2b5cb24
final final fixes
gerazov Feb 6, 2021
3c96137
linter fixes
erogol Feb 9, 2021
49771f2
download github model releases by model manager
erogol Feb 9, 2021
a2e7403
test_compute_statistics.sh
erogol Feb 9, 2021
29d928d
css10 dataset preprocessor
erogol Feb 9, 2021
ee58ff2
add russian phoneme char
erogol Feb 10, 2021
59b1b13
spelling error. should be multiband not mulitband
adonispujols Feb 11, 2021
81b145c
spelling error. should be multiband not mulitband
adonispujols Feb 11, 2021
43b9510
fix the default vocoder name
erogol Feb 11, 2021
6cd642c
add missing phonemes to test_config.json
erogol Feb 11, 2021
1c1abb8
docstring update
erogol Feb 11, 2021
5f1018a
fix spelling of a def argument and parse phonemes from config.json if
erogol Feb 11, 2021
6bd8485
bug fix
erogol Feb 11, 2021
23b282f
define default vocoders
erogol Feb 11, 2021
8955333
use default vocoder in synthesize.py
erogol Feb 11, 2021
9fefc79
fix make_symbols
erogol Feb 11, 2021
00e0933
save_wav with a custom sampling rate
erogol Feb 11, 2021
3ccb015
return the json entry of the downloaded model
erogol Feb 11, 2021
d30608a
set an output_sample_rate in synthesizer and use it for writing the wav
erogol Feb 11, 2021
dad3565
use default vocoders in server.pu
erogol Feb 11, 2021
c6702b5
find unique characters in a dataset
erogol Feb 12, 2021
ae0d54d
Updating models list to include EK1 TTS/vocoder
nmstoker Feb 14, 2021
fe41084
author , license and contact info in .model.json
erogol Feb 15, 2021
e06c93f
model_manager tests
erogol Feb 15, 2021
8ec28b1
linterup date
erogol Feb 15, 2021
62aeacb
save used model characters to the checkpoints
erogol Feb 12, 2021
f9fe167
docstring update
erogol Feb 12, 2021
0b33acd
enable saving model characters in io.py
erogol Feb 12, 2021
62a8eba
parse_characters function
erogol Feb 12, 2021
194f82d
save default model chars to the training config file
erogol Feb 12, 2021
5b8a673
remove _phoneme_punctuations
erogol Feb 12, 2021
55fc50b
update test_text_processing for espeak-ng
erogol Feb 12, 2021
ee71eb4
linter fixes
erogol Feb 12, 2021
19e2193
update ci to espeak-ng
erogol Feb 12, 2021
39fbf2f
Update TTS/bin/find_unique_chars.py
Feb 15, 2021
93a83c0
Update TTS/utils/arguments.py
Feb 15, 2021
0e1e60b
remove redundancy
erogol Feb 15, 2021
534c341
linter update
erogol Feb 15, 2021
ffceccb
fix #655
erogol Feb 15, 2021
89b7f01
add encoding="utf-8"
adonispujols Feb 11, 2021
8993120
do not test server and modelManager until fixing #657
erogol Feb 17, 2021
2ca74b8
add RUSLAN dataset preprocessor
erogol Feb 17, 2021
4111df6
Docstrings for audioprocessor
erogol Feb 17, 2021
f2e474c
loading last checkpoint/best_model works, deleting last best models oโ€ฆ
gerazov Feb 12, 2021
2db4045
brushed up printing model load path and best loss path
gerazov Feb 12, 2021
6214799
added keep_best and keep_after to test configs.
gerazov Feb 12, 2021
8cefa76
reformated docstrings in arguments.py
gerazov Feb 12, 2021
2451a81
refactored keep_all_best
gerazov Feb 15, 2021
e3102e7
enable backward compat for loading the best model
erogol Feb 18, 2021
e6fd118
<add> Chinese mandarin implementation (tacotron2)
kirianguiller Feb 15, 2021
0d45253
modify according to PR reviews
kirianguiller Feb 16, 2021
e85658a
remove re.Match typing in '_number_replace()'
kirianguiller Feb 16, 2021
ab8994e
add pypinyin and jieba to requierements.txt (chinese implementation)
kirianguiller Feb 16, 2021
4966578
remove gst handling in synthetizer.py class
kirianguiller Feb 16, 2021
42ba30e
<add> Chinese mandarin implementation (tacotron2)
kirianguiller Feb 15, 2021
9ab07f9
modify according to PR reviews
kirianguiller Feb 16, 2021
557239d
remove re.Match typing in '_number_replace()'
kirianguiller Feb 16, 2021
628afe5
remove gst handling in synthetizer.py class
kirianguiller Feb 16, 2021
e03a426
bug fix
erogol Feb 18, 2021
008b8ba
github actions CI
erogol Mar 8, 2021
54104d3
pyworld acting up
erogol Mar 8, 2021
4422642
git ignore temp_build/X
erogol Mar 8, 2021
9a48ba3
a ton of linter updates
erogol Mar 8, 2021
d01546d
ci config update
erogol Mar 8, 2021
462fb5e
ci update
erogol Mar 8, 2021
e15734c
linter fix
erogol Mar 8, 2021
00b5090
linter fix
erogol Mar 8, 2021
988ae7a
update CI config
erogol Mar 8, 2021
594d8d8
linter fixes
erogol Mar 8, 2021
bc21877
CI config update, show OS info
erogol Mar 8, 2021
372cd3c
CI caching
erogol Mar 8, 2021
d43a46c
CI 'ci skip' check
erogol Mar 8, 2021
426bdbd
update the main README
erogol Mar 8, 2021
c16ad38
update server rEADME
erogol Mar 8, 2021
a519ed5
deprecate embedding models to the wheel
erogol Mar 8, 2021
4e41dee
update setup.py following prev commit
erogol Mar 8, 2021
dd4f46b
disable model manager test until moving models to github releases
erogol Mar 8, 2021
00bfa54
update server tests
erogol Mar 8, 2021
947e3d6
rename test
erogol Mar 8, 2021
5dcc4be
rebrand demo server
erogol Mar 8, 2021
1e169a5
bunch of short fixes
erogol Mar 8, 2021
9480523
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
erogol Mar 8, 2021
f24d48c
linter fix
erogol Mar 8, 2021
8a4dcd1
Add Dutch model
r-dh Feb 19, 2021
43379ee
fix the nl model and add the vocoder
erogol Mar 9, 2021
19bb9ba
fix tts endpoint using list-models argument
erogol Mar 9, 2021
fc19411
update some of the models to github releases
erogol Mar 10, 2021
599149a
downloading models from github releases
erogol Mar 10, 2021
4337e9f
pad_mode in torch_stft
erogol Mar 10, 2021
7782034
fix #369
erogol Mar 10, 2021
9e84c8a
do not copy scale_stats if exist in the output folder
erogol Mar 10, 2021
6c932c8
print the desc if required parameters are not provided
erogol Mar 10, 2021
4aba4e5
linter fx
erogol Mar 10, 2021
d260fb0
fix handling scale_stats.npy for models downloaded from Github rls
erogol Mar 10, 2021
e5bb317
fix model manager
erogol Mar 10, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 2 additions & 4 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,17 +25,15 @@ jobs:
- checkout
- run: |
sudo apt update
sudo apt install espeak git
sudo apt install espeak-ng git
- run: sudo pip install --upgrade pip
- run: sudo pip install -e .
- run: |
sudo pip install --quiet --upgrade cardboardlint pylint
cardboardlinter --refspec ${CIRCLE_BRANCH} -n auto
- run: nosetests tests --nocapture
- run: nosetests tests --nocapture --processes=0 --process-timeout=20 --process-restartworker
- run: |
sudo ./tests/test_server_package.sh
sudo ./tests/test_glow-tts_train.sh
sudo ./tests/test_server_package.sh
sudo ./tests/test_tacotron_train.sh
sudo ./tests/test_vocoder_gan_train.sh
sudo ./tests/test_vocoder_wavegrad_train.sh
Expand Down
60 changes: 60 additions & 0 deletions .github/workflows/main.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,60 @@
name: Test

on:
push:
branches:
- master
- dev
pull_request:
types: [opened, synchronize, reopened]
jobs:
check_skip:
runs-on: ubuntu-latest
if: "! contains(github.event.head_commit.message, '[ci skip]')"
steps:
- run: echo "${{ github.event.head_commit.message }}"

test:
runs-on: ubuntu-latest
strategy:
fail-fast: false
matrix:
python-version: [3.6, 3.7, 3.8]

steps:
- uses: actions/checkout@v2
- uses: actions/cache@v1
with:
path: ~/.cache/pip
key: ${{ runner.os }}-pip-${{ matrix.python-version }}-${{ hashFiles('**/setup.py') }}
- name: Set up Python ${{ matrix.python-version }}
uses: actions/setup-python@v2
with:
python-version: ${{ matrix.python-version }}
- name: check OS
run: cat /etc/os-release
- name: Install dependencies
run: |
sudo apt update
sudo apt install espeak-ng git
- name: Upgrade pip
# so we can take advantage of pyproject.toml build-dependency support
run: python3 -m pip install --upgrade pip
- name: Install TTS
run: |
python3 -m pip install .
python3 setup.py egg_info
- name: Lint check
run: |
cardboardlinter -n auto
- name: Unit tests
run: nosetests tests --nocapture --processes=0 --process-timeout=20 --process-restartworker
- name: Test scripts
run: |
./tests/test_demo_server.sh
./tests/test_glow-tts_train.sh
./tests/test_tacotron_train.sh
./tests/test_vocoder_gan_train.sh
./tests/test_vocoder_wavegrad_train.sh
./tests/test_vocoder_wavernn_train.sh
./tests/test_speedy_speech_train.sh
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -130,3 +130,4 @@ TODO.txt
data/*
notebooks/data/*
TTS/tts/layers/glow_tts/monotonic_align/core.c
temp_build/*
140 changes: 127 additions & 13 deletions CODE_OF_CONDUCT.md
Original file line number Diff line number Diff line change
@@ -1,19 +1,133 @@
# Ethical Notice

Please consider possible consequences and be mindful of any adversarial use cases of this project. In this regard, please contact us if you have any concerns.
# Contributor Covenant Code of Conduct

# Community Participation Guidelines
## Our Pledge

This repository is governed by Mozilla's code of conduct and etiquette guidelines.
For more details, please read the
[Mozilla Community Participation Guidelines](https://www.mozilla.org/about/governance/policies/participation/).
We as members, contributors, and leaders pledge to make participation in our
community a harassment-free experience for everyone, regardless of age, body
size, visible or invisible disability, ethnicity, sex characteristics, gender
identity and expression, level of experience, education, socio-economic status,
nationality, personal appearance, race, caste, color, religion, or sexual identity
and orientation.

## How to Report
For more information on how to report violations of the Community Participation Guidelines, please read our '[How to Report](https://www.mozilla.org/about/governance/policies/participation/reporting/)' page.
We pledge to act and interact in ways that contribute to an open, welcoming,
diverse, inclusive, and healthy community.

<!--
## Project Specific Etiquette
## Our Standards

In some cases, there will be additional project etiquette i.e.: (https://bugzilla.mozilla.org/page.cgi?id=etiquette.html).
Please update for your project.
-->
Examples of behavior that contributes to a positive environment for our
community include:

* Demonstrating empathy and kindness toward other people
* Being respectful of differing opinions, viewpoints, and experiences
* Giving and gracefully accepting constructive feedback
* Accepting responsibility and apologizing to those affected by our mistakes,
and learning from the experience
* Focusing on what is best not just for us as individuals, but for the
overall community

Examples of unacceptable behavior include:

* The use of sexualized language or imagery, and sexual attention or
advances of any kind
* Trolling, insulting or derogatory comments, and personal or political attacks
* Public or private harassment
* Publishing others' private information, such as a physical or email
address, without their explicit permission
* Other conduct which could reasonably be considered inappropriate in a
professional setting

## Enforcement Responsibilities

Community leaders are responsible for clarifying and enforcing our standards of
acceptable behavior and will take appropriate and fair corrective action in
response to any behavior that they deem inappropriate, threatening, offensive,
or harmful.

Community leaders have the right and responsibility to remove, edit, or reject
comments, commits, code, wiki edits, issues, and other contributions that are
not aligned to this Code of Conduct, and will communicate reasons for moderation
decisions when appropriate.

## Scope

This Code of Conduct applies within all community spaces, and also applies when
an individual is officially representing the community in public spaces.
Examples of representing our community include using an official e-mail address,
posting via an official social media account, or acting as an appointed
representative at an online or offline event.

## Enforcement

Instances of abusive, harassing, or otherwise unacceptable behavior may be
reported to the community leaders responsible for enforcement at
coc-report@coqui.ai.
All complaints will be reviewed and investigated promptly and fairly.

All community leaders are obligated to respect the privacy and security of the
reporter of any incident.

## Enforcement Guidelines

Community leaders will follow these Community Impact Guidelines in determining
the consequences for any action they deem in violation of this Code of Conduct:

### 1. Correction

**Community Impact**: Use of inappropriate language or other behavior deemed
unprofessional or unwelcome in the community.

**Consequence**: A private, written warning from community leaders, providing
clarity around the nature of the violation and an explanation of why the
behavior was inappropriate. A public apology may be requested.

### 2. Warning

**Community Impact**: A violation through a single incident or series
of actions.

**Consequence**: A warning with consequences for continued behavior. No
interaction with the people involved, including unsolicited interaction with
those enforcing the Code of Conduct, for a specified period of time. This
includes avoiding interactions in community spaces as well as external channels
like social media. Violating these terms may lead to a temporary or
permanent ban.

### 3. Temporary Ban

**Community Impact**: A serious violation of community standards, including
sustained inappropriate behavior.

**Consequence**: A temporary ban from any sort of interaction or public
communication with the community for a specified period of time. No public or
private interaction with the people involved, including unsolicited interaction
with those enforcing the Code of Conduct, is allowed during this period.
Violating these terms may lead to a permanent ban.

### 4. Permanent Ban

**Community Impact**: Demonstrating a pattern of violation of community
standards, including sustained inappropriate behavior, harassment of an
individual, or aggression toward or disparagement of classes of individuals.

**Consequence**: A permanent ban from any sort of public interaction within
the community.

## Attribution

This Code of Conduct is adapted from the [Contributor Covenant][homepage],
version 2.0, available at
[https://www.contributor-covenant.org/version/2/0/code_of_conduct.html][v2.0].

Community Impact Guidelines were inspired by
[Mozilla's code of conduct enforcement ladder][Mozilla CoC].

For answers to common questions about this code of conduct, see the FAQ at
[https://www.contributor-covenant.org/faq][FAQ]. Translations are available
at [https://www.contributor-covenant.org/translations][translations].

[homepage]: https://www.contributor-covenant.org
[v2.0]: https://www.contributor-covenant.org/version/2/0/code_of_conduct.html
[Mozilla CoC]: https://github.com/mozilla/diversity
[FAQ]: https://www.contributor-covenant.org/faq
[translations]: https://www.contributor-covenant.org/translations
53 changes: 31 additions & 22 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,18 @@
# <img src="images/coqui-log-green-TTS.png" height="56"/>

TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
:frog: TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
:frog: TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.

<!-- [![CircleCI](TODO)]() -->
[![License](<https://img.shields.io/badge/License-MPL%202.0-brightgreen.svg>)](https://opensource.org/licenses/MPL-2.0)
[![PyPI version](https://badge.fury.io/py/TTS.svg)](https://badge.fury.io/py/TTS)
[![Covenant](https://camo.githubusercontent.com/7d620efaa3eac1c5b060ece5d6aacfcc8b81a74a04d05cd0398689c01c4463bb/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f436f6e7472696275746f72253230436f76656e616e742d76322e3025323061646f707465642d6666363962342e737667)](https://github.com/coqui-ai/TTS/blob/master/CODE_OF_CONDUCT.md)

:loudspeaker: [English Voice Samples](https://erogol.github.io/ddc-samples/) and [SoundCloud playlist](https://soundcloud.com/user-565970875/pocket-article-wavernn-and-tacotron2)
๐Ÿ“ข [English Voice Samples](https://erogol.github.io/ddc-samples/) and [SoundCloud playlist](https://soundcloud.com/user-565970875/pocket-article-wavernn-and-tacotron2)

:man_cook: [TTS training recipes](https://github.com/erogol/TTS_recipes)
๐Ÿ‘ฉ๐Ÿฝโ€๐Ÿณ [TTS training recipes](https://github.com/erogol/TTS_recipes)

:page_facing_up: [Text-to-Speech paper collection](https://github.com/erogol/TTS-papers)
๐Ÿ“„ [Text-to-Speech paper collection](https://github.com/erogol/TTS-papers)

## ๐Ÿ’ฌ Where to ask questions
Please use our dedicated channels for questions and discussion. Help is much more valuable if it's shared publicly, so that more people can benefit from it.
Expand Down Expand Up @@ -87,24 +88,29 @@ Underlined "TTS*" and "Judy*" are :frog:TTS models
- WaveRNN: [origin](https://github.com/fatchord/WaveRNN/)
- WaveGrad: [paper](https://arxiv.org/abs/2009.00713)

You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
You can also help us implement more models. Some :frog: TTS related work can be found [here](https://github.com/erogol/TTS-papers).

## Install TTS
TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
:frog: TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.

If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released :frog: TTS models, installing from PyPI is the easiest option.

```bash
pip install TTS
```

If you plan to code or train models, clone TTS and install it locally.
If you plan to code or train models, clone :frog: TTS and install it locally.

```bash
git clone https://github.com/coqui-ai/TTS
pip install -e .
```

We use ```espeak-ng``` to convert graphemes to phonemes. You might need to install separately.
```bash
sudo apt-get install espeak-ng
```

## Directory Structure
```
|- notebooks/ (Jupyter Notebooks for model evaluation, parameter selection and data analysis.)
Expand Down Expand Up @@ -136,11 +142,11 @@ Audio examples: [soundcloud](https://soundcloud.com/user-565970875/pocket-articl
<img src="images/example_model_output.png?raw=true" alt="example_output" width="400"/>

## Datasets and Data-Loading
TTS provides a generic dataloader easy to use for your custom dataset.
:frog: TTS provides a generic dataloader easy to use for your custom dataset.
You just need to write a simple function to format the dataset. Check ```datasets/preprocess.py``` to see some examples.
After that, you need to set ```dataset``` fields in ```config.json```.

Some of the public datasets that we successfully applied TTS:
Some of the public datasets that we successfully applied :frog: TTS:

- [LJ Speech](https://keithito.com/LJ-Speech-Dataset/)
- [Nancy](http://www.cstr.ed.ac.uk/projects/blizzard/2011/lessac_blizzard2011/)
Expand All @@ -151,9 +157,9 @@ Some of the public datasets that we successfully applied TTS:

## Example: Synthesizing Speech on Terminal Using the Released Models.

After the installation, TTS provides a CLI interface for synthesizing speech using pre-trained models. You can either use your own model or the release models under the TTS project.
After the installation, :frog: TTS provides a CLI interface for synthesizing speech using pre-trained models. You can either use your own model or the release models under :frog: TTS.

Listing released TTS models.
Listing released :frog: TTS models.
```bash
tts --list_models
```
Expand Down Expand Up @@ -230,25 +236,28 @@ In case of any error or intercepted execution, if there is no checkpoint yet und
You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder.

## Contribution guidelines
Please send your Pull Request to ```dev``` branch. Before making a Pull Request, check your changes for basic mistakes and style problems by using a linter. We have cardboardlinter setup in this repository, so for example, if you've made some changes and would like to run the linter on just the changed code, you can use the follow command:
Please follow the steps below as you send a PR for :frog:. It helps us to keep things organized.

1. Create a new branch.
2. Implement your changes.
3. (if applicable) Add [Google Style](https://google.github.io/styleguide/pyguide.html#381-docstrings) docstrings.
4. (if applicable) Implement a test case under ```tests``` folder.
5. (Optional but Prefered) Run tests.
```bash
./run_tests.sh
```
6. Run the linter.
```bash
pip install pylint cardboardlint
cardboardlinter --refspec master
```
7. Send a PR to ```dev``` branch, explain what the change is about.
8. Let us discuss until we make it perfect :).
8. Let us discuss until we make it perfect :) ๐Ÿ’ช.
9. We merge it to the ```dev``` branch once things look good.

Feel free to ping us at any step you need help using our communication channels.
[Here](https://github.com/firstcontributions/first-contributions) is a good resource for complete beginners.

## Collaborative Experimentation Guide
If you like to use TTS to try a new idea and like to share your experiments with the community, we urge you to use the following guideline for a better collaboration.
(If you have an idea for better collaboration, let us know)
- Create a new branch.
- Open an issue pointing your branch.
- Explain your idea and experiment.
- Share your results regularly. (Tensorboard log files, audio results, visuals etc.)

### Acknowledgement
- https://github.com/keithito/tacotron (Dataset pre-processing)
Expand Down
Loading