Add steps for document of getting dataset 'SF Bilingual Speech' (#7378)
* Add steps for document of getting dataset 'SF Bilingual Speech'

Signed-off-by: Robin Dong <robin.k.dong@gmail.com>

* Update datasets.rst

added a link from a tutorial demonstrating detailed data prep steps.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

---------

Signed-off-by: Robin Dong <robin.k.dong@gmail.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
2 people authored and gshennvm committed Sep 23, 2023
1 parent 59d1135 commit 1d75e32
Showing 1 changed file with 11 additions and 5 deletions.
16 changes: 11 additions & 5 deletions docs/source/tts/datasets.rst
@@ -172,18 +172,24 @@ SFSpeech Chinese/English Bilingual Speech
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
* Dataset URL: https://catalog.ngc.nvidia.com/orgs/nvidia/resources/sf_bilingual_speech_zh_en
* Dataset Processing Script: https://github.com/NVIDIA/NeMo/tree/stable/scripts/dataset_processing/tts/sfbilingual/get_data.py
-* Command Line Instruction:
+* Command Line Instruction: please refer to the details in Section 1 (NGC Registry CLI installation), Section 2 (Downloading SFSpeech Dataset), and Section 3 (Creating Data Manifests) of https://github.com/NVIDIA/NeMo/blob/main/tutorials/tts/FastPitch_ChineseTTS_Training.ipynb. The code block below briefly describes the steps.

.. code-block:: bash
# [prerequisite] Install and set up the 'ngc' CLI tool by following the documentation at https://docs.ngc.nvidia.com/cli/cmd.html
$ ngc registry resource download-version "nvidia/sf_bilingual_speech_zh_en:v1"
$ unzip sf_bilingual_speech_zh_en_vv1/SF_bilingual.zip -d <your_local_dataset_root>
$ python scripts/dataset_processing/tts/sfbilingual/get_data.py \
---data-root <your_local_dataset_root> \
---val-size 0.1 \
---test-size 0.2 \
+--data-root <your_local_dataset_root>/SF_bilingual \
+--val-size 0.005 \
+--test-size 0.01 \
--seed-for-ds-split 100
$ python scripts/dataset_processing/tts/extract_sup_data.py \
--config-path sfbilingual/ds_conf \
--config-name ds_for_fastpitch_align.yaml \
manifest_filepath=<your_path_to_train_manifest> \
-sup_data_path=<your_path_to_where_to_save_supplementary_data>
+sup_data_path=<your_path_to_where_to_save_supplementary_data>
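
The change above lowers ``--val-size`` and ``--test-size`` from 0.1 and 0.2 to 0.005 and 0.01. As a rough illustration of what such values imply, the following is a minimal Python sketch. It assumes the sizes are fractions of the total number of manifest entries and that the seed fixes a shuffle before splitting. ``split_counts`` and ``split_entries`` are hypothetical helpers written for this note, not functions from ``get_data.py``, whose actual implementation may differ, and the entry count used in the example is made up.

```python
import random


def split_counts(num_entries, val_size, test_size):
    """Return (train, val, test) counts for fractional split sizes.

    Assumption: val_size and test_size are fractions of the whole dataset.
    """
    num_val = round(num_entries * val_size)
    num_test = round(num_entries * test_size)
    return num_entries - num_val - num_test, num_val, num_test


def split_entries(entries, val_size, test_size, seed):
    """Shuffle entries with a fixed seed, then cut them into train/val/test."""
    shuffled = list(entries)
    random.Random(seed).shuffle(shuffled)  # same seed gives the same order
    _, num_val, num_test = split_counts(len(shuffled), val_size, test_size)
    val = shuffled[:num_val]
    test = shuffled[num_val:num_val + num_test]
    train = shuffled[num_val + num_test:]
    return train, val, test


if __name__ == "__main__":
    # Hypothetical dataset of 10,000 utterance IDs, for illustration only.
    utterances = [f"utt_{i:05d}" for i in range(10_000)]
    for val_size, test_size in [(0.1, 0.2), (0.005, 0.01)]:
        train, val, test = split_entries(utterances, val_size, test_size, seed=100)
        print(val_size, test_size, len(train), len(val), len(test))
```

Under these assumptions, the smaller fractions leave far more utterances for training, and reusing the same seed reproduces the same partition on every run.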
