From 56d80e640e52fbbb67585f67d017b66387f68859 Mon Sep 17 00:00:00 2001
From: Robin Dong <robin.k.dong@gmail.com>
Date: Wed, 6 Sep 2023 21:19:16 +1000
Subject: [PATCH 1/2] Add steps for document of getting dataset 'SF Bilingual
 Speech'

Signed-off-by: Robin Dong <robin.k.dong@gmail.com>
---
 docs/source/tts/datasets.rst | 10 ++++++++--
 1 file changed, 8 insertions(+), 2 deletions(-)

diff --git a/docs/source/tts/datasets.rst b/docs/source/tts/datasets.rst
index b5317ce01f64..4c4d6c105566 100644
--- a/docs/source/tts/datasets.rst
+++ b/docs/source/tts/datasets.rst
@@ -176,8 +176,14 @@ SFSpeech Chinese/English Bilingual Speech
 
 .. code-block:: bash
 
+    # [prerequisite] Install and set up the 'ngc' CLI tool by following the documentation at https://docs.ngc.nvidia.com/cli/cmd.html
+
+    $ ngc registry resource download-version "nvidia/sf_bilingual_speech_zh_en:v1"
+
+    $ unzip sf_bilingual_speech_zh_en_vv1/SF_bilingual.zip -d <your_local_dataset_root>
+
     $ python scripts/dataset_processing/tts/sfbilingual/get_data.py \
-        --data-root <your_local_dataset_root> \
+        --data-root <your_local_dataset_root>/SF_bilingual \
         --val-size 0.1 \
         --test-size 0.2 \
         --seed-for-ds-split 100
@@ -186,4 +192,4 @@ SFSpeech Chinese/English Bilingual Speech
         --config-path sfbilingual/ds_conf \
         --config-name ds_for_fastpitch_align.yaml \
         manifest_filepath=<your_path_to_train_manifest> \
-        sup_data_path=<your_path_to_where_to_save_supplementary_data>
\ No newline at end of file
+        sup_data_path=<your_path_to_where_to_save_supplementary_data>

From eb78d6b41f9045ca210029c7230e6f425868fa62 Mon Sep 17 00:00:00 2001
From: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Date: Tue, 19 Sep 2023 00:35:11 -0700
Subject: [PATCH 2/2] Update datasets.rst

added a link from a tutorial demonstrating detailed data prep steps.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
---
 docs/source/tts/datasets.rst | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/source/tts/datasets.rst b/docs/source/tts/datasets.rst
index 4c4d6c105566..dabf50b30dae 100644
--- a/docs/source/tts/datasets.rst
+++ b/docs/source/tts/datasets.rst
@@ -172,7 +172,7 @@ SFSpeech Chinese/English Bilingual Speech
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 * Dataset URL: https://catalog.ngc.nvidia.com/orgs/nvidia/resources/sf_bilingual_speech_zh_en
 * Dataset Processing Script: https://github.com/NVIDIA/NeMo/tree/stable/scripts/dataset_processing/tts/sfbilingual/get_data.py
-* Command Line Instruction:
+* Command Line Instruction: for details, refer to Section 1 (NGC Registry CLI installation), Section 2 (Downloading SFSpeech Dataset), and Section 3 (Creating Data Manifests) of https://github.com/NVIDIA/NeMo/blob/main/tutorials/tts/FastPitch_ChineseTTS_Training.ipynb. The code block below briefly describes the steps.
 
 .. code-block:: bash
 
@@ -184,8 +184,8 @@ SFSpeech Chinese/English Bilingual Speech
 
     $ python scripts/dataset_processing/tts/sfbilingual/get_data.py \
         --data-root <your_local_dataset_root>/SF_bilingual \
-        --val-size 0.1 \
-        --test-size 0.2 \
+        --val-size 0.005 \
+        --test-size 0.01 \
         --seed-for-ds-split 100
 
     $ python scripts/dataset_processing/tts/extract_sup_data.py \