Add the ReazonSpeech recipe #1330

Triplecq · 2024-05-02T16:54:13Z

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

The dataset is available on Hugging Face. For more details, please visit:

I created this recipe by copying "aishell4" recipe, and stripping the most of the contents. Signed-off-by: Fujimoto Seiji <fujimoto@ceptord.net>

…/lhotse Experiemntal version for ReazonSpeech

pzelasko

Is it possible to add a download method if it’s publicly available?

pzelasko · 2024-05-02T18:31:37Z

lhotse/recipes/reazonspeech.py

+ duration=item["duration"],
+ channel=0,
+ language="Japanese",
+ speaker=str(idx),


If speaker information is not available, just omit this field.

pzelasko · 2024-05-02T18:35:44Z

lhotse/recipes/reazonspeech.py

+from lhotse.utils import Pathlike
+
+
+def prepare_reazonspeech(


Since reazonspeech is a large dataset this method may require a lot of cpu memory before it writes anything. Would you consider modifying the recipe to resemble gigaspeech more closely? It writes examples as it processes them for reduced memory usage https://github.com/lhotse-speech/lhotse/blob/master/lhotse/recipes/gigaspeech.py

pzelasko · 2024-05-02T18:36:55Z

Also, could you add an entry in the dataset table in docs/corpus.rst?

Triplecq · 2024-05-02T20:14:09Z

Thanks for your quick feedback and suggestions! I will get back to your comments soon. :)

pzelasko

thank you, LGTM once tests pass

Triplecq · 2024-05-20T15:55:46Z

Thanks for your quick feedback! I'll fix those checks very soon. :)

pzelasko · 2024-05-29T11:13:32Z

It seems the tests are failing on importing num2words, can you make it into a local import guarded by is_module_available (pls search lhotse sources for is_module_available to see an example of import guard for optional dependencies).

Triplecq · 2024-05-29T18:19:48Z

@pzelasko Thanks for the note! I've already changed it to local import. Please feel free to let me know if there's anything else I need to improve. :)

fujimotos and others added 4 commits December 18, 2023 15:47

Add stub ReazonSpeech recipe

57d939c

I created this recipe by copying "aishell4" recipe, and stripping the most of the contents. Signed-off-by: Fujimoto Seiji <fujimoto@ceptord.net>

Merge tag 'rs-experiment' of kdm00:/mnt/syno128/volume1/fujimotos/git…

085482b

…/lhotse Experiemntal version for ReazonSpeech

Merge remote-tracking branch 'upstream/master' into reazonspeech-recipe

96d03ac

Format the script with black to meet style guidelines

2dfa49f

Triplecq mentioned this pull request May 2, 2024

Zipformer recipe for ReazonSpeech k2-fsa/icefall#1611

Merged

pzelasko reviewed May 2, 2024

View reviewed changes

Triplecq added 4 commits May 8, 2024 20:20

Add ReazonSpeech to the dataset table

49a4591

Add a download method and refactor the prepare function

734fdff

Merge branch 'master' into reazonspeech-recipe

a4d23c2

Fix the TypeError when download the subset

63baf08

pzelasko previously approved these changes May 20, 2024

View reviewed changes

Format to follow the code style

da7726b

Triplecq dismissed pzelasko’s stale review via da7726b May 20, 2024 16:10

pzelasko previously approved these changes May 21, 2024

View reviewed changes

pzelasko enabled auto-merge (squash) May 21, 2024 00:20

Triplecq added 2 commits May 29, 2024 14:01

Merge branch 'master' into reazonspeech-recipe

b755119

Change to local import

9ec9c80

auto-merge was automatically disabled May 29, 2024 18:04
Head branch was pushed to by a user without write access

Triplecq dismissed pzelasko’s stale review via 9ec9c80 May 29, 2024 18:04

pzelasko previously approved these changes May 29, 2024

View reviewed changes

pzelasko added this to the v1.24.0 milestone May 29, 2024

Format to follow the code style

5bab888

Triplecq dismissed pzelasko’s stale review via 5bab888 May 29, 2024 20:21

pzelasko approved these changes May 29, 2024

View reviewed changes

pzelasko enabled auto-merge (squash) May 29, 2024 20:33

pzelasko merged commit c778520 into lhotse-speech:master May 29, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the ReazonSpeech recipe #1330

Add the ReazonSpeech recipe #1330

Triplecq commented May 2, 2024

pzelasko left a comment

pzelasko May 2, 2024

pzelasko May 2, 2024

pzelasko commented May 2, 2024

Triplecq commented May 2, 2024

pzelasko left a comment •

edited

Loading

Triplecq commented May 20, 2024

pzelasko commented May 29, 2024

Triplecq commented May 29, 2024

Add the ReazonSpeech recipe #1330

Add the ReazonSpeech recipe #1330

Conversation

Triplecq commented May 2, 2024

pzelasko left a comment

Choose a reason for hiding this comment

pzelasko May 2, 2024

Choose a reason for hiding this comment

pzelasko May 2, 2024

Choose a reason for hiding this comment

pzelasko commented May 2, 2024

Triplecq commented May 2, 2024

pzelasko left a comment • edited Loading

Choose a reason for hiding this comment

Triplecq commented May 20, 2024

pzelasko commented May 29, 2024

Triplecq commented May 29, 2024

pzelasko left a comment •

edited

Loading