From 1155ec46ec37ebe334aa61eeb18958f57987fad8 Mon Sep 17 00:00:00 2001
From: lixinjian
Date: Wed, 2 Mar 2022 13:56:03 -0500
Subject: [PATCH] Mandarin ISO id should be CMN instead of ZHO

---
 egs2/README.md | 20 ++++++++++----------
 1 file changed, 10 insertions(+), 10 deletions(-)

diff --git a/egs2/README.md b/egs2/README.md
index 0f955869759..d03884f2ceb 100755
--- a/egs2/README.md
+++ b/egs2/README.md
@@ -8,19 +8,19 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2

 | Directory name | Corpus name | Task | Language | URL | Note |
 | ----------------------- | --------------------------------------------------------------------------------------- | ----------------------- | --------------------- | ------------------------------------------------------------------------------------------------------------ | ------------ |
-| aidatatang_200zh | Aidatatang_200zh A free Chinese Mandarin speech corpus | ASR | ZHO | http://www.openslr.org/resources/62 | |
-| aishell | AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus | ASR | ZHO | http://www.aishelltech.com/kysjcp | |
-| aishell3 | AISHELL3 Mandarin multi-speaker text-to-speech | TTS | ZHO | https://www.openslr.org/93/ | |
+| aidatatang_200zh | Aidatatang_200zh A free Chinese Mandarin speech corpus | ASR | CMN | http://www.openslr.org/resources/62 | |
+| aishell | AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus | ASR | CMN | http://www.aishelltech.com/kysjcp | |
+| aishell3 | AISHELL3 Mandarin multi-speaker text-to-speech | TTS | CMN | https://www.openslr.org/93/ | |
 | ami | The AMI Meeting Corpus | ASR | ENG | http://groups.inf.ed.ac.uk/ami/corpus/ | |
 | an4 | CMU AN4 database | ASR/TTS | ENG | http://www.speech.cs.cmu.edu/databases/an4/ | |
 | babel | IARPA Babel corups | ASR | ~20 languages | https://www.iarpa.gov/index.php/research-programs/babel | |
 | bn_openslr53 | Large bengali ASR training dataset | ASR | BEN | https://openslr.org/53/ | |
-| catslu | CATSLU-MAPS | SLU | ZHO | https://sites.google.com/view/catslu/home | |
+| catslu | CATSLU-MAPS | SLU | CMN | https://sites.google.com/view/catslu/home | |
 | chime4 | The 4th CHiME Speech Separation and Recognition Challenge | ASR/Multichannel ASR | ENG | http://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/ | |
 | cmu_indic | CMU INDIC | TTS | 7 languages | http://festvox.org/cmu_indic/ | |
 | commonvoice | The Mozilla Common Voice | ASR | 13 languages | https://voice.mozilla.org/datasets | |
 | csj | Corpus of Spontaneous Japanese | ASR | JPN | https://pj.ninjal.ac.jp/corpus_center/csj/en/ | |
-| csmsc | Chinese Standard Mandarin Speech Copus | TTS | ZHO | https://www.data-baker.com/open_source.html | |
+| csmsc | Chinese Standard Mandarin Speech Copus | TTS | CMN | https://www.data-baker.com/open_source.html | |
 | css10 | CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages | TTS | 10 langauges | https://github.com/Kyubyong/css10 | |
 | dirha_wsj | Distant-speech Interaction for Robust Home Applications | Multichannel ASR | ENG | https://dirha.fbk.eu/, https://github.com/SHINE-FBK/DIRHA_English_wsj | |
 | dns_ins20 | Deep Noise Suppression Challenge – INTERSPEECH 2020 | SE | 7 languages + singing | https://www.microsoft.com/en-us/research/academic-program/deep-noise-suppression-challenge-interspeech-2020/ | |
@@ -30,7 +30,7 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
 | fsc_challenge | Fluent Speech Commands Dataset MASE Eval Challenge splits | SLU | ENG | https://github.com/maseEval/mase | |
 | gigaspeech | GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio | ASR | ENG | https://github.com/SpeechColab/GigaSpeech | |
 | grabo | Grabo dataset | SLU | ENG + NLD | https://www.esat.kuleuven.be/psi/spraak/downloads/ | |
-| hkust | HKUST/MTS: A very large scale Mandarin telephone speech corpus | ASR | ZHO | https://catalog.ldc.upenn.edu/LDC2005S15 | |
+| hkust | HKUST/MTS: A very large scale Mandarin telephone speech corpus | ASR | CMN | https://catalog.ldc.upenn.edu/LDC2005S15 | |
 | hui_acg | HUI-audio-corpus-german | TTS | DEU | https://opendata.iisys.de/datasets.html#hui-audio-corpus-german | |
 | how2 | How2: A Large-scale Dataset for Multimodal Language Understanding | ASR/MT/ST | ENG->POR | https://github.com/srvk/how2-dataset | |
 | iemocap | IEMOCAP database: The Interactive Emotional Dyadic Motion Capture database | SLU | ENG | https://sail.usc.edu/iemocap/ | |
@@ -58,13 +58,13 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
 | nsc | National Speech Corpus | ASR | ENG-SG | https://www.imda.gov.sg/programme-listing/digital-services-lab/national-speech-corpus | |
 | open_li52 | Corpus combination with 52 languages(Commonvocie + voxforge) | Multilingual ASR | 52 languages | | |
 | polyphone_swiss_french | Swiss French Polyphone corpus | ASR | FRA | http://catalog.elra.info/en-us/repository/browse/ELRA-S0030_02 | |
-| primewords_chinese | Primewords Chinese Corpus Set 1 | ASR | ZHO | https://www.openslr.org/47/ | |
+| primewords_chinese | Primewords Chinese Corpus Set 1 | ASR | CMN | https://www.openslr.org/47/ | |
 | puebla_nahuatl | Highland Puebla Nahuatl corpus (endangered language in central Mexico) | ASR | HPN | https://www.openslr.org/92/ | |
 | reverb | REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge | ASR | ENG | https://reverb2014.dereverberation.com/ | |
 | ru_open_stt | Russian Open Speech To Text (STT/ASR) Dataset | ASR | RUS | https://github.com/snakers4/open_stt | |
 | ruslan | RUSLAN: Russian Spoken Language Corpus For Speech Synthesis | TTS | RUS | https://ruslan-corpus.github.io/ | |
 | snips | SNIPS: A dataset for spoken language understanding | SLU | ENG | https://github.com/sonos/spoken-language-understanding-research-datasets | |
-| seame | SEAME: a Mandarin-English Code-switching Speech Corpus in South-East Asia | ASR | ENG + ZHO | https://catalog.ldc.upenn.edu/LDC2015S04 | |
+| seame | SEAME: a Mandarin-English Code-switching Speech Corpus in South-East Asia | ASR | ENG + CMN | https://catalog.ldc.upenn.edu/LDC2015S04 | |
 | siwis | SIWIS: Spoken Interaction with Interpretation in Switzerland | TTS | FRA | https://https://datashare.ed.ac.uk/handle/10283/2353 | |
 | slue-voxceleb | SLUE: Spoken Language Understanding Evaluation | SLU | ENG | https://github.com/asappresearch/slue-toolkit | |
 | slurp | SLURP: A Spoken Language Understanding Resource Package | SLU | ENG | https://github.com/pswietojanski/slurp | |
@@ -76,7 +76,7 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
 | swbd | Switchboard Corpus for 2-channel Conversational Telephone Speech (300h) | ASR | ENG | https://catalog.ldc.upenn.edu/LDC97S62 | |
 | swbd_da | NXT Switchboard Annotations | SLU | ENG | https://catalog.ldc.upenn.edu/LDC2009T26 | |
 | tedlium2 | TED-LIUM corpus release 2 | ASR | ENG | https://www.openslr.org/19/, http://www.lrec-conf.org/proceedings/lrec2014/pdf/1104_Paper.pdf | |
-| thchs30 | A Free Chinese Speech Corpus Released by CSLT@Tsinghua University | TTS | ZHO | https://www.openslr.org/18/ | |
+| thchs30 | A Free Chinese Speech Corpus Released by CSLT@Tsinghua University | TTS | CMN | https://www.openslr.org/18/ | |
 | timit | TIMIT Acoustic-Phonetic Continuous Speech Corpus | ASR | ENG | https://catalog.ldc.upenn.edu/LDC93S1 | |
 | totonac | Highland Totonac corpus (endangered language in central Mexico) | ASR | TOS | http://www.openslr.org/107/ | |
 | tsukuyomi | つくよみちゃんコーパス | TTS | JPN | https://tyc.rei-yumesaki.net/material/corpus | |
@@ -84,7 +84,7 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
 | vctk_noisyreverb | Noisy reverberant speech database (48kHz) | SE | ENG | https://datashare.ed.ac.uk/handle/10283/2826 | |
 | vivos | VIVOS (Vietnamese corpus for ASR) | ASR | VIE | https://ailab.hcmus.edu.vn/vivos/ | |
 | voxforge | VoxForge | ASR | 7 languages | http://www.voxforge.org/ | |
-| wenetspeech | WenetSpeech: A 10000+ Hours Multi-domain Chinese Corpus for Speech Recognition | ASR | ZHO | https://wenet-e2e.github.io/WenetSpeech/ | |
+| wenetspeech | WenetSpeech: A 10000+ Hours Multi-domain Chinese Corpus for Speech Recognition | ASR | CMN | https://wenet-e2e.github.io/WenetSpeech/ | |
 | wham | The WSJ0 Hipster Ambient Mixtures (WHAM!) dataset | SE | ENG | https://wham.whisper.ai/ | |
 | whamr | WHAMR!: Noisy and Reverberant Single-Channel Speech Separation | SE | ENG | https://wham.whisper.ai/ | |
 | wsj | CSR-I (WSJ0) Complete, CSR-II (WSJ1) Complete | ASR | ENG | https://catalog.ldc.upenn.edu/LDC93S6A,https://catalog.ldc.upenn.edu/LDC94S13A | |