Skip to content

Commit

Permalink
Merge pull request espnet#4125 from xinjli/xinjli/fix_mandarin_iso_id
Browse files Browse the repository at this point in the history
Mandarin ISO id should be CMN instead of ZHO
  • Loading branch information
ftshijt authored Mar 2, 2022
2 parents 9863980 + 1155ec4 commit b274c4e
Showing 1 changed file with 10 additions and 10 deletions.
20 changes: 10 additions & 10 deletions egs2/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,19 +8,19 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2

| Directory name | Corpus name | Task | Language | URL | Note |
| ----------------------- | --------------------------------------------------------------------------------------- | ----------------------- | --------------------- | ------------------------------------------------------------------------------------------------------------ | ------------ |
| aidatatang_200zh | Aidatatang_200zh A free Chinese Mandarin speech corpus | ASR | ZHO | http://www.openslr.org/resources/62 | |
| aishell | AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus | ASR | ZHO | http://www.aishelltech.com/kysjcp | |
| aishell3 | AISHELL3 Mandarin multi-speaker text-to-speech | TTS | ZHO | https://www.openslr.org/93/ | |
| aidatatang_200zh | Aidatatang_200zh A free Chinese Mandarin speech corpus | ASR | CMN | http://www.openslr.org/resources/62 | |
| aishell | AISHELL-ASR0009-OS1 Open Source Mandarin Speech Corpus | ASR | CMN | http://www.aishelltech.com/kysjcp | |
| aishell3 | AISHELL3 Mandarin multi-speaker text-to-speech | TTS | CMN | https://www.openslr.org/93/ | |
| ami | The AMI Meeting Corpus | ASR | ENG | http://groups.inf.ed.ac.uk/ami/corpus/ | |
| an4 | CMU AN4 database | ASR/TTS | ENG | http://www.speech.cs.cmu.edu/databases/an4/ | |
| babel | IARPA Babel corups | ASR | ~20 languages | https://www.iarpa.gov/index.php/research-programs/babel | |
| bn_openslr53 | Large bengali ASR training dataset | ASR | BEN | https://openslr.org/53/ | |
| catslu | CATSLU-MAPS | SLU | ZHO | https://sites.google.com/view/catslu/home | |
| catslu | CATSLU-MAPS | SLU | CMN | https://sites.google.com/view/catslu/home | |
| chime4 | The 4th CHiME Speech Separation and Recognition Challenge | ASR/Multichannel ASR | ENG | http://spandh.dcs.shef.ac.uk/chime_challenge/chime2016/ | |
| cmu_indic | CMU INDIC | TTS | 7 languages | http://festvox.org/cmu_indic/ | |
| commonvoice | The Mozilla Common Voice | ASR | 13 languages | https://voice.mozilla.org/datasets | |
| csj | Corpus of Spontaneous Japanese | ASR | JPN | https://pj.ninjal.ac.jp/corpus_center/csj/en/ | |
| csmsc | Chinese Standard Mandarin Speech Copus | TTS | ZHO | https://www.data-baker.com/open_source.html | |
| csmsc | Chinese Standard Mandarin Speech Copus | TTS | CMN | https://www.data-baker.com/open_source.html | |
| css10 | CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages | TTS | 10 langauges | https://github.com/Kyubyong/css10 | |
| dirha_wsj | Distant-speech Interaction for Robust Home Applications | Multichannel ASR | ENG | https://dirha.fbk.eu/, https://github.com/SHINE-FBK/DIRHA_English_wsj | |
| dns_ins20 | Deep Noise Suppression Challenge – INTERSPEECH 2020 | SE | 7 languages + singing | https://www.microsoft.com/en-us/research/academic-program/deep-noise-suppression-challenge-interspeech-2020/ | |
Expand All @@ -30,7 +30,7 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
| fsc_challenge | Fluent Speech Commands Dataset MASE Eval Challenge splits | SLU | ENG | https://github.com/maseEval/mase | |
| gigaspeech | GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio | ASR | ENG | https://github.com/SpeechColab/GigaSpeech | |
| grabo | Grabo dataset | SLU | ENG + NLD | https://www.esat.kuleuven.be/psi/spraak/downloads/ | |
| hkust | HKUST/MTS: A very large scale Mandarin telephone speech corpus | ASR | ZHO | https://catalog.ldc.upenn.edu/LDC2005S15 | |
| hkust | HKUST/MTS: A very large scale Mandarin telephone speech corpus | ASR | CMN | https://catalog.ldc.upenn.edu/LDC2005S15 | |
| hui_acg | HUI-audio-corpus-german | TTS | DEU | https://opendata.iisys.de/datasets.html#hui-audio-corpus-german | |
| how2 | How2: A Large-scale Dataset for Multimodal Language Understanding | ASR/MT/ST | ENG->POR | https://github.com/srvk/how2-dataset | |
| iemocap | IEMOCAP database: The Interactive Emotional Dyadic Motion Capture database | SLU | ENG | https://sail.usc.edu/iemocap/ | |
Expand Down Expand Up @@ -58,13 +58,13 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
| nsc | National Speech Corpus | ASR | ENG-SG | https://www.imda.gov.sg/programme-listing/digital-services-lab/national-speech-corpus | |
| open_li52 | Corpus combination with 52 languages(Commonvocie + voxforge) | Multilingual ASR | 52 languages | | |
| polyphone_swiss_french | Swiss French Polyphone corpus | ASR | FRA | http://catalog.elra.info/en-us/repository/browse/ELRA-S0030_02 | |
| primewords_chinese | Primewords Chinese Corpus Set 1 | ASR | ZHO | https://www.openslr.org/47/ | |
| primewords_chinese | Primewords Chinese Corpus Set 1 | ASR | CMN | https://www.openslr.org/47/ | |
| puebla_nahuatl | Highland Puebla Nahuatl corpus (endangered language in central Mexico) | ASR | HPN | https://www.openslr.org/92/ | |
| reverb | REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge | ASR | ENG | https://reverb2014.dereverberation.com/ | |
| ru_open_stt | Russian Open Speech To Text (STT/ASR) Dataset | ASR | RUS | https://github.com/snakers4/open_stt | |
| ruslan | RUSLAN: Russian Spoken Language Corpus For Speech Synthesis | TTS | RUS | https://ruslan-corpus.github.io/ | |
| snips | SNIPS: A dataset for spoken language understanding | SLU | ENG | https://github.com/sonos/spoken-language-understanding-research-datasets | |
| seame | SEAME: a Mandarin-English Code-switching Speech Corpus in South-East Asia | ASR | ENG + ZHO | https://catalog.ldc.upenn.edu/LDC2015S04 | |
| seame | SEAME: a Mandarin-English Code-switching Speech Corpus in South-East Asia | ASR | ENG + CMN | https://catalog.ldc.upenn.edu/LDC2015S04 | |
| siwis | SIWIS: Spoken Interaction with Interpretation in Switzerland | TTS | FRA | https://https://datashare.ed.ac.uk/handle/10283/2353 | |
| slue-voxceleb | SLUE: Spoken Language Understanding Evaluation | SLU | ENG | https://github.com/asappresearch/slue-toolkit | |
| slurp | SLURP: A Spoken Language Understanding Resource Package | SLU | ENG | https://github.com/pswietojanski/slurp | |
Expand All @@ -76,15 +76,15 @@ See: https://espnet.github.io/espnet/espnet2_tutorial.html#recipes-using-espnet2
| swbd | Switchboard Corpus for 2-channel Conversational Telephone Speech (300h) | ASR | ENG | https://catalog.ldc.upenn.edu/LDC97S62 | |
| swbd_da | NXT Switchboard Annotations | SLU | ENG | https://catalog.ldc.upenn.edu/LDC2009T26 | |
| tedlium2 | TED-LIUM corpus release 2 | ASR | ENG | https://www.openslr.org/19/, http://www.lrec-conf.org/proceedings/lrec2014/pdf/1104_Paper.pdf | |
| thchs30 | A Free Chinese Speech Corpus Released by CSLT@Tsinghua University | TTS | ZHO | https://www.openslr.org/18/ | |
| thchs30 | A Free Chinese Speech Corpus Released by CSLT@Tsinghua University | TTS | CMN | https://www.openslr.org/18/ | |
| timit | TIMIT Acoustic-Phonetic Continuous Speech Corpus | ASR | ENG | https://catalog.ldc.upenn.edu/LDC93S1 | |
| totonac | Highland Totonac corpus (endangered language in central Mexico) | ASR | TOS | http://www.openslr.org/107/ | |
| tsukuyomi | つくよみちゃんコーパス | TTS | JPN | https://tyc.rei-yumesaki.net/material/corpus | |
| vctk | English Multi-speaker Corpus for CSTR Voice Cloning Toolkit | TTS | ENG | http://www.udialogue.org/download/cstr-vctk-corpus.html | |
| vctk_noisyreverb | Noisy reverberant speech database (48kHz) | SE | ENG | https://datashare.ed.ac.uk/handle/10283/2826 | |
| vivos | VIVOS (Vietnamese corpus for ASR) | ASR | VIE | https://ailab.hcmus.edu.vn/vivos/ | |
| voxforge | VoxForge | ASR | 7 languages | http://www.voxforge.org/ | |
| wenetspeech | WenetSpeech: A 10000+ Hours Multi-domain Chinese Corpus for Speech Recognition | ASR | ZHO | https://wenet-e2e.github.io/WenetSpeech/ | |
| wenetspeech | WenetSpeech: A 10000+ Hours Multi-domain Chinese Corpus for Speech Recognition | ASR | CMN | https://wenet-e2e.github.io/WenetSpeech/ | |
| wham | The WSJ0 Hipster Ambient Mixtures (WHAM!) dataset | SE | ENG | https://wham.whisper.ai/ | |
| whamr | WHAMR!: Noisy and Reverberant Single-Channel Speech Separation | SE | ENG | https://wham.whisper.ai/ | |
| wsj | CSR-I (WSJ0) Complete, CSR-II (WSJ1) Complete | ASR | ENG | https://catalog.ldc.upenn.edu/LDC93S6A,https://catalog.ldc.upenn.edu/LDC94S13A | |
Expand Down

0 comments on commit b274c4e

Please sign in to comment.