Automatic Speech Recognition

VLSP 2018 Shared Task: Automatic Speech Recognition

In the ASR task, participants were asked to automatically transcribe Vietnamese audio files into the corresponding spoken word sequences. The committee provided only the test set; each team had to build its own training data for the acoustic and language models.

The test set consisted of 796 continuous wav files of news speech with a total duration of two hours and no sentence segmentation information. The speech was recorded in a quiet environment and covers three dialects: Northern, Southern and Central, in proportions of 50%, 40% and 10% respectively.
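As a rough sanity check on these figures, the sketch below derives the average file length and an approximate amount of audio per dialect. It assumes the total is exactly two hours and that the dialect proportions apply to duration; the source does not say whether they refer to duration or to file count, so the per-dialect numbers are only indicative. All variable names are illustrative.

```python
# Figures taken from the task description.
TOTAL_FILES = 796
TOTAL_SECONDS = 2 * 60 * 60  # "two hours", assumed exact

# Dialect shares in percent. ASSUMPTION: shares are by duration, not by file count.
DIALECT_PERCENT = {"Northern": 50, "Southern": 40, "Central": 10}

# Average length of one wav file, in seconds.
avg_seconds_per_file = TOTAL_SECONDS / TOTAL_FILES

# Approximate minutes of audio per dialect under the assumption above.
minutes_per_dialect = {
    dialect: percent * TOTAL_SECONDS / 100 / 60
    for dialect, percent in DIALECT_PERCENT.items()
}

print(f"average file length: {avg_seconds_per_file:.2f} s")
for dialect, minutes in minutes_per_dialect.items():
    print(f"{dialect}: {minutes:.0f} min")
```

Under these assumptions each file is about nine seconds long on average, so the Central dialect in particular is represented by only a small amount of audio.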

Leaderboard

| Model | WER | SER | Paper/Source | Code |
|---|---|---|---|---|
| VAIS | 6.29 | 75.50 | Do et al. VLSP'18 | |
| Viettel-CSC | 7.40 | 75.38 | Nguyen et al. VLSP'18 | |
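The leaderboard reports word error rate (WER) and sentence error rate (SER). The sketch below shows the common definitions of both: WER as the word-level edit distance divided by the number of reference words, and SER as the share of utterances with at least one error, both as percentages. This is not the official VLSP 2018 scoring script; text normalization and the way utterances were delimited for SER are not described in the source and are assumptions here. The function names and the sample sentences are illustrative.

```python
def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer_and_ser(references, hypotheses):
    """Return (WER, SER) in percent for paired reference/hypothesis strings.

    ASSUMPTION: whitespace tokenization and one string per utterance.
    """
    total_errors = 0
    total_words = 0
    wrong_sentences = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words, hyp_words = ref.split(), hyp.split()
        errors = edit_distance(ref_words, hyp_words)
        total_errors += errors
        total_words += len(ref_words)
        if errors > 0:
            wrong_sentences += 1
    wer = 100.0 * total_errors / total_words
    ser = 100.0 * wrong_sentences / len(references)
    return wer, ser


# Illustrative data, not taken from the test set.
refs = ["xin chào việt nam", "tôi là sinh viên"]
hyps = ["xin chào việt nam", "tôi là sinh"]
print(wer_and_ser(refs, hyps))  # (12.5, 50.0)
```

Because a single wrong word makes a whole utterance count as an error, SER can stay high even when WER is low, which is consistent with the gap between the two columns above.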

Miscellaneous

📜 Papers

💫 Services

📁 Dataset