Skip to content

Speech2text system for Ukrainian language (Project at prj-nlp-2020)

Notifications You must be signed in to change notification settings

dsakovych/stt_uk

Repository files navigation

Speech2text for Ukrainian language

This STT system trained using Kaldi framework.
System contains of g2p model and kaldi training recipe.

Current project state:

g2p: https://github.com/kotestyle/g2p_uk
g2p model was trained separately with tf. Details in its repo.

asr model: trained on 84 hour of voxforge and librivox data.
training recipe

Language model: SRILM
Audio features: MFCC and CMVN
Acoustic model: HMM-GMM
Training: Delta+delta-delta, LDA-MLLT, SAT
Alignment: fMLLR

Metrics results

Model LM order (SRILM) train/test, hours WAcc, %
mono 2 1 / 0.1 4 %
mono 2 5 / 1 9 %
Tri5 (LDA + MLLT + SAT) 2-3 83 / 1 31.13 %

Data source

Source link
voxforge http://www.repository.voxforge1.org/downloads/uk/
librivox https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/
youtube (in progress) youtube-data.xlsx

How to run

How to use:

About

Speech2text system for Ukrainian language (Project at prj-nlp-2020)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published