Speech2text for Ukrainian language

This STT system was trained using the Kaldi framework.
It consists of a g2p model and a Kaldi training recipe.

Current project state:

g2p: https://github.com/kotestyle/g2p_uk
The g2p model was trained separately with TensorFlow. Details are in its repo.

ASR model: trained on 84 hours of voxforge and librivox data.
Training recipe:

Language model: SRILM
Audio features: MFCC and CMVN
Acoustic model: HMM-GMM
Training: Delta+delta-delta, LDA-MLLT, SAT
Alignment: fMLLR
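
To make the CMVN step in the feature line above more concrete, here is a minimal pure-Python sketch of cepstral mean and variance normalization over one utterance. It is an illustration only, not the code Kaldi runs: the function name `apply_cmvn` and the toy feature values are assumptions, and Kaldi recipes usually accumulate these statistics per speaker rather than per utterance.

```python
import math


def apply_cmvn(frames, eps=1e-10):
    """Normalize every feature dimension to zero mean and unit variance.

    `frames` is a list of equal-length feature vectors (one per frame),
    e.g. MFCCs for a single utterance. Hypothetical helper, not a Kaldi API.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    # Per-dimension mean over all frames.
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Per-dimension (population) variance, floored to avoid division by zero.
    variances = [sum((f[d] - means[d]) ** 2 for f in frames) / n for d in range(dim)]
    stds = [math.sqrt(max(v, eps)) for v in variances]
    return [[(f[d] - means[d]) / stds[d] for d in range(dim)] for f in frames]


# Toy 2-dimensional "MFCC" frames (made-up numbers).
toy_mfcc = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = apply_cmvn(toy_mfcc)
```

After normalization both dimensions have the same scale, which is the point of CMVN: it reduces channel and speaker level differences before the acoustic model sees the features.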

Metrics results

Model	LM order (SRILM)	train/test, hours	WAcc, %
mono	2	1 / 0.1	4 %
mono	2	5 / 1	9 %
Tri5 (LDA + MLLT + SAT)	2-3	83 / 1	31.13 %
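
The README does not say how WAcc was computed. Assuming it is the usual word accuracy, i.e. 100 % minus the word error rate (substitutions, deletions and insertions divided by the number of reference words), it can be sketched as follows. The function names and the sample sentences are illustrative assumptions, not part of this repo.

```python
def word_errors(reference, hypothesis):
    """Return (edit_distance_in_words, number_of_reference_words)."""
    ref = reference.split()
    hyp = hypothesis.split()
    # Classic Levenshtein distance over words, keeping one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1], len(ref)


def word_accuracy(pairs):
    """Word accuracy in percent over (reference, hypothesis) pairs."""
    total_errors = 0
    total_words = 0
    for reference, hypothesis in pairs:
        errors, words = word_errors(reference, hypothesis)
        total_errors += errors
        total_words += words
    return 100.0 * (1.0 - total_errors / total_words)


# Made-up example: one reference word is missing from the hypothesis.
sample = [("добрий день всім вам", "добрий день всім")]
wacc = word_accuracy(sample)
```

Note that with many insertions this quantity can go below zero, so low values like those in the mono rows are plausible for a weak model.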

Data source

Source	link
voxforge	http://www.repository.voxforge1.org/downloads/uk/
librivox	https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/
youtube (in progress)	youtube-data.xlsx

How to run

Build Kaldi from source
Prepare voxforge data with the sample notebook
Prepare librivox data with the sample notebook
Prepare the Kaldi project with the project notebook
Adjust the configs and recipe as needed
Cross fingers and hope it will run w/o errors :-)
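
For orientation, the data preparation steps above ultimately have to produce a Kaldi data directory. The sketch below writes the three core files (`wav.scp`, `text`, `utt2spk`) in the standard Kaldi layout. It is not the code from the notebooks: the function name, the utterance IDs, the paths and the transcripts are all made up for illustration.

```python
import os
import tempfile


def write_kaldi_data_dir(utterances, out_dir):
    """Write wav.scp, text and utt2spk for a list of utterances.

    `utterances` holds (utt_id, speaker_id, wav_path, transcript) tuples.
    Hypothetical helper; Kaldi expects the files sorted by utterance ID,
    and prefixing each utt_id with its speaker ID keeps the orderings consistent.
    """
    os.makedirs(out_dir, exist_ok=True)
    rows = sorted(utterances, key=lambda u: u[0])
    files = {
        "wav.scp": [f"{utt} {wav}" for utt, _spk, wav, _txt in rows],
        "text": [f"{utt} {txt}" for utt, _spk, _wav, txt in rows],
        "utt2spk": [f"{utt} {spk}" for utt, spk, _wav, _txt in rows],
    }
    for name, lines in files.items():
        with open(os.path.join(out_dir, name), "w", encoding="utf-8") as fh:
            fh.write("\n".join(lines) + "\n")


# Made-up utterances; real IDs and paths come from the prepared corpora.
demo_dir = tempfile.mkdtemp()
write_kaldi_data_dir(
    [
        ("spk2-utt001", "spk2", "/data/spk2/utt001.wav", "добрий день"),
        ("spk1-utt001", "spk1", "/data/spk1/utt001.wav", "дякую"),
    ],
    demo_dir,
)
```

From a directory like this, Kaldi's `utils/utt2spk_to_spk2utt.pl` can derive `spk2utt`, and `utils/validate_data_dir.sh` can check that everything is consistent before training starts.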

How to use:

Place the model in the appropriate folder of the Kaldi project
Fill in config.py
Run decode_kaldi.py with the file_path argument
