
Setup fine-tuning script for the models #185

Open
nshmyrev opened this issue Aug 12, 2020 · 28 comments

Labels
help wanted Extra attention is needed

Comments

@nshmyrev
Collaborator

As in

daanzu/kaldi-active-grammar#33

gooofy/zamia-speech#106

nshmyrev added the help wanted (Extra attention is needed) label on Aug 12, 2020
@dpny518

dpny518 commented Aug 21, 2020

Can you provide the data/local/dict for this model?
http://alphacephei.com/vosk/models/vosk-model-small-en-us-0.3.zip
I'll help you write the script that downloads the dict and this model, fine-tunes on the data/train folder, and outputs vosk-model-small-en-us-new.
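The download step of such a script is straightforward; here is a minimal sketch, assuming only the model URL above (the dict is not shipped in the zip and would still have to be published separately, as requested here):

```sh
# Fetch and unpack the small US English model (URL from the comment above).
wget http://alphacephei.com/vosk/models/vosk-model-small-en-us-0.3.zip
unzip vosk-model-small-en-us-0.3.zip
# data/local/dict is not included in the zip, so fine-tuning still
# depends on it being provided separately.
```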

@federico-zb

Could you also provide it for "vosk-model-small-es-0.3"? Thank you very much.
I'm trying to fine-tune it, and after that I'll document the process.

@nshmyrev
Collaborator Author

nshmyrev commented Nov 4, 2020

@lalimili6

@Ashutosh1995

@nshmyrev I am trying to adapt a model trained on Indian-accented English to wake-word data. I have set up the dataset in the Kaldi format, but I cannot work out how to change the paths in the script to point to my dataset and model.

@Ashutosh1995

Ashutosh1995 commented Nov 22, 2020

@nshmyrev could you please provide data/lang, data/local/lang, and the chain tree dir for the Indian English vosk zip folder?

@nshmyrev
Collaborator Author

A more straightforward gist:

https://gist.github.com/daanzu/d29e18abb9e21ccf1cddc8c3e28054ff#file-run_finetune_tdnn_1a_daanzu-sh

@Ashutosh1995

@nshmyrev can you please provide the files needed to run daanzu's fine-tuning script on the Indian English accent model?

@nshmyrev
Collaborator Author

nshmyrev commented Mar 3, 2021

@plefebvre91

plefebvre91 commented Mar 26, 2021

Hi :)

Is it planned to complete the documentation on acoustic model fine-tuning (here: https://alphacephei.com/vosk/adaptation)? The procedure is very unclear for now... For example:

  • is fine-tuning possible only for a few models, or for all of them?
  • how am I supposed to organize the files in the model to run fine-tuning? The layout seems different from the usual Kaldi organization and I'm not sure what I'm doing...

@LuggerMan

LuggerMan commented Apr 28, 2021

Hi again. Everybody is asking for the input files needed to fine-tune. When will they be released?

P.S. I don't quite understand: help has been wanted since August, yet the files you surely already have were never uploaded.

@Archan2607

Hi, I am also working on fine-tuning the Indian English vosk model.
Can anyone please guide me to proper documentation or the steps to follow?

Also, @Ashutosh1995, I read one of your threads on this issue; did you have any success with it? Can you please discuss?

Thanks

@LuggerMan

LuggerMan commented Dec 9, 2021

So, per #773,

you need lattices produced with nnet3/align_lats.sh.

align_lats.sh takes feats.scp as input; where could I find that?

@nshmyrev
Collaborator Author

nshmyrev commented Dec 9, 2021

align_lats.sh takes feats.scp as input; where could I find that?

Feats are created with make_mfcc.sh from the data folder containing wav.scp/segments.
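A minimal sketch of that step, assuming a standard Kaldi egs checkout; the data/finetune and output directory names are placeholders, not paths from this thread:

```sh
# Compute MFCC features and CMVN stats for the adaptation data.
# data/finetune must contain wav.scp, text, utt2spk, spk2utt
# (and optionally segments). All directory names are assumptions.
. ./path.sh
steps/make_mfcc.sh --nj 4 --cmd run.pl \
  data/finetune exp/make_mfcc/finetune mfcc
# chain/nnet3 models usually expect hires features; if so, add
# --mfcc-config conf/mfcc_hires.conf above.
steps/compute_cmvn_stats.sh data/finetune exp/make_mfcc/finetune mfcc
utils/fix_data_dir.sh data/finetune  # drop utterances with missing feats
```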

@LuggerMan

@nshmyrev so I basically need to extract feats from the data on which the model was trained, am I right?

@nshmyrev
Collaborator Author

nshmyrev commented Dec 9, 2021

@nshmyrev so I basically need to extract feats from the data on which the model was trained, am I right?

From the adaptation data; you do not need the training data.

@LuggerMan

From the adaptation data; you do not need the training data.

Ah, OK, now I get it! Thank you.

@nabil6391

@nshmyrev is there by any chance a video tutorial on fine-tuning Kaldi or Vosk models? It would be great. Thanks

@vikraman22

vikraman22 commented Apr 26, 2022

Hi @nshmyrev, I'm trying to fine-tune the US English model. It requires the vosk-model-en-us-0.22-compile/exp/finetune_ali directory to contain final.mdl, ali.*.gz, and the tree file. I have these files for the data with which I'm trying to fine-tune, but the data previously used to train the model is not available to the public from alphacep.

I got these files from a Kaldi model I was training from scratch, from the kaldi/egs/mini_librispeech/s5/exp/mono directory. Can I actually use the files from this directory, or could I use files from other directories such as tri3b, tri2b, etc.? Note: I used the same data to train and am also using it to fine-tune the US English model.

Is the data used while training the model also required? Also, final.mdl is initially only available in ./exp/chain/tdnn/final.mdl. Can I use the same file for the ./exp/nnet3/tdnn_sp/ and ./exp/finetune_ali/ directories?

@Ashutosh1995

@Archan2607 apologies for the late reply, but I was only temporarily involved in the ASR training and couldn't work out the training part completely.

@nshmyrev
Collaborator Author

@vikraman22 please note that we do not have an official fine-tuning tutorial, so this has to be a trial-and-error path.

For trial and error, you'd better ask one question at a time and try to solve the simple questions yourself; there is no need to ask me to do simple things.

Your chances of getting help increase if you submit documentation on fine-tuning and the fine-tuning setup as a pull request to our codebase, just like the part we already have on training.

@Ratevandr

Hi! I am trying to fine-tune the vosk-model-ru-0.22 model. I use the "run_finetune_tdnn_1a_daanzu.sh" script for this, and I am missing the ali.*.gz files. How can I generate them?
I tried using the "steps/nnet3/align.sh" script, but got this error:
ERROR (apply-cmvn[5.5.1009~1-e4940]:Value():util/kaldi-table-inl.h:164) Failed to load object from /home/shmyrev/kaldi/egs/ac/vosk-model-ru-0.22-compile/mfcc/raw_mfcc_test_sova_devices.1.ark:41 (to suppress this error, add the permissive (p, ) option to the rspecifier.

@nshmyrev
Collaborator Author

How can I generate them?

With steps/nnet3/align.sh

but got error

There must be an earlier error, since the feature files are missing.
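For reference, a minimal sketch of that alignment step; the directory names (data/finetune, data/lang, exp/chain/tdnn) are assumptions, not paths from the released model:

```sh
# Align the adaptation data with the existing acoustic model to
# produce exp/finetune_ali/ali.*.gz. All paths are assumptions.
steps/nnet3/align.sh --nj 4 --cmd run.pl \
  --scale-opts '--transition-scale=1.0 --acoustic-scale=1.0 --self-loop-scale=1.0' \
  data/finetune data/lang exp/chain/tdnn exp/finetune_ali
# If the model was trained with i-vectors, also pass
# --online-ivector-dir pointing at i-vectors extracted for data/finetune.
# The apply-cmvn error above means the feature files themselves are
# missing -- rerun make_mfcc.sh/compute_cmvn_stats.sh and check the logs.
```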

@qp450

qp450 commented Jan 31, 2024

Hello! I am also trying to run daanzu's finetuning script to finetune the German model vosk-model-de-0.21 and am looking for the ali.*.gz files. I had a look at steps/nnet3/align.sh, as suggested in the previous response, but if I understand correctly, that script requires the data-dir - as in data/train - to run, which is not present in the downloaded model. Could you provide the ali.*.gz files or indicate which directory to use as the data-dir?
Thank you very much in advance!

@nshmyrev
Collaborator Author

indicate which directory to use as the data-dir

The one with the audio samples you are going to use for fine-tuning.
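For context, a typical Kaldi data dir for those audio samples looks like this; the directory name data/finetune is an assumption, the file names are standard Kaldi:

```
data/finetune/
  wav.scp    # utterance-id -> path to the audio (or a pipe producing wav)
  text       # utterance-id -> transcript
  utt2spk    # utterance-id -> speaker-id
  spk2utt    # derived: utils/utt2spk_to_spk2utt.pl utt2spk > spk2utt
  segments   # optional: utterance-id recording-id start-time end-time
```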

@qp450

qp450 commented Jan 31, 2024

Thank you for your reply! I read in this comment of the fine-tuning discussion that if the alignment files are generated from the very small amount of fine-tuning data, as opposed to the large amount of training data, they might be of far inferior quality. This seemed to be confirmed in daanzu's reply, who then provided the alignment files for the English model. That is why I thought the initial alignment files were necessary.

@nshmyrev
Collaborator Author

they might be of far inferior quality.

No, that is wrong. The alignment is just the timestamps of the phonemes; it doesn't depend on the amount of data.

You do not need the original model's alignments to fine-tune.
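One way to see this: Kaldi's show-alignments prints alignments as per-frame phone sequences and nothing more (paths assumed as in the earlier sketches):

```sh
# Dump alignments as phone sequences; they carry only phone timing,
# so they can be regenerated from the adaptation data alone.
# Paths are assumptions matching the sketches above.
show-alignments data/lang/phones.txt exp/finetune_ali/final.mdl \
  "ark:gunzip -c exp/finetune_ali/ali.1.gz |" | head
```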

@qp450

qp450 commented Feb 1, 2024

Ok great, thanks very much for your help!
