Train sentences #16
Comments
Hi, glad it's been working for you. If there is a lot of ambiguity, I would say it could work if you feed whole sentences, albeit that is quite memory-hungry. You could also try the autoregressive model and feed each word plus its context as input, with just the word's phonemes as target (make sure you use some word separator in the context).
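The word-plus-context idea above could be sketched roughly as follows. This is a hypothetical helper, not DeepPhonemizer's actual API: the `<sep>` token and the pair format are illustrative assumptions about how one might build such training examples.

```python
# Build (input, target) pairs for an autoregressive G2P model: the input is
# the focus word plus the whole sentence as context, joined by a separator
# token, and the target is the phonemes of the focus word only.
SEP = "<sep>"  # illustrative word-separator token (an assumption)

def make_context_pairs(words, phonemes):
    """words: graphemes of a sentence; phonemes: aligned phoneme strings."""
    pairs = []
    for i, word in enumerate(words):
        context = SEP.join(words)          # full sentence as context
        inp = f"{word}{SEP}{context}"      # focus word + separator + context
        pairs.append((inp, phonemes[i]))   # target: focus word's phonemes only
    return pairs

pairs = make_context_pairs(["nous", "étions"], ["nuz", "etjɔ̃"])
# first pair: ("nous<sep>nous<sep>étions", "nuz")
```

The point of keeping the target restricted to a single word is that the model can attend to the context (e.g. a following vowel triggering liaison) without having to emit phonemes for the whole sentence.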
Hi, thanks a lot for your work! Relating to this issue, I would like some advice on training my model. Specifically, I want to see whether a transformer/autoregressive transformer model can learn liaison in French. To this end, I generated training data where each line lists one or two words as graphemes together with their phonemic transcription (e.g., line 1: Nous / nu, line 2: Nous étions / nuz etjɔ̃ — here you can see that the liaison /z/ in the word 'nous' occurs in a certain context, in this case /e/ following the first word). I trained both models, and for some reason they failed to transcribe liaison. I updated the text/phoneme symbols in the config file and decreased the batch size to 16; other than that, everything remained the same. I also checked the number of liaison occurrences to address a potential imbalance between cases with and without liaison. Do you think I am missing something, such that the trained model does not manage to detect the context needed to produce liaison at all? Thanks in advance for your advice.
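The balance check described above could be done with a quick tally over the training file. This is a sketch under assumptions: the "graphemes / phonemes" line format is taken from the example above, and liaison is detected heuristically as a linking consonant (here /z/) at the end of the first word's phonemes in multi-word entries.

```python
# Count liaison vs. non-liaison examples in a "graphemes / phonemes" file,
# assuming liaison surfaces as a linking consonant (e.g. /z/) at the end
# of the first word's phoneme string in multi-word entries.
def count_liaison(lines, linking="z"):
    with_liaison = without = 0
    for line in lines:
        graphemes, _, phonemes = line.partition(" / ")
        words, phones = graphemes.split(), phonemes.split()
        if len(words) < 2:
            continue  # single-word entries cannot show liaison
        if phones[0].endswith(linking):
            with_liaison += 1
        else:
            without += 1
    return with_liaison, without

counts = count_liaison(
    ["Nous / nu", "Nous étions / nuz etjɔ̃", "Nous parlons / nu paʁlɔ̃"]
)
# → (1, 1): one liaison example, one non-liaison example
```

A strongly skewed ratio here would support the imbalance hypothesis; a roughly even one would point toward the model or input format instead.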
Hi. I was able to train an Italian model almost perfectly, with the exception of a few words that are intrinsically ambiguous without context. Since your model is similar to the BERT transformer, what do you think would be the best way to let the model learn words with context? Would passing whole sentences be enough, or should MLM be implemented?