Thanks for your paper.
I have a question about the ablation without joint modelling. I do not quite understand this part: "Finally, to study the impact of joint-modelling framework, we pre-train a model with independent word-MLM and phoneme-MLM tasks A3, where each task sees the context from respective sequence only." Could you please elaborate on it? Does "independent" MLM mean two separate models, or a single model to which the word sequence and the phoneme sequence are fed in parallel instead of being concatenated?