Question : Conformer RNNT / Decoder training #3737

amejri · 2022-02-23T15:20:18Z

Hi again guys,

I have a question about the Conformer RNNT model, specifically the decoder part (the predictor, or prediction network). Since the decoder takes only the transcript text as input and not the audio, is there an easy way to train it separately on a huge text dataset and then load it into the Conformer RNNT model?
In other words, is there a way to train the decoder on its own and then load it into the Conformer model?
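
For illustration only, the "load it into the model" step asked about here could look like the sketch below. It uses plain dictionaries as stand-ins for checkpoints; the `decoder.` prefix and every parameter name are hypothetical and would need to be checked against the real model. With PyTorch, the merged dictionary would then be passed to the model's `load_state_dict`. Note that this only covers the mechanics of swapping weights: the joint network was trained together with the original decoder, so the swapped-in weights may not work without further fine-tuning.

```python
from typing import Any, Dict


def merge_prediction_network(
    full_state: Dict[str, Any],
    prednet_state: Dict[str, Any],
    prefix: str = "decoder.",  # assumed prefix, verify against the real model
) -> Dict[str, Any]:
    """Return a copy of full_state with the decoder entries replaced.

    Keys in prednet_state are given without the prefix. Every key must
    already exist in the full model, so typos fail loudly.
    """
    merged = dict(full_state)  # shallow copy, the input is left untouched
    for name, value in prednet_state.items():
        key = prefix + name
        if key not in full_state:
            raise KeyError(f"{key!r} is not a parameter of the full model")
        merged[key] = value
    return merged


# Toy stand-ins for real checkpoints; values would be tensors in practice.
full_state = {
    "encoder.layer0.weight": [0.1, 0.2],
    "decoder.embed.weight": [0.0, 0.0],
    "decoder.rnn.weight": [0.0, 0.0],
    "joint.out.weight": [0.3, 0.4],
}
text_only_state = {
    "embed.weight": [1.0, 2.0],
    "rnn.weight": [3.0, 4.0],
}

merged = merge_prediction_network(full_state, text_only_state)
```

Only the entries under the chosen prefix change; the encoder and joint entries are carried over as they were.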

With kind regards,

titu1994 · 2022-02-24T07:10:10Z

Maintainer

I wouldn't call it an easy way, but there is research out there on text-only adaptation. We have not implemented it yet, but we plan to in the future.

eesungkim · 2022-02-25T16:16:58Z

I don't think there is an easy way to train the RNNT on text only, without the encoder part (Conformer), due to the nature of the transducer model. Instead, there is a recent study that proposes text adaptation by using the prediction network of the RNNT model as a language model.
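
As a rough sketch of what treating the prediction network as a language model means: the network sees the previous tokens and is scored on predicting the next one, which needs only text. The snippet below shows that objective in plain Python. The logits are assumed to come from some output layer placed on top of the prediction network; that layer, the function names, and the start symbol id are hypothetical, and this is not necessarily the exact recipe of the study mentioned above.

```python
import math
from typing import List, Sequence, Tuple


def shift_for_lm(token_ids: Sequence[int], start_id: int) -> Tuple[List[int], List[int]]:
    """Build (inputs, targets) for next-token prediction.

    Inputs are the tokens shifted right behind a start symbol;
    targets are the tokens themselves.
    """
    inputs = [start_id] + list(token_ids[:-1])
    targets = list(token_ids)
    return inputs, targets


def next_token_cross_entropy(
    logits: Sequence[Sequence[float]], targets: Sequence[int]
) -> float:
    """Mean negative log-likelihood of targets under softmax(logits)."""
    if len(logits) != len(targets) or len(targets) == 0:
        raise ValueError("logits and targets must be non-empty and aligned")
    total = 0.0
    for row, target in zip(logits, targets):
        # Log-sum-exp with the max subtracted for numerical stability.
        peak = max(row)
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        total += log_norm - row[target]
    return total / len(targets)


# One transcript as token ids; id 0 is used as the (assumed) start symbol.
inputs, targets = shift_for_lm([5, 6, 7], start_id=0)

# Uninformative logits over a 4-token vocabulary give a loss of log(4).
uniform_loss = next_token_cross_entropy([[0.0] * 4, [0.0] * 4], [1, 3])
```

Minimizing this loss over a text corpus updates only the prediction network and the output layer, which is why no audio or encoder is involved.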

As far as I know, NeMo does not support an LM for Conformer RNNT models yet. I hope it will be supported soon.

2 replies

titu1994 Feb 25, 2022
Maintainer

Yes, we are looking into supporting text-only RNNT adaptation based on that paper. It will take some time before we get all the RNNT features out, since it's a relatively complicated model.

We do plan to use that paper for our RNNT adaptation effort, btw. We just need to allocate time to set up the code and experiments.

For external LM support, @VahidooX was looking into it, but there are some higher-priority tasks right now, so that will take a bit of time. Btw, we very much welcome external contributions, so if you have the time you could contribute to it.

VahidooX Feb 26, 2022
Collaborator

Please check it out here: #3412 (comment)

zzomg · 2023-02-01T21:07:44Z

@titu1994 @VahidooX Hi! Sorry for re-opening this discussion. I've been looking into domain adaptation of Conformer Transducer models, and I noticed you mentioned that you're planning to implement what's proposed in this article in NeMo. Has it been implemented, or are there any other, easier ways of training a Conformer Transducer that one could use in NeMo at the moment?

lee-onidas · 2023-11-14T23:06:30Z

@titu1994 I'm also interested in text-only adaptation of the RNNT decoder network. Has anyone managed to implement the method described in the paper: https://arxiv.org/pdf/2104.11127.pdf

ArtyomZemlyak · 2024-01-07T03:36:25Z

@titu1994 It's a very interesting approach! It would be good to have this in NeMo.
