How much GPU memory is needed for finetuning stt_en_conformer_ctc_small? #3252
tareqalmuntasir7 asked this question in Q&A
Hi, I am trying to finetune stt_en_conformer_ctc_small (both encoder and decoder) on an RTX 3090 with 24 GB of GPU memory, but even with batch size 1 it throws OOM. Is this model too big, or is it a bug? I see there are only 51 million parameters in the model.
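For reference, a quick way to sanity-check the model size (a minimal sketch, assuming nemo_toolkit is installed and the checkpoint is available for download):

```python
# Minimal sketch: load the pretrained checkpoint and count its parameters,
# to rule out model size as the cause of the OOM.
import nemo.collections.asr as nemo_asr

model = nemo_asr.models.EncDecCTCModel.from_pretrained(
    model_name="stt_en_conformer_ctc_small"
)
n_params = sum(p.numel() for p in model.parameters())
print(f"Parameters: {n_params / 1e6:.1f}M")
```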
Answered by titu1994 on Nov 26, 2021:
What is your maximum audio length? It should not be more than 16-20 seconds during training. We can fit about batch size 16 in 32 GB of GPU memory at fp32, and batch size 32 at fp16 (but training is unstable with fp16).
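Long outlier clips in the training data are the usual cause of OOM at batch size 1. A hedged sketch for finding the longest utterance in a NeMo-style JSON-lines manifest (the manifest path is a placeholder):

```python
# Sketch: scan a NeMo-style manifest (one JSON object per line) for the
# longest utterance. "train_manifest.json" is an illustrative path.
import json

max_dur = 0.0
with open("train_manifest.json") as f:
    for line in f:
        entry = json.loads(line)
        max_dur = max(max_dur, entry["duration"])  # duration is in seconds
print(f"Longest training clip: {max_dur:.1f} s")
```

If the longest clip is well over 20 seconds, out-of-range samples can be skipped at load time: NeMo dataset configs accept min_duration/max_duration, so an override such as model.train_ds.max_duration=20.0 in the training script should let much larger batch sizes fit.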