Verifying what is sufficient for causal training #7972

cinjon · 2023-12-05T17:34:06Z

cinjon
Dec 5, 2023

I am using this config - https://github.com/NVIDIA/NeMo/blob/ae5d7e81b8e446e5650082b1700eb92dd2e7c1bd/examples/asr/conf/fastconformer/hybrid_cache_aware_streaming/fastconformer_hybrid_transducer_ctc_bpe_streaming.yaml.

And I'd like to train / finetune a model for live streaming that doesn't get any context from future frames. In other words, it must be doing cache-aware streaming and only able to make predictions on what it's heard so far and what it's predicted so far. It seems like doing conv_context_size: "causal" is not sufficient for this, but I actually also need to set the right_context = 0. Is that right?

Answered by titu1994

Jan 5, 2024

NeMo/examples/asr/conf/fastconformer/hybrid_cache_aware_streaming/fastconformer_hybrid_transducer_ctc_bpe_streaming.yaml

Line 117 in ae5d7e8

att_context_size: [70, 13] # -1 means unlimited context

- set this to [left, 0]

NeMo/examples/asr/conf/fastconformer/hybrid_cache_aware_streaming/fastconformer_hybrid_transducer_ctc_bpe_streaming.yaml

Line 131 in ae5d7e8

conv_context_size: causal

to causal

View full answer

titu1994 · 2024-01-05T02:21:04Z

titu1994
Jan 5, 2024
Maintainer

NeMo/examples/asr/conf/fastconformer/hybrid_cache_aware_streaming/fastconformer_hybrid_transducer_ctc_bpe_streaming.yaml

Line 117 in ae5d7e8

att_context_size: [70, 13] # -1 means unlimited context

- set this to [left, 0]

NeMo/examples/asr/conf/fastconformer/hybrid_cache_aware_streaming/fastconformer_hybrid_transducer_ctc_bpe_streaming.yaml

Line 131 in ae5d7e8

conv_context_size: causal

to causal

0 replies

VahidooX · 2024-01-05T20:32:22Z

VahidooX
Jan 5, 2024
Collaborator

Also set causal_downsampling=true to make downsampling causal. Using layernorm instead of bacthnorm or disabling normalization for the preprocessing is also needed to make it easier for streaming. But if you using that streaming config, all these stuff are already set, and you just need to set att_context_size to what @titu1994 suggested.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verifying what is sufficient for causal training #7972

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Verifying what is sufficient for causal training #7972

cinjon Dec 5, 2023

Replies: 2 comments

titu1994 Jan 5, 2024 Maintainer

VahidooX Jan 5, 2024 Collaborator

cinjon
Dec 5, 2023

titu1994
Jan 5, 2024
Maintainer

VahidooX
Jan 5, 2024
Collaborator