Name		Name	Last commit message	Last commit date
parent directory ..
conformer_ctc		conformer_ctc
conformer_ctc2		conformer_ctc2
conformer_ctc3		conformer_ctc3
conformer_mmi		conformer_mmi
conv_emformer_transducer_stateless		conv_emformer_transducer_stateless
conv_emformer_transducer_stateless2		conv_emformer_transducer_stateless2
local		local
lstm_transducer_stateless		lstm_transducer_stateless
lstm_transducer_stateless2		lstm_transducer_stateless2
lstm_transducer_stateless3		lstm_transducer_stateless3
pruned2_knowledge		pruned2_knowledge
pruned_stateless_emformer_rnnt2		pruned_stateless_emformer_rnnt2
pruned_transducer_stateless		pruned_transducer_stateless
pruned_transducer_stateless2		pruned_transducer_stateless2
pruned_transducer_stateless3		pruned_transducer_stateless3
pruned_transducer_stateless4		pruned_transducer_stateless4
pruned_transducer_stateless5		pruned_transducer_stateless5
pruned_transducer_stateless6		pruned_transducer_stateless6
pruned_transducer_stateless7		pruned_transducer_stateless7
pruned_transducer_stateless7_ctc		pruned_transducer_stateless7_ctc
pruned_transducer_stateless8		pruned_transducer_stateless8
streaming_conformer_ctc		streaming_conformer_ctc
tdnn_lstm_ctc		tdnn_lstm_ctc
transducer		transducer
transducer_lstm		transducer_lstm
transducer_stateless		transducer_stateless
transducer_stateless2		transducer_stateless2
transducer_stateless_multi_datasets		transducer_stateless_multi_datasets
zipformer_mmi		zipformer_mmi
.gitignore		.gitignore
README.md		README.md
RESULTS-100hours.md		RESULTS-100hours.md
RESULTS.md		RESULTS.md
add_alignments.sh		add_alignments.sh
distillation_with_hubert.sh		distillation_with_hubert.sh
generate-lm.sh		generate-lm.sh
prepare.sh		prepare.sh
prepare_giga_speech.sh		prepare_giga_speech.sh
shared		shared

README.md

Introduction

Please refer to https://icefall.readthedocs.io/en/latest/recipes/librispeech/index.html for how to run models in this recipe.

./RESULTS.md contains the latest results.

Transducers

There are various folders containing the name transducer in this folder. The following table lists the differences among them.

	Encoder	Decoder	Comment
`transducer`	Conformer	LSTM
`transducer_stateless`	Conformer	Embedding + Conv1d	Using optimized_transducer from computing RNN-T loss
`transducer_stateless2`	Conformer	Embedding + Conv1d	Using torchaudio for computing RNN-T loss
`transducer_lstm`	LSTM	LSTM
`transducer_stateless_multi_datasets`	Conformer	Embedding + Conv1d	Using data from GigaSpeech as extra training data
`pruned_transducer_stateless`	Conformer	Embedding + Conv1d	Using k2 pruned RNN-T loss
`pruned_transducer_stateless2`	Conformer(modified)	Embedding + Conv1d	Using k2 pruned RNN-T loss
`pruned_transducer_stateless3`	Conformer(modified)	Embedding + Conv1d	Using k2 pruned RNN-T loss + using GigaSpeech as extra training data
`pruned_transducer_stateless4`	Conformer(modified)	Embedding + Conv1d	same as pruned_transducer_stateless2 + save averaged models periodically during training
`pruned_transducer_stateless5`	Conformer(modified)	Embedding + Conv1d	same as pruned_transducer_stateless4 + more layers + random combiner
`pruned_transducer_stateless6`	Conformer(modified)	Embedding + Conv1d	same as pruned_transducer_stateless4 + distillation with hubert
`pruned_transducer_stateless7`	Zipformer	Embedding + Conv1d	First experiment with Zipformer from Dan
`pruned_transducer_stateless7_ctc`	Zipformer	Embedding + Conv1d	Same as pruned_transducer_stateless7, but with extra CTC head
`pruned_transducer_stateless8`	Zipformer	Embedding + Conv1d	Same as pruned_transducer_stateless7, but using extra data from GigaSpeech
`pruned_stateless_emformer_rnnt2`	Emformer(from torchaudio)	Embedding + Conv1d	Using Emformer from torchaudio for streaming ASR
`conv_emformer_transducer_stateless`	ConvEmformer	Embedding + Conv1d	Using ConvEmformer for streaming ASR + mechanisms in reworked model
`conv_emformer_transducer_stateless2`	ConvEmformer	Embedding + Conv1d	Using ConvEmformer with simplified memory for streaming ASR + mechanisms in reworked model
`lstm_transducer_stateless`	LSTM	Embedding + Conv1d	Using LSTM with mechanisms in reworked model
`lstm_transducer_stateless2`	LSTM	Embedding + Conv1d	Using LSTM with mechanisms in reworked model + gigaspeech (multi-dataset setup)

The decoder in transducer_stateless is modified from the paper Rnn-Transducer with Stateless Prediction Network. We place an additional Conv1d layer right after the input embedding layer.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ASR

ASR

README.md

Introduction

Transducers

Files

ASR

Directory actions

More options

Directory actions

More options

Latest commit

History

ASR

Folders and files

parent directory

README.md

Introduction

Transducers