This is a research project, not an official NVIDIA product.
Documentation: https://nvidia.github.io/OpenSeq2Seq/
OpenSeq2Seq's main goal is to allow researchers to explore various sequence-to-sequence models as efficiently as possible. This efficiency comes from full support for distributed and mixed-precision training. OpenSeq2Seq is built on TensorFlow and provides all the building blocks needed to train encoder-decoder models for neural machine translation, automatic speech recognition, and speech synthesis. We plan to extend it to other modalities in the future.
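To make the "building blocks" idea concrete, a model in OpenSeq2Seq is described by a plain Python configuration file that selects a model class together with its encoder, decoder, loss, and data layer. The sketch below is only an illustration of that structure, not a file from the repository; the class names and import paths are assumptions, so check the documentation for the exact ones available in your version.

```python
# Illustrative sketch of an OpenSeq2Seq-style config file (assumed class
# names and import paths; see the documentation for the real ones).
from open_seq2seq.models import Speech2Text
from open_seq2seq.encoders import DeepSpeech2Encoder
from open_seq2seq.decoders import FullyConnectedCTCDecoder
from open_seq2seq.losses import CTCLoss
from open_seq2seq.data import Speech2TextDataLayer

base_model = Speech2Text  # which sequence-to-sequence model to train

base_params = {
    "encoder": DeepSpeech2Encoder,        # maps audio features to a hidden representation
    "decoder": FullyConnectedCTCDecoder,  # maps the representation to character probabilities
    "loss": CTCLoss,                      # training objective
    "data_layer": Speech2TextDataLayer,   # reads and batches the dataset
    # each block also takes its own parameter dictionary (omitted here)
}
```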
- Sequence-to-sequence learning. Currently implemented:
- Neural Machine Translation (text2text)
- Automatic Speech Recognition (speech2text)
- Speech Synthesis (text2speech)
- Data-parallel distributed training
- Multi-GPU
- Multi-node
- Mixed precision training for NVIDIA Volta GPUs (see the configuration sketch below)
- TensorFlow >= 1.9
- Horovod >= 0.12.0 (Horovod is not required, but is highly recommended for multi-GPU setups)
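The distributed and mixed-precision features listed above are switched on through the model configuration. A minimal sketch, assuming the `run.py` entry point and the `"dtype"`, `"num_gpus"`, and `"use_horovod"` config keys described in the documentation (exact names can differ between versions):

```python
# Illustrative fragment of a config file's base_params (assumed key names).

# Single-node, multi-GPU data-parallel training without Horovod:
base_params = {
    "num_gpus": 4,      # replicate the model across 4 GPUs on one machine
    "dtype": "mixed",   # mixed precision: float16 compute with float32 master weights
}

# With Horovod, set "use_horovod": True instead of "num_gpus" and launch one
# process per GPU with MPI, e.g. (assumed command line):
#   mpirun -np 8 python run.py --config_file=<your_config>.py --mode=train
```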
The speech-to-text workflow uses some parts of the Mozilla DeepSpeech project.
The text-to-text workflow uses some functions from Tensor2Tensor and the Neural Machine Translation (seq2seq) Tutorial.
If you use OpenSeq2Seq, please cite this paper:
@article{openseq2seq,
  title={OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models},
  author={Kuchaiev, Oleksii and Ginsburg, Boris and Gitman, Igor and Lavrukhin, Vitaly and Case, Carl and Micikevicius, Paulius},
  journal={arXiv preprint arXiv:1805.10387},
  year={2018}
}