PaddleSpeech

What is PaddleSpeech?

PaddleSpeech is an open-source toolkit on PaddlePaddle platform for two critical tasks in Speech - Speech-to-Text (Automatic Speech Recognition, ASR) and Text-to-Speech Synthesis (TTS), with modules involving state-of-art and influential models.

What can PaddleSpeech do?

Speech-to-Text

PaddleSpeech ASR mainly consists of components below:

Implementation of models and commonly used neural network layers.
Dataset abstraction and common data preprocessing pipelines.
Ready-to-run experiments.

PaddleSpeech ASR provides you with a complete ASR pipeline, including:

Data Preparation
- Build vocabulary
- Compute Cepstral mean and variance normalization (CMVN)
- Featrue extraction
  - linear
  - fbank (also support kaldi feature)
  - mfcc
Acoustic Models
- Deepspeech2 (Streaming and Non-Streaming)
- Transformer (Streaming and Non-Streaming)
- Conformer (Streaming and Non-Streaming)
Decoder
- ctc greedy search (used in DeepSpeech2, Transformer and Conformer)
- ctc beam search (used in DeepSpeech2, Transformer and Conformer)
- attention decoding (used in Transformer and Conformer)
- attention rescoring (used in Transformer and Conformer)

Speech-to-Text helps you training the ASR model very simply.

Text-to-Speech

TTS mainly consists of components below:

Implementation of models and commonly used neural network layers.
Dataset abstraction and common data preprocessing pipelines.
Ready-to-run experiments.

PaddleSpeech TTS provides you with a complete TTS pipeline, including:

Text FrontEnd
- Rule based Chinese frontend.
Acoustic Models
- FastSpeech2
- SpeedySpeech
- TransformerTTS
- Tacotron2
Vocoders
- Multi Band MelGAN
- Parallel WaveGAN
- WaveFlow
Voice Cloning
- Transfer Learning from Speaker Verification to Multispeaker Text-to-Speech Synthesis
- GE2E

Text-to-Speech helps you to train TTS models with simple commands.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

introduction.md

introduction.md

PaddleSpeech

What is PaddleSpeech?

What can PaddleSpeech do?

Speech-to-Text

Text-to-Speech

Files

introduction.md

Latest commit

History

introduction.md

File metadata and controls

PaddleSpeech

What is PaddleSpeech?

What can PaddleSpeech do?

Speech-to-Text

Text-to-Speech