You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are planning to build Deep Speech 2 (DS2) [1], a powerful Automatic Speech Recognition (ASR) engine, on PaddlePaddle. For the first-stage plan, we have the following short-term goals:
Release a basic distributed implementation of DS2 on PaddlePaddle.
Contribute a chapter of Deep Speech to PaddlePaddle Book.
Intensive system optimization and low-latency inference library (details in [1]) are not yet covered in this first-stage plan.
Tasks
We roughly break down the project into 14 tasks:
Develop an audio data provider:
Json filelist generator
Audio file format transformer.
Spectrogram feature extraction, power normalization etc.
Batch data reader with SortaGrad.
Data augmentation (optional).
Prepare (one or more) public English data sets & baseline.
您好,此issue在近一个月内暂无更新,我们将于今天内关闭。若在关闭后您仍需跟进提问,可重新开启此问题,我们将在24小时内回复您。因关闭带来的不便我们深表歉意,请您谅解~感谢您对PaddlePaddle的支持!
Hello, this issue has not been updated in the past month. We will close it today for the sake of other user‘s experience. If you still need to follow up on this question after closing, please feel free to reopen it. In that case, we will get back to you within 24 hours. We apologize for the inconvenience caused by the closure and thank you so much for your support of PaddlePaddle Group!
We are planning to build Deep Speech 2 (DS2) [1], a powerful Automatic Speech Recognition (ASR) engine, on PaddlePaddle. For the first-stage plan, we have the following short-term goals:
Intensive system optimization and low-latency inference library (details in [1]) are not yet covered in this first-stage plan.
Tasks
We roughly break down the project into 14 tasks:
DenseScanner
indataprovider_converter.py
, etc.ctc_error_evaluator
(CER) to support WER.Task Dependency
Tasks parallelizable within phases:
Issue for each task will be created later. Contributions, discussions and comments are all highly appreciated and welcomed!
Possible Future Work
References
The text was updated successfully, but these errors were encountered: