Add the support of mfcc feature for DS2 #168
Conversation
Almost LGTM
deep_speech_2/README.md
Outdated
@@ -38,7 +38,13 @@ python datasets/librispeech/librispeech.py --help
python compute_mean_std.py
```

-`python compute_mean_std.py` computes mean and stdandard deviation for audio features, and save them to a file with a default name `./mean_std.npz`. This file will be used in both training and inferencing.
+`python compute_mean_std.py` computes mean and stdandard deviation for audio features, and save them to a file with a default name `./mean_std.npz`. This file will be used in both training and inferencing. The default feature of audio data is power spectrum, currently the mfcc feature is also supported. To train and infer based on mfcc feature, you can regenerate this file by
currently the mfcc feature is also supported

Would changing "currently" to "and" be better? There's no need to tell the user that mfcc was added just now.
you can regenerate this file by

Why "regenerate", when this is the first run of DS2?
Done
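For reference, a minimal sketch of the two commands the updated README paragraph describes, both taken from the quoted diffs in this review (the default output path `./mean_std.npz` comes from the README text):

```bash
# Default: compute mean/std of the power-spectrum features, written to ./mean_std.npz
python compute_mean_std.py

# MFCC: regenerate the statistics for MFCC features before MFCC-based training/inference
python compute_mean_std.py --specgram_type mfcc
```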
@@ -2,3 +2,4 @@ wget==3.2
scipy==0.13.1
resampy==0.1.5
https://github.com/kpu/kenlm/archive/master.zip
+python_speech_features
Add a version number.
No version number
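For illustration only, a pinned form of the new dependency would look like the sketch below; the PR itself leaves `python_speech_features` unpinned, and the version number here is an assumption, not taken from the PR.

```bash
# Hypothetical pinned install of the new dependency; the version is an assumption
# for illustration only, since the PR adds the package without a version in requirements.txt.
pip install python_speech_features==0.6
```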
Great. Looking forward to better experimental results with MFCC.
@@ -38,7 +38,13 @@ python datasets/librispeech/librispeech.py --help
python compute_mean_std.py
```

`python compute_mean_std.py` computes mean and stdandard deviation for audio features, and save them to a file with a default name `./mean_std.npz`. This file will be used in both training and inferencing.
"`python compute_mean_std.py` computes" --> "It will compute"
Done
deep_speech_2/README.md
Outdated
@@ -38,7 +38,13 @@ python datasets/librispeech/librispeech.py --help
python compute_mean_std.py
```

-`python compute_mean_std.py` computes mean and stdandard deviation for audio features, and save them to a file with a default name `./mean_std.npz`. This file will be used in both training and inferencing.
+`python compute_mean_std.py` computes mean and stdandard deviation for audio features, and save them to a file with a default name `./mean_std.npz`. This file will be used in both training and inferencing. The default feature of audio data is power spectrum, currently the mfcc feature is also supported. To train and infer based on mfcc feature, you can regenerate this file by
1. "you can regenerate" --> "please regenerate"
2. spectrum or spectrogram?
Done
deep_speech_2/README.md
Outdated
python compute_mean_std.py --specgram_type mfcc
```

and specify the ```specgram_type``` to ```mfcc``` in each step, including training, inference etc.
1. "in each step, including training, inference etc." --> "when running train.py, infer.py, evaluator.py or tune.py"
2. `specgram_type` to `mfcc` --> `--specgram_type mfcc`
Done
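As a sketch of what the reviewer's suggestion amounts to in practice (the script names train.py and infer.py are taken from the comment above; other flags are left at their defaults), the same `--specgram_type` flag would be passed at each stage:

```bash
# Keep the feature type consistent with the regenerated mean_std.npz
python train.py --specgram_type mfcc
python infer.py --specgram_type mfcc
```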
Add the MFCC feature for audio data; training of the model is in progress.