Improve audio featurizer and add shift augmentor for DS2. #114

xinghai-sun · 2017-06-21T12:59:57Z

resolve #113

Improve audio featurizer (resample, db_normalize, and random shift), as suggested in the speech_dl codes.
Add shift augmentor.
Update default arguments to be the current best seggestion.
Add checkpoints with pass id.

Training experiment is in progress, and its results will be pasted here ASAP.

1. Improve audio featurizer. 2. Add shift augmentor. 3. Update default argument to be the current best seggestion. 4. Add checkpoints with pass id.

xinghai-sun · 2017-06-21T13:03:08Z

deep_speech_2/data_utils/audio.py

@@ -67,6 +67,54 @@ def from_file(cls, file):
        return cls(samples, sample_rate)

    @classmethod
+    def slice_from_file(cls, file, start=None, end=None):


@reviewers:
No different for slice_from_file and make_silence. Only re-order them.

pkuyym

Almost LGTM.

pkuyym · 2017-06-21T13:28:03Z

deep_speech_2/data_utils/audio.py

+        :type shift_ms: float
+        :raises ValueError: If shift_ms is longer than audio duration.
+        """
+        if shift_ms / 1000.0 > self.duration:


Should be abs(shift_ms) ?

pkuyym · 2017-06-21T13:31:40Z

deep_speech_2/data_utils/featurizer/audio_featurizer.py

+                               extracting spectrogram features.
+    :type target_sample_rate: float
+    :param use_dB_normalization: Whether to normalize the audio to a certain
+                                 decibels before extracting the features.


Better to change decibels to dB for consistency

For comments, full name decibels is used for clarity, while in arguments a short name of dB is used instead.
I think it makes sense?

pkuyym · 2017-06-21T13:33:45Z

deep_speech_2/data_utils/featurizer/audio_featurizer.py

+        if audio_segment.sample_rate != self._target_sample_rate:
+            raise ValueError("Audio sample rate is not supported. "
+                             "Turn allow_downsampling or allow up_sampling on.")
+        # decibel normalization


dB better ?

For comments, full name decibels is used for clarity, while in arguments a short name of dB is used instead.
I think it makes sense?

pkuyym · 2017-06-21T13:34:32Z

deep_speech_2/data_utils/featurizer/speech_featurizer.py

+    :param use_dB_normalization: Whether to normalize the audio to a certain
+                                 decibels before extracting the features.
+    :type use_dB_normalization: bool
+    :param target_dB: Target audio decibels for normalization.


For comments, full name decibels is used for clarity, while in arguments a short name of dB is used instead.
I think it makes sense?

pkuyym

Great. LGTM.

xinghai-sun · 2017-06-26T05:15:04Z

The experimental results for comparing old and new featurizer are as follows. We have a better convergence with the new featurizer.

PaddlePaddle#114).

Patchset for adding missing shift_perturb.py in PR #114.

Improve audio featurizer and add shift augmentor.

1920b77

1. Improve audio featurizer. 2. Add shift augmentor. 3. Update default argument to be the current best seggestion. 4. Add checkpoints with pass id.

xinghai-sun requested review from pkuyym, qingqing01 and chrisxu2016 June 21, 2017 12:59

xinghai-sun commented Jun 21, 2017

View reviewed changes

pkuyym requested changes Jun 21, 2017

View reviewed changes

pkuyym approved these changes Jun 26, 2017

View reviewed changes

Fix a missing abs bug for DS2 AudioSegment.

d8348e2

xinghai-sun merged commit 68caa8c into PaddlePaddle:develop Jun 26, 2017

xinghai-sun added a commit to xinghai-sun/models that referenced this pull request Jun 26, 2017

Patch for adding missing shift_perturb.py in last commmit (pull request

67adf7d

PaddlePaddle#114).

kuke added a commit that referenced this pull request Jun 26, 2017

Merge pull request #130 from xinghai-sun/feature_patch

47e2ccb

Patchset for adding missing shift_perturb.py in PR #114.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve audio featurizer and add shift augmentor for DS2. #114

Improve audio featurizer and add shift augmentor for DS2. #114

xinghai-sun commented Jun 21, 2017 •

edited

Loading

xinghai-sun Jun 21, 2017 •

edited

Loading

pkuyym left a comment

pkuyym Jun 21, 2017

xinghai-sun Jun 26, 2017

xinghai-sun Jun 26, 2017

pkuyym Jun 21, 2017

xinghai-sun Jun 26, 2017

pkuyym Jun 21, 2017

xinghai-sun Jun 26, 2017

pkuyym Jun 21, 2017

xinghai-sun Jun 26, 2017

pkuyym left a comment

xinghai-sun commented Jun 26, 2017

Improve audio featurizer and add shift augmentor for DS2. #114

Improve audio featurizer and add shift augmentor for DS2. #114

Conversation

xinghai-sun commented Jun 21, 2017 • edited Loading

xinghai-sun Jun 21, 2017 • edited Loading

Choose a reason for hiding this comment

pkuyym left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pkuyym left a comment

Choose a reason for hiding this comment

xinghai-sun commented Jun 26, 2017

xinghai-sun commented Jun 21, 2017 •

edited

Loading

xinghai-sun Jun 21, 2017 •

edited

Loading