RNNoise is a noise reduction network, that helps to remove noise from audio signals while maintaining any speech. This is a TFLite quantized version that takes traditional signal processing features and outputs gain values that can be used to remove noise from audio. It also detects if voice activity is present.
This is a 1 step model trained on Noisy speech database for training speech enhancement algorithms and TTS models that requires hidden states to be fed in at each time step.
Dataset license link: https://datashare.ed.ac.uk/handle/10283/2791
This model is converted from FP32 to INT8 using post-training quantization.
Apache-2.0
Platform |
Optimized |
Cortex-A |
✔️ |
Cortex-M |
✔️ |
Mali GPU |
✔️ |
Ethos U |
✔️ |
- ✔️ - Will run on this platform.
- ✖️ - Will not run on this platform.
Dataset: Noisy Speech Database For Training Speech Enhancement Algorithms And Tts Models
Metric |
Value |
Average Pesq |
2.945 |
Optimization |
Value |
Quantization |
INT8 |
Input Node Name |
Shape |
Description |
main_input_int8 |
(1, 1, 42) |
Pre-processed signal features extracted from 480 values of a 48KHz wav file |
vad_gru_prev_state_int8 |
(1, 24) |
Previous GRU state for the voice activity detection GRU |
noise_gru_prev_state_int8 |
(1, 48) |
Previous GRU state for the noise GRU |
denoise_gru_prev_state_int8 |
(1, 96) |
Previous GRU state for the denoise GRU |
Output Node Name |
Shape |
Description |
Identity_int8 |
(1, 1, 96) |
Next GRU state for the denoise GRU |
Identity_1_int8 |
(1, 1, 22) |
Gain values that can be used to remove noise from this audio sample |
Identity_2_int8 |
(1, 1, 48) |
Next GRU state for the noise GRU |
Identity_3_int8 |
(1, 1, 24) |
Next GRU state for the voice activity detection GRU |
Identity_4_int8 |
(1, 1, 1) |
Probability that this audio sample contains voice activity |