Skip to content

Latest commit

 

History

History
 
 

tflite_int8

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

RNNoise INT8

Description

RNNoise is a noise reduction network, that helps to remove noise from audio signals while maintaining any speech. This is a TFLite quantized version that takes traditional signal processing features and outputs gain values that can be used to remove noise from audio. It also detects if voice activity is present. This is a 1 step model trained on Noisy speech database for training speech enhancement algorithms and TTS models that requires hidden states to be fed in at each time step. Dataset license link: https://datashare.ed.ac.uk/handle/10283/2791 This model is converted from FP32 to INT8 using post-training quantization.

License

Apache-2.0

Network Information

Network Information Value
Framework TensorFlow Lite
SHA-1 Hash 2d973fe7116e0bc3674f0f3f0f7185ffe105bba5
Size (Bytes) 113472
Provenance https://arxiv.org/pdf/1709.08243.pdf
Paper https://arxiv.org/pdf/1709.08243.pdf

Performance

Platform Optimized
Cortex-A ✔️
Cortex-M ✔️
Mali GPU ✔️
Ethos U ✔️

Key

  • ✔️ - Will run on this platform.
  • ✖️ - Will not run on this platform.

Accuracy

Dataset: Noisy Speech Database For Training Speech Enhancement Algorithms And Tts Models

Metric Value
Average Pesq 2.945

Optimizations

Optimization Value
Quantization INT8

Network Inputs

Input Node Name Shape Description
main_input_int8 (1, 1, 42) Pre-processed signal features extracted from 480 values of a 48KHz wav file
vad_gru_prev_state_int8 (1, 24) Previous GRU state for the voice activity detection GRU
noise_gru_prev_state_int8 (1, 48) Previous GRU state for the noise GRU
denoise_gru_prev_state_int8 (1, 96) Previous GRU state for the denoise GRU

Network Outputs

Output Node Name Shape Description
Identity_int8 (1, 1, 96) Next GRU state for the denoise GRU
Identity_1_int8 (1, 1, 22) Gain values that can be used to remove noise from this audio sample
Identity_2_int8 (1, 1, 48) Next GRU state for the noise GRU
Identity_3_int8 (1, 1, 24) Next GRU state for the voice activity detection GRU
Identity_4_int8 (1, 1, 1) Probability that this audio sample contains voice activity