Audio classification using CNN and LSTM

Data Visualization

MFCC Features

Spectrogram

Raw Audio

Results

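CNN

| Dataset | Spectrogram (Train) | Spectrogram (Validation) | MFCC (Train) | MFCC (Validation) |
| --- | --- | --- | --- | --- |
| urbansound8k | 99.914 | 97.252 | 100 | 84.544 |

CNN-LSTM

| Dataset | Spectrogram (Train) | Spectrogram (Validation) | MFCC (Train) | MFCC (Validation) |
| --- | --- | --- | --- | --- |
| urbansound8k | 99.928 | 96.451 | 99.985 | 82.369 |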
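To compare the four configurations, it can help to look at the gap between the train and validation figures. The snippet below is a minimal sketch that only does arithmetic on the numbers reported above. It assumes the figures are accuracy percentages (the plot titles suggest this, but the tables do not state it), and the names `RESULTS` and `train_validation_gap` are illustrative, not part of this repository.

```python
# Train / validation figures copied from the results tables above.
# Assumption: they are accuracy percentages on urbansound8k.
RESULTS = {
    ("CNN", "Spectrogram"): (99.914, 97.252),
    ("CNN", "MFCC"): (100.0, 84.544),
    ("CNN-LSTM", "Spectrogram"): (99.928, 96.451),
    ("CNN-LSTM", "MFCC"): (99.985, 82.369),
}


def train_validation_gap(results):
    """Return train minus validation for each (model, feature) pair."""
    return {
        key: round(train - validation, 3)
        for key, (train, validation) in results.items()
    }


gaps = train_validation_gap(RESULTS)
for (model, feature), gap in gaps.items():
    print(f"{model:<9} {feature:<12} gap = {gap:.3f}")
```

On these numbers the MFCC runs show a much larger gap than the spectrogram runs for both models, which may indicate more overfitting with MFCC input.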

Training Accuracy Plot

Validation Accuracy Plot

Training Error Plot

Validation Error Plot

Installation

Use the package manager pip to install the dependencies listed in requirements.txt.

pip install -r requirements.txt

or

conda create --name <env> --file requirements.txt 

Usage

Dataset

urbansound8k
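The public UrbanSound8K release ships a metadata CSV (`metadata/UrbanSound8K.csv`) that assigns every clip to one of ten predefined folds. The sketch below shows one way to group clips by fold and hold one fold out for validation. It is not taken from this repository's pre-processing script, which may work differently. The helper names and the sample rows are made up for illustration; only the column names follow the public release.

```python
import csv
import io
from collections import defaultdict

# Tiny stand-in for metadata/UrbanSound8K.csv. The header matches the public
# release; the rows are invented for illustration and are NOT real dataset rows.
SAMPLE_METADATA = """slice_file_name,fsID,start,end,salience,fold,classID,class
clip_a.wav,1,0.0,4.0,1,1,3,dog_bark
clip_b.wav,2,0.0,4.0,1,1,8,siren
clip_c.wav,3,0.0,4.0,2,2,9,street_music
"""


def group_by_fold(csv_file):
    """Map fold number -> list of (file name, class id) pairs."""
    folds = defaultdict(list)
    for row in csv.DictReader(csv_file):
        folds[int(row["fold"])].append(
            (row["slice_file_name"], int(row["classID"]))
        )
    return dict(folds)


def split_by_fold(folds, validation_fold):
    """Hold out one predefined fold for validation, train on the rest."""
    train = [
        item
        for fold, items in sorted(folds.items())
        if fold != validation_fold
        for item in items
    ]
    validation = list(folds.get(validation_fold, []))
    return train, validation


folds = group_by_fold(io.StringIO(SAMPLE_METADATA))
train, validation = split_by_fold(folds, validation_fold=2)
print(len(train), "training clips,", len(validation), "validation clips")
```

With the real file, `io.StringIO(SAMPLE_METADATA)` would be replaced by `open("metadata/UrbanSound8K.csv", newline="")`, with the path adjusted to wherever the dataset was extracted. Splitting on the predefined folds rather than shuffling clips keeps slices cut from the same source recording out of both sets at once.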

Pre-process Data

python codes/pre_processing/pre_processing_urbansound.py

Train and Test

python codes/baseline/main.py

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT