Music Genre Classifier

A Simple Music Genre Classifier

Welcome

I recently found the Deep Audio Classification (https://github.com/despoisj/DeepAudioClassification) repository and was curious whether I could get something similar to work without necessarily using a deep neural network to recognize music genres. Please keep in mind that the sources change faster than this README.

Disclaimer

This isn't anything professional; it's just a spare-time project I'm developing to learn something about machine learning and data analysis. The software is provided as is, without any warranty. Please read the attached license.

Setup

To train the regression you have to place your labeled music in the data/ folder. At the time of writing I don't have a large enough dataset to provide a "good" model, so you'll have to build your own! Each file must be named category_progressivenumber.wav. To easily convert your .mp3 files to .wav, install and use sox:

sox yourfile.mp3 yourfile.wav channels 1

If you don't have mp3 support, just install libsox-fmt-mp3 or libsox-fmt-all.
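The category_progressivenumber.wav naming convention can be read back in a few lines of Python. This is only a minimal sketch: the helper names parse_label and list_labeled_files are hypothetical, and the actual parsing in main.py may differ. It assumes the category is everything before the last underscore.

```python
from pathlib import Path


def parse_label(filename):
    """Split 'category_progressivenumber.wav' into (category, number)."""
    stem = Path(filename).stem  # drop any directory part and the '.wav' suffix
    # Split on the LAST underscore so a category may itself contain underscores.
    category, _, number = stem.rpartition("_")
    if not category or not number.isdecimal():
        raise ValueError(f"unexpected file name: {filename!r}")
    return category, int(number)


def list_labeled_files(folder="data"):
    """Yield (path, category) for every .wav file found in `folder`."""
    for path in sorted(Path(folder).glob("*.wav")):
        category, _ = parse_label(path.name)
        yield path, category
```

For example, parse_label("metal_12.wav") gives the category "metal" and the number 12, while a name without an underscore raises a ValueError instead of being silently mislabeled.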

Usage

The software is written in Python 3. I personally recommend installing Continuum Anaconda to have everything ready and working. If you don't want to install the complete Anaconda package, first install the dependencies:

pip3 install -r requirements.txt

Once you've installed them and placed your files in the data folder, just run

python3 main.py train

to train the regression. To predict the genre of your other files, add them to the predict folder and run

python3 main.py predict

At the end, your result should look similar to this:

Prediction score (on training set): 0.777777777778
--- Prediction test ---
Classes are
[0] Classical
[50] Other
[100] Metal
- MajorLazerGetFree.wav: [100] - with prob. [[ 0.18406303  0.22849766  0.58743931]]
- PanteraHeresy.wav: [100] - with prob. [[ 0.17705346  0.21830755  0.604639  ]]
- BeethovenOdeToJoy.wav: [50] - with prob. [[ 0.34649982  0.45182985  0.20167033]]

Note that the probabilities are ordered by label value: the three columns correspond to [0], [50] and [100], in that order.
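To make the column ordering concrete, here is a small sketch that maps one row of probabilities back to its label. The class table and the numbers are copied from the sample output; the helper name most_likely is hypothetical and not part of main.py. In scikit-learn, the columns returned by predict_proba follow the sorted order of the model's classes_ attribute, which is what the sorted() call mimics here.

```python
# Class table copied from the sample output.
classes = {0: "Classical", 50: "Other", 100: "Metal"}
labels = sorted(classes)  # [0, 50, 100]: the column order of the probabilities


def most_likely(probabilities):
    """Return (label, name, probability) for the highest-probability column."""
    best = max(range(len(probabilities)), key=probabilities.__getitem__)
    label = labels[best]
    return label, classes[label], probabilities[best]


# Row printed for PanteraHeresy.wav in the sample output.
print(most_likely([0.17705346, 0.21830755, 0.604639]))
```

The third column is the largest in that row, so the function selects label 100 (Metal), matching the [100] printed in the sample output.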

How it works

The following explanation is probably full of mistakes and errors. Please write to me so I can correct them!

The big picture

This kind of problem could be schematized in the following diagram

Features Extraction

I don't know much about music, but I know that songs are signals, so my first approach is to compute the spectrum and try to parametrize it. To parametrize the spectrum I've used the Welch method, which works very well at suppressing noise: it computes the spectrum for different slices of the song and then averages them. The parameters at the end of the computation are the spectrum values at the first n multiples of 20 Hz. This is purely heuristic and is based on some spectrum plots I saw. I'm trying to figure out a better technique to characterize a song, based not only on the spectrum.
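The averaging idea can be sketched with NumPy alone. This is an illustration under stated assumptions, not the code in FeaturesExtraction.py: the function names, the segment length of 4096 samples, the 50% overlap, the Hann window and the default n=50 are all choices made for this example. A ready-made implementation is available as scipy.signal.welch, and a .wav file can be loaded with scipy.io.wavfile.read.

```python
import numpy as np


def welch_psd(signal, rate, segment_len=4096):
    """Average the periodograms of half-overlapping, Hann-windowed slices."""
    if len(signal) < segment_len:
        raise ValueError("signal is shorter than one segment")
    window = np.hanning(segment_len)
    step = segment_len // 2  # 50% overlap between consecutive slices
    spectra = []
    for start in range(0, len(signal) - segment_len + 1, step):
        chunk = signal[start:start + segment_len] * window
        spectra.append(np.abs(np.fft.rfft(chunk)) ** 2)
    freqs = np.fft.rfftfreq(segment_len, d=1.0 / rate)
    # Scaling constants are omitted: only the relative shape is used here.
    return freqs, np.mean(spectra, axis=0)


def spectrum_features(signal, rate, n=50, spacing_hz=20.0):
    """Sample the averaged spectrum at 20 Hz, 40 Hz, ..., n * 20 Hz."""
    freqs, psd = welch_psd(signal, rate)
    targets = spacing_hz * np.arange(1, n + 1)
    return np.interp(targets, freqs, psd)  # linear interpolation between bins


if __name__ == "__main__":
    # Synthetic stand-in for a song: two seconds of a 440 Hz tone.
    rate = 8000
    t = np.arange(2 * rate) / rate
    features = spectrum_features(np.sin(2 * np.pi * 440 * t), rate)
    print(features.shape, int(np.argmax(features)))
```

Averaging many short slices trades frequency resolution for a smoother estimate, which is why the method suppresses noise. The frequency bins rarely fall exactly on multiples of 20 Hz, hence the interpolation step.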

Model

The model in this kind of application is a Logistic Regression. I've used the fantastic scikit-learn implementation (http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html). I will try to do classification with a simple neural network, if possible.
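As a rough illustration of how a scikit-learn LogisticRegression is trained and queried, the sketch below fits one on synthetic data. Everything in it is an assumption made for the example: the random 4-dimensional "feature vectors" stand in for real spectrum features, the labels 0 and 100 are borrowed from the sample output, and max_iter=1000 is an arbitrary setting. It is not the code in main.py.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic stand-in data: two well-separated clusters of 4-dimensional
# feature vectors, labeled 0 and 100.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (20, 4)), rng.normal(5.0, 1.0, (20, 4))])
y = np.array([0] * 20 + [100] * 20)

model = LogisticRegression(max_iter=1000)
model.fit(X, y)

print("Prediction score (on training set):", model.score(X, y))
print("Classes:", model.classes_)  # sorted labels; probability columns follow this order

new_song = np.full((1, 4), 5.0)  # a point in the middle of the "100" cluster
print(model.predict(new_song), model.predict_proba(new_song))
```

As in the sample output, the score here is measured on the training set, so it says little about how the model behaves on unseen songs. Holding out part of the labeled data would give a more honest estimate.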

To Do

Improve feature extraction: find better parameters to characterize a song
Create a dataset large enough to train the model with better accuracy
Add direct mp3 support and eyed3 label extraction
