instrument-classifier

Uses tree-based machine learning to classify audio samples by musical instrument.

Data Sources

University of Iowa Electronic Music Studios Musical Instrument Sample Database

UK Philharmonia Orchestra Sound Samples

Data Technologies Used

Python

BeautifulSoup for web scraping
pandas for data wrangling
librosa for audio processing
matplotlib for visualization

AWS

S3 for raw audio samples
RDS (MySQL) for signal-processed data

Under the Hood

Onset detection isolates the time of the attack. Onsets are located at peaks of the onset strength envelope computed by librosa.
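The peak-picking idea can be sketched in plain Python. This is a simplified illustration, not the project's code: librosa provides this through `librosa.onset.onset_strength` and `librosa.onset.onset_detect`, whose peak picker is more elaborate. The envelope values, the `threshold` and `min_gap` parameters, and the helper names below are assumptions made for the example. The frame-to-time conversion uses librosa's default sample rate (22050 Hz) and hop length (512 samples).

```python
def pick_onsets(envelope, threshold=0.5, min_gap=3):
    """Return frame indices of local maxima in an onset-strength envelope.

    A frame counts as an onset if it is a local peak, reaches at least
    threshold * max(envelope), and lies at least min_gap frames after
    the previously accepted onset. (Hypothetical simplified rule.)
    """
    if not envelope:
        return []
    cutoff = threshold * max(envelope)
    onsets = []
    for i in range(1, len(envelope) - 1):
        is_peak = envelope[i] > envelope[i - 1] and envelope[i] >= envelope[i + 1]
        if is_peak and envelope[i] >= cutoff:
            if not onsets or i - onsets[-1] >= min_gap:
                onsets.append(i)
    return onsets


def frames_to_seconds(frames, sr=22050, hop_length=512):
    """Convert frame indices to times in seconds."""
    return [f * hop_length / sr for f in frames]


# Made-up onset-strength envelope with two clear attacks.
envelope = [0.0, 0.1, 0.2, 3.0, 1.0, 0.4, 0.2, 0.1, 2.5, 0.9, 0.3, 0.1]
onset_frames = pick_onsets(envelope)
onset_times = frames_to_seconds(onset_frames)
print(onset_frames, onset_times)
```

For a single-note sample, the first detected onset would typically be taken as the attack.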

The spectral content of four audio frames near the attack is measured across 28 dimensions, giving a 112-dimensional representation of each sample. These 28 dimensions include mel-frequency cepstral coefficients (MFCCs) as well as chromatic data. Chromatic data is taken both at and after the attack in order to measure the decay rate of harmonics. The chromagram in chromagram.png illustrates the die-off of the other harmonics of a Bb played on flute.
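As a rough sketch of how four frames of 28 features each could be flattened into one 112-dimensional vector: the README does not say which four frames are used, so the `offsets` default below (the onset frame and the three that follow) is an assumption, as are the function name and the synthetic feature matrix. In practice the per-frame features would come from librosa, for example `librosa.feature.mfcc` and `librosa.feature.chroma_stft`.

```python
def attack_feature_vector(frame_features, onset_frame, offsets=(0, 1, 2, 3)):
    """Concatenate the feature rows of a few frames near the attack.

    frame_features: list of per-frame feature lists, shape (n_frames, n_dims).
    onset_frame:    index of the detected attack.
    offsets:        frame offsets relative to the onset (hypothetical choice).
    """
    last = len(frame_features) - 1
    vector = []
    for off in offsets:
        idx = min(onset_frame + off, last)  # clamp so short clips still work
        vector.extend(frame_features[idx])
    return vector


N_DIMS = 28   # per-frame dimensions stated in the README
N_FRAMES = 10  # arbitrary length for this toy example

# Synthetic stand-in for real features: value encodes (frame, dimension).
frame_features = [
    [float(t * 100 + d) for d in range(N_DIMS)] for t in range(N_FRAMES)
]

vec = attack_feature_vector(frame_features, onset_frame=3)
print(len(vec))
```

Keeping the frames in time order within the vector lets a tree-based model compare the same feature at and after the attack, which is how decay of the harmonics can show up as a signal.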

The resulting representation is then classified using XGBoost with 500 estimators. The classifier was trained on 24 distinct instrument types common in orchestral music. XGBoost was compared with a selection of other tree-based methods and outperformed them in classification accuracy.

Results

Around 25% of samples were held out as test data. We obtained 83% accuracy on the test data, a 4.5-fold improvement over the baseline model (which implies a baseline accuracy of roughly 18%). The classifier appears to perform best on percussion instruments, which is to be expected since their relevant spectral information is concentrated close to the attack.
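The evaluation arithmetic can be illustrated with a small, self-contained sketch. The README does not say which baseline model was used, so the majority-class baseline here is an assumption chosen only for illustration, and the toy labels and helper names are hypothetical. The last line simply divides the reported accuracy by the reported improvement factor.

```python
import random
from collections import Counter


def train_test_split(items, test_fraction=0.25, seed=0):
    """Shuffle items and hold out roughly test_fraction of them."""
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def majority_baseline(train_labels, n):
    """Predict the most frequent training label n times (assumed baseline)."""
    most_common = Counter(train_labels).most_common(1)[0][0]
    return [most_common] * n


# Toy label set standing in for the real instrument labels.
labels = ["flute"] * 40 + ["violin"] * 30 + ["snare"] * 30
train, test = train_test_split(labels)
baseline_acc = accuracy(test, majority_baseline(train, len(test)))

# Baseline accuracy implied by the reported figures: 83% / 4.5.
implied_baseline = 0.83 / 4.5
print(len(train), len(test), round(implied_baseline, 3))
```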

Future Work

This work could readily be improved by expanding the training set of audio samples, both with more samples of the 24 instruments already included and with samples of non-orchestral instruments. It may also be worthwhile to build onset detection into the machine learning pipeline.
