recognizing harmony qualities from music audio
Author: Sivan Ding, Vio Chung, Rave Rajan
It maps any chord to a polar coordinates
We modified a chord recognition model to be a tension embedding extractor, then feed it into a MLP to do regression on chord
orientation
- Initialize a chord recognition model from
crema
- Get chord and tension metrics
Please follow ./Notebook/demo.ipynb
- Initialize a fixed and non-fixed tension model
- Train and validate both tension models
- Evaluate both tension models
- Run the training process diagnostics
Please follow ./Notebook/analysis.ipynb
- Create model architecture and load model weights for both fixed and non-fixed tension model
- Compare models through spectrograms, forward and backward GRU, and cqt
- Show model results of fixed vs. unfixed models
- Observe the performance of the neuro-harmonilizer in individual chords using
metric_filter
function - Observe the performance of the neuro-harmonilizer in triads vs. tetrads
- Observe the performance of indidvidual tension class
Yes, it does! We compared a naive mapping baseline and our modified neural network based method and it shows some advantages. The experiments are done using JazzNet, a dataset that contains chords/arpeggio/scales independent piano audio.
Baseline: Chord recognition -> map chord directly to harmony colors
Ours: Chord embedding extractor -> classifier -> harmony colors
Higher level musical quality extraction. It is intended to use for controllable music audio data analysis and generation.