Song Chords Recognizer - ACR Pipeline

Song Chords Recognizer based on Deep Learning and statistical models coded and trained in Python.

Prerequisites

Python 3.8
python libraries - sklearn, tensorflow, librosa, mir_eval
ACR_Training folder

Usage

You can check the Jupiter Notebook Demo.

Shell

python SongChordsRecognizer.py

Standard Input ->

{
    "Waveform": [SONGs WAVEFORM],
    "SampleRate": [WAVEFORMs SAMPLE RATE]
}

Example

{
    "Waveform": [0.3215, 0.1235, 0.6213, -0.941, 0.523],
    "SampleRate": 44100
}

Standard Output ->

{
    "Key": [KEY DESCRIPTION],
    "BPM": [BPM VALUE],
    "ChordSequence": [LIST OF CHORDS],
    "BeatTimes": [LIST OF BEAT TIMES IN SECONDS],
    "BarQuarters": [NUMBER OF QUARTERS IN ONE BAR]
}

Example

{
    "Key": "C",
    "BPM": "120.323",
    "ChordSequence": ['A', 'B', 'A', 'A', 'C'], 
    "BeatTimes": [0.123, 0.32, 0.4, 0.55],
    "BarQuarters": 4
}

.NET

// Prepare Audio Wav Input
AudioSourceWav wav = new AudioSourceWav(audioBytes, audio.FileName);
string json_request = createJsonRequestBody(wav.GetMonoWaveform(), wav.SampleRate)

// Initialize Process
ProcessStartInfo python_SongChordRecognizer = new ProcessStartInfo();
python_SongChordRecognizer.FileName = python_path;

// Prepare command with arguments
python_SongChordRecognizer.Arguments = script_path;

// Python process configuration
python_SongChordRecognizer.UseShellExecute = false;
python_SongChordRecognizer.CreateNoWindow = true;
python_SongChordRecognizer.RedirectStandardInput = true;
python_SongChordRecognizer.RedirectStandardOutput = true;
python_SongChordRecognizer.RedirectStandardError = true;

// Execute process
string json_response = "";
string errors = "";
using (Process process = Process.Start(python_SongChordRecognizer))
{
    StreamWriter streamWriter = process.StandardInput;
    // Send Json request
    streamWriter.WriteLine(json_request);
    streamWriter.Close();
    // Get Json response
    json_response = process.StandardOutput.ReadToEnd();
    errors = process.StandardError.ReadToEnd();
}

Structure

Models

There are two models. The first one is trained on original songs. The second one is trained only on songs transposed to C ionion key and its mode alternatives. The transposed model has better accuracy score.

Preprocess

Audio waveform is preprocessed to the 23s long sequences of cqt spectrograms.

Key Prediction

Based on already predicted chords, the key with the most fitting chords is choosed. Each key has seven corresponding chords that are checked.

Beats Voting

This part uses librosa library to get the BPM value and the Beats list. We use two approaches

Each beat duration is summarized and the most common chord is the one mapped to this beat.
The BPM is used to estimate how many chords correspond to one beat. After that we precreate first draft of beat chord sequence where each chord corresponds to some beat. This sequence is used for the tempo signature estimation. After that we supply or remove missing chords or single chords to complete full bar of the same chord.

We proceed with the second approach that seems to be more accurate.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ReadMe.md

ReadMe.md

Song Chords Recognizer - ACR Pipeline

Prerequisites

Usage

Shell

.NET

Structure

Models

Preprocess

Key Prediction

Beats Voting

Files

ReadMe.md

Latest commit

History

ReadMe.md

File metadata and controls

Song Chords Recognizer - ACR Pipeline

Prerequisites

Usage

Shell

.NET

Structure

Models

Preprocess

Key Prediction

Beats Voting