Phoneme classification code for the review of the EUSIPCO 2017 paper:
Timbre Analysis of Music Audio Signals with Convolutional Neural Networks
Steps for reproducing the experiment results:

1. Clone this repository.
2. Download the Jingju a cappella singing dataset from http://doi.org/10.5281/zenodo.344932
3. Change the `dataset_path` variable in `parameters.py` so that it points to the downloaded dataset (see the sketch after this list).
4. Install the dependencies (see below).
5. Choose `dataset` in `parameters.py` to run the experiment on the dan or the laosheng dataset.
6. Run the experiment with `python doPhonemeClassification.py`.
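A minimal sketch of the `parameters.py` settings the steps above refer to; only `dataset_path`, `dataset`, `am` and the value `'qmLonUpfLaosheng'` come from this README, while the example path and comments are placeholders:

```python
# parameters.py -- sketch of the variables this README asks you to edit.
# The path below is a placeholder; the dan dataset identifier is not given
# in this README, so it is left out.

# Absolute path to the downloaded Jingju a cappella singing dataset
dataset_path = '/path/to/jingju_a_cappella_singing_dataset'

# Which dataset to run the experiment on ('qmLonUpfLaosheng' = laosheng)
dataset = 'qmLonUpfLaosheng'

# Acoustic model: 'cnn' for the proposed/Choi models, 'gmm' for the GMM baseline
am = 'cnn'
```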
Steps for calculating the mel bands features:

1. Execute steps 1, 2 and 3 in "Steps for reproducing the experiment results".
2. Choose the `dataset` and `am` variables in `parameters.py`. For example, `dataset='qmLonUpfLaosheng'` and `am='cnn'` means extracting the laosheng features for the convolutional neural networks (the proposed and Choi models).
3. Run `python phonemeSampleCollection.py` to extract the mel bands features (a feature-extraction sketch follows this list).

Note: the code for extracting the features for the MLP model is not included.
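For orientation, below is a self-contained sketch of mel-band extraction with Essentia's standard Python API; the frame size, hop size and number of bands are assumptions and need not match what `phonemeSampleCollection.py` actually uses:

```python
import numpy as np
import essentia.standard as ess

def extract_mel_bands(filename, frame_size=2048, hop_size=1024, n_bands=80, fs=44100):
    """Return a (n_frames, n_bands) array of log mel-band energies."""
    audio = ess.MonoLoader(filename=filename, sampleRate=fs)()
    window = ess.Windowing(type='hann')
    spectrum = ess.Spectrum(size=frame_size)
    mel = ess.MelBands(numberBands=n_bands, sampleRate=fs, inputSize=frame_size // 2 + 1)
    bands = []
    for frame in ess.FrameGenerator(audio, frameSize=frame_size, hopSize=hop_size):
        bands.append(mel(spectrum(window(frame))))
    # Log compression with a small floor to avoid log(0)
    return np.log(np.array(bands) + 1e-10)
```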
Steps for downloading the pre-computed mel bands features:

1. Download the pre-computed mel bands features from http://doi.org/10.5281/zenodo.344935
2. Create a folder named `trainingData` in the root of this repository, then put all `.pickle.gz` feature files into this folder (a loading sketch follows this list).

If you don't want to download the pre-computed features, please follow "Steps for calculating the mel bands features".
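To sanity-check a downloaded feature file, a small loading sketch is shown below; the file name is hypothetical and the internal structure of the pickled object is an assumption, so inspect what you actually get back:

```python
import gzip
import pickle

# Load one pre-computed feature file from the trainingData folder.
# 'example_features.pickle.gz' is a placeholder name; the exact contents
# of the pickled object are an assumption, so inspect them after loading.
with gzip.open('trainingData/example_features.pickle.gz', 'rb') as f:
    data = pickle.load(f)
print(type(data))
```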
Steps for training the proposed, Choi, MLP and GMM models:

- The model training code is located in the `pretrainedDLModels` folder. The `keras_cnn*` code trains the CNN models (proposed and Choi models); the `keras_dnn*` code trains the MLP model.
- To train the GMM models, set `am='gmm'` in `parameters.py`, then execute steps 1 and 2 in "Steps for calculating the mel bands features" (an illustrative GMM sketch follows this list).
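As a rough illustration of the GMM branch (not the repository's actual training code), one Gaussian mixture per phoneme class can be fitted on the mel-band frames with scikit-learn and a segment classified by comparing per-class log-likelihoods; the number of components and covariance type below are assumptions:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def train_gmms(features_by_phoneme, n_components=16):
    """Fit one GaussianMixture per phoneme class.

    features_by_phoneme maps a phoneme label to a (n_frames, n_dims) array.
    n_components=16 is an assumption, not the paper's setting.
    """
    gmms = {}
    for phoneme, X in features_by_phoneme.items():
        gmms[phoneme] = GaussianMixture(n_components=n_components,
                                        covariance_type='diag').fit(X)
    return gmms

def classify(gmms, X):
    """Return the phoneme whose GMM gives the highest average log-likelihood for X."""
    scores = {phoneme: gmm.score(X) for phoneme, gmm in gmms.items()}
    return max(scores, key=scores.get)
```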
Dependencies:

"Steps for reproducing the experiment results" requires the following packages:
python2 numpy scipy scikit-learn matplotlib essentia

"Steps for calculating the mel bands features" requires the following packages:
python2 numpy scipy scikit-learn essentia

"Steps for training the proposed, Choi, MLP and GMM models" requires the following packages:
python2 numpy scipy scikit-learn essentia keras theano hyperopt
License: GNU Affero General Public License version 3