Birdclef 2023

The goal of this project is to identify which bird species are calling in long recordings made in Kenya. The idea comes from Kaggle's BirdCLEF 2023 challenge. We built the pipeline on top of torchaudio and librosa and used Weights & Biases to track the experiments.

The Data

The dataset comprises 16,000 distinct audio samples of varying lengths. Here are the key features included in the dataset:

Species: This serves as the primary label.
Type: Manually annotated tags such as “call,” “song,” “flight call,” and “adult.”
Latitude and Longitude Information: Provides positional data.
Rating: Manually assigned quality ratings.

We removed all audio samples with a rating lower than 3, since they accounted for only 14% of the data. We also immediately noticed a class imbalance problem that has to be dealt with.
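The rating filter can be sketched in a few lines. This is only an illustration: the row layout (dictionaries with `filename`, `species` and `rating` keys), the helper name `filter_by_rating` and the sample rows are assumptions made for the example, not the project's actual code or data.

```python
def filter_by_rating(rows, min_rating=3.0):
    """Keep only the rows whose rating is at least `min_rating`."""
    return [row for row in rows if float(row["rating"]) >= min_rating]


# Hypothetical metadata rows, invented for this example.
rows = [
    {"filename": "a.ogg", "species": "sp1", "rating": 4.5},
    {"filename": "b.ogg", "species": "sp1", "rating": 2.5},
    {"filename": "c.ogg", "species": "sp2", "rating": 3.0},
    {"filename": "d.ogg", "species": "sp2", "rating": 1.0},
]

kept = filter_by_rating(rows)
print(len(kept), "of", len(rows), "recordings kept")
```

A rating of exactly 3 is kept, matching the "lower than 3" rule stated above.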

Our pipeline

The typical pipeline converts an audio waveform into a Mel spectrogram and then classifies it with a convolutional neural network (CNN). The Mel spectrogram representation is what turns the audio classification problem into an image classification task on the generated spectrogram.
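To make the waveform-to-spectrogram step concrete, here is a minimal NumPy sketch of what a Mel spectrogram computes: a windowed short-time Fourier transform followed by a bank of triangular filters spaced evenly on the Mel scale. It is not the project's implementation; in practice `torchaudio.transforms.MelSpectrogram` or `librosa.feature.melspectrogram` would do this work. The sample rate, FFT size, hop length and number of Mel bands below are assumed values chosen for illustration.

```python
import numpy as np


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=np.float64) / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=np.float64) / 2595.0) - 1.0)


def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters of shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    # n_mels + 2 edge points, evenly spaced on the Mel scale.
    hz_points = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    bank = np.zeros((n_mels, fft_freqs.size))
    for m in range(n_mels):
        lo, mid, hi = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        bank[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def mel_spectrogram(wave, sr, n_fft=1024, hop=256, n_mels=64):
    """Return a power Mel spectrogram of shape (n_mels, n_frames)."""
    wave = np.asarray(wave, dtype=np.float64)
    if wave.size < n_fft:
        wave = np.pad(wave, (0, n_fft - wave.size))
    n_frames = 1 + (wave.size - n_fft) // hop
    window = np.hanning(n_fft)
    frames = np.stack(
        [wave[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2  # (n_frames, n_fft // 2 + 1)
    return mel_filterbank(sr, n_fft, n_mels) @ power.T


# One second of a 4 kHz tone at an assumed 32 kHz sample rate.
sr = 32000
t = np.arange(sr) / sr
wave = np.sin(2.0 * np.pi * 4000.0 * t)
spec = mel_spectrogram(wave, sr)
print(spec.shape)  # (64, 122)
```

The resulting 2-D array is what gets treated as a single-channel image and passed to the CNN.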

To improve the results, we apply an adaptive procedure called per-channel energy normalization (PCEN), which helps separate bird calls from background noise.
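As a rough illustration of how PCEN works, the sketch below implements the standard formulation in NumPy. Each frequency channel is divided by a smoothed version of its own recent energy, M(t, f) = (1 - s) M(t-1, f) + s E(t, f), and the result is compressed: PCEN(t, f) = (E(t, f) / (eps + M(t, f))^alpha + delta)^r - delta^r. The parameter values are commonly quoted defaults, not necessarily the ones used in this project, and librosa ships a ready-made `librosa.pcen` that would normally be used instead.

```python
import numpy as np


def pcen(energy, s=0.025, alpha=0.98, delta=2.0, r=0.5, eps=1e-6):
    """Per-channel energy normalization.

    energy: non-negative array of shape (n_channels, n_frames),
            for example a Mel spectrogram.
    """
    energy = np.asarray(energy, dtype=np.float64)
    smooth = np.empty_like(energy)
    # First-order low-pass filter along time, one filter per channel.
    smooth[:, 0] = energy[:, 0]
    for t in range(1, energy.shape[1]):
        smooth[:, t] = (1.0 - s) * smooth[:, t - 1] + s * energy[:, t]
    # Adaptive gain control followed by root compression.
    return (energy / (eps + smooth) ** alpha + delta) ** r - delta ** r


# Steady background energy with a short burst in frame 25.
energy = np.ones((4, 50))
energy[:, 25] = 10.0
out = pcen(energy)
print(out[0, 24], out[0, 25])
```

Because the smoothed energy tracks slowly varying background levels, stationary noise is normalized toward a constant while short events such as a call stand out.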

Dealing With Class Imbalance

We oversample to deal with the class imbalance. The number of samples added for a class depends on its class count, while the recording to be oversampled is chosen according to its length relative to the total audio length of that class. The following graph shows the resulting per-class F1 score together with the relative class prevalence.
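One way to read the oversampling rule is sketched below. For every class with fewer than `min_count` recordings, extra copies are drawn until the class reaches that count, and each draw picks a recording with probability proportional to its duration within the class. The `min_count` target, the record layout and the function name are assumptions made for the example; the source does not state the exact rule that maps class count to the number of added samples.

```python
import random
from collections import defaultdict


def oversample(records, min_count, seed=0):
    """Return `records` plus duration-weighted duplicates for rare classes."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for rec in records:
        by_class[rec["species"]].append(rec)

    result = list(records)
    for species, recs in by_class.items():
        missing = min_count - len(recs)
        if missing <= 0:
            continue  # class is already large enough
        # Longer recordings are more likely to be picked.
        weights = [rec["duration"] for rec in recs]
        result.extend(rng.choices(recs, weights=weights, k=missing))
    return result


# Hypothetical records: species "a" is common, species "b" is rare.
records = [{"species": "a", "duration": 10.0, "filename": f"a{i}.ogg"} for i in range(5)]
records += [
    {"species": "b", "duration": 1.0, "filename": "b0.ogg"},
    {"species": "b", "duration": 9.0, "filename": "b1.ogg"},
]

balanced = oversample(records, min_count=5)
counts = defaultdict(int)
for rec in balanced:
    counts[rec["species"]] += 1
print(dict(counts))
```

Weighting by duration means a long recording, which contains more distinct segments to crop from, is repeated more often than a short one.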
