This project provides a minimal example of training a neural network to recognise short audio clips. It is built on top of PyTorch and torchaudio and exposes a small command line interface for training and prediction.
## Features

- Convert audio files to Mel-spectrograms on the fly (see the sketch after this list)
- Simple convolutional neural network architecture
- CLI commands for training and live microphone prediction
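Audio is converted to Mel-spectrograms as it is loaded rather than in a separate preprocessing step. The snippet below is a minimal sketch of how such an on-the-fly conversion can look with torchaudio; the function name and transform parameters (`target_sample_rate`, `n_fft`, `n_mels`) are illustrative assumptions, not the values used in `song_recognizer/data.py`.

```python
# Sketch only: parameters are assumptions, not the project's actual settings.
import torch
import torchaudio


def load_mel_spectrogram(path: str, target_sample_rate: int = 16_000) -> torch.Tensor:
    waveform, sample_rate = torchaudio.load(path)
    # Resample if the file does not match the expected rate.
    if sample_rate != target_sample_rate:
        waveform = torchaudio.transforms.Resample(sample_rate, target_sample_rate)(waveform)
    mel = torchaudio.transforms.MelSpectrogram(
        sample_rate=target_sample_rate,
        n_fft=1024,
        n_mels=64,
    )(waveform)
    # Log scaling keeps the dynamic range manageable for a small CNN.
    return torchaudio.transforms.AmplitudeToDB()(mel)
```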
## Installation

- Create a virtual environment (optional but recommended)
- Install the dependencies:
  ```bash
  pip install -r requirements.txt
  ```

## Usage

Place your audio files (e.g. WAV or MP3) in a directory and run:
```bash
python main.py train /path/to/audio
```

The trained model will be saved to `song_recognizer.pth`.
To make a prediction using the microphone, run:
```bash
python main.py predict
```

or provide a prerecorded file:

```bash
python main.py predict --input_file sample.wav
```
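If you want to reuse the saved weights outside the CLI, the checkpoint can be loaded back into the network before inference. The sketch below assumes the checkpoint stores a plain `state_dict` and that the model class is importable from `song_recognizer.model`; `SongCNN` and its `num_classes` argument are placeholders, not necessarily the real class or signature.

```python
# Hypothetical loading sketch; SongCNN and num_classes are placeholders for
# the actual class and constructor arguments in song_recognizer/model.py,
# and the checkpoint is assumed to contain a plain state_dict.
import torch

from song_recognizer.model import SongCNN  # placeholder class name

model = SongCNN(num_classes=10)  # placeholder argument
model.load_state_dict(torch.load("song_recognizer.pth", map_location="cpu"))
model.eval()  # switch to inference mode before predicting
```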
## Project structure

```
├── song_recognizer
│   ├── __init__.py
│   ├── data.py
│   ├── model.py
│   ├── recognition.py
│   └── train.py
├── main.py
├── requirements.txt
└── README.md
```
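For orientation, `song_recognizer/model.py` holds the "simple convolutional neural network" mentioned in the feature list. The real architecture may differ; the snippet below is only a sketch of what a small CNN classifier over Mel-spectrogram input typically looks like in PyTorch, with layer sizes and `num_classes` chosen arbitrarily (the same placeholder name `SongCNN` is used as in the loading example above).

```python
# Illustrative sketch only; the network in song_recognizer/model.py may use
# different layers, channel counts, or pooling.
import torch
from torch import nn


class SongCNN(nn.Module):
    def __init__(self, num_classes: int):
        super().__init__()
        # Two small conv blocks over a (batch, 1, n_mels, time) spectrogram.
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d((4, 4)),  # fixed-size output regardless of clip length
        )
        self.classifier = nn.Linear(32 * 4 * 4, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        return self.classifier(torch.flatten(x, start_dim=1))
```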
## License

MIT