Welcome to the GitHub repository for the PixMus project, a comprehensive exploration of using diffusion models conditioned on video content for synthesizing background music. This repository includes my thesis, presentation slides, the curated dataset, and example output videos demonstrating the capabilities of the PixMus model.
The thesis document details the theoretical background, methodologies, experiments, and results of the PixMus model. It covers the application of diffusion models in generating music that aligns with the emotional and thematic elements of videos.
The PixMus dataset is specially curated to facilitate research in video-conditioned music generation. It pairs carefully selected video clips with corresponding background music, making it well suited for training and evaluating music generation models.
- Dataset Overview: Includes 53,378 samples with a mix of videos and thumbnails.
- Access the Dataset: The dataset is hosted on Hugging Face; a minimal loading sketch is shown after this list.
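
The snippet below is a minimal sketch of loading the dataset with the Hugging Face `datasets` library. It assumes `datasets` is installed (`pip install datasets`); the repository ID is a placeholder, so substitute the actual dataset path from its Hugging Face page.

```python
# Minimal sketch: load the PixMus dataset from the Hugging Face Hub.
# The repository ID below is a placeholder; replace it with the actual
# dataset path listed on the Hub.
from datasets import load_dataset

dataset = load_dataset("<username>/PixMus")  # placeholder repo ID

# Inspect the available splits and a single example
print(dataset)
print(dataset["train"][0])
```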
Below are four output videos from the PixMus model, showcasing the quality of the generated background music and how well it synchronizes with the video content.
Contributions to this project are welcome, whether they involve enhancing the model, expanding the dataset, or improving the documentation. Please feel free to fork the repository, make your changes, and submit a pull request.
For any questions or further information, please contact me at [tilaksharma1114@gmail.com](mailto:tilaksharma1114@gmail.com).
Thank you for visiting this repository, and I hope you find the resources helpful for your research or projects!