fad_pytorch

Install

pip install fad_pytorch

About

Features:

runs in parallel on multiple processors and multiple GPUs (via accelerate)
supports multiple embedding methods:
- VGGish and PANN, both mono @ 16kHz
- OpenL3 and (LAION-)CLAP, stereo @ 48kHz
favors ops in PyTorch rather than numpy (or tensorflow)
fad_gen supports WebDataset (audio data stored in S3 buckets)
runs on CPU, CUDA, or MPS

This is designed to be run as 3 command-line scripts in succession. The latter 2 (fad_embed and fad_score) are probably what most people will want:

fad_gen: produces directories of real & fake audio. See fad_gen docs for calling sequence.
fad_embed [options] <real_audio_dir> <fake_audio_dir>: produces directories of embeddings of real & fake audio
fad_score [optoions] <real_emb_dir> <fake_emb_dir>: reads the embeddings & generates FAD score, for real (“$r$”) and fake (“$f$”):

$$ FAD = || \mu_r - \mu_f ||^2 + tr\left(\Sigma_r + \Sigma_f - 2 \sqrt{\Sigma_r \Sigma_f}\right)$$

Comments / FAQ / Troubleshooting

“RuntimeError: CUDA error: invalid device ordinal”: This happens when you have a “bad node” on an AWS cluster. Haven’t yet figured out what causes it or how to fix it. Workaround: Just add the current node to your SLURM --exclude list, exit and retry. Note: it may take as many as 5 to 7 retries before you get a “good node”.
“FAD scores obtained from different embedding methods are wildly different!” …Yea. It’s not obvious that scores from different embedding methods should be comparable. Rather, compare different groups of audio files using the same embedding method, and/or check that FAD scores go down as similarity improves.
“FAD score for the same dataset repeated (twice) is not exactly zero!” …Yea. There seems to be an uncertainty of around +/- 0.008. I’d say, don’t quote any numbers past the first decimal point.

Contributing

This repo is still fairly “bare bones” and will benefit from more documentation and features as time goes on. Note that it is written using nbdev, so the things to are:

fork this repo
clone your fork to your (local) machine
Install nbdev: python3 -m pip install -U nbdev
Make changes by editing the notebooks in nbs/, not the .py files in fad_pytorch/.
Run nbdev_export to export notebook changes to .py files
For good measure, run nbdev_install_hooks and nbdev_clean - especially if you’ve added any notebooks.
Do a git status to see all the .ipynb and .py files that need to be added & committed
git add those files and then git commit, and then git push
Take a look in your GitHub Actions tab, and see if the “test” and “deploy” CI runs finish properly (green light) or fail (red light)
One you get green lights, send in a Pull Request!

Feel free to ask me for tips with nbdev, it has quite a learning curve. You can also ask on fast.ai forums and/or fast.ai Discord

Related Repos

There are [several] others, but this one is mine. These repos didn’t have all the features I wanted, but I used them for inspiration:

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.github/workflows		.github/workflows
fad_pytorch		fad_pytorch
nbs		nbs
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
settings.ini		settings.ini
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fad_pytorch

Install

About

Comments / FAQ / Troubleshooting

Contributing

Related Repos

About

Releases

Packages

Languages

License

klay-music/fad_pytorch

Folders and files

Latest commit

History

Repository files navigation

fad_pytorch

Install

About

Comments / FAQ / Troubleshooting

Contributing

Related Repos

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages