ComfyUI-StableAudioSampler

The New Stable Audio Open 1.0 Sampler In a ComfyUI Node. Make some beats!

Your browser does not support the audio element.

Caveats

The longer your audio, the more VRAM you need to stitch it together
on a 3060, we've tried up to 10 seconds so far

Installation

Download the Model and Config

Go to Stable Audio Open on HuggingFace and download the model.safetensors and model.config.json files.
Place the files in the models/audio_checkpoints folder. If you don't have one, make one in your comfy folder.
Open Comfy and StableAudioLoader will see your model and config

With a HuggingFace Token

Make sure you have your HF_TOKEN environment variable for hugging face because model loading doesn't work just yet directly from a saved file
Go ahead and download model from here for when we fix that Stable Audio Open on HuggingFace
Make sure to run pip install -r requirements.txt inside the repo folder if you're not using Manager
It should just run if you've got your environment variable set up

There will definitely be issues because this is so new and it was coded quickly so we couldn't test it out.

This is not an official StableAudioOpen repository.

Current Features

Load your own models!
Runs in half precision optional
Nodes
- A Sampler Node: now with seed control, positive and negative prompts
- A Pre-Conditioning Node: kind of like empty latent audio with batch option
- A Prompt Node: Pipes conditioning
- A Model Loading Node: Includes repo options and scans models/audio_checkpoints for models and config.json files
control_after_generate option
Audio to Audio (like in the Gradio Example)
Can still use HF env key if you want
Generates audio and outputs raw bytes and a sample rate for use with VHS
Includes all of the original Stable Audio Open parameters
Sampler outputs a Spectrogram image (experimental)
Can save audio to file

Roadmap and Requested Features

Keeping track of requests and ideas as they come in:

Stereo output
Nodes
- A Mixer Node (mix your audio outputs with some sort of mastering)
- A Tiling Sampler (concatenate the audios)
More Sampler Node Options
- Gain
- Possibly Clipping at some dB
- Cleaning up some of the current options with selectors, etc.
Upfi (upscaling fidelity)

We are very open to anyone who wants to contribute from the open source community. Make your forks and pull requests. We will build something cool.

Feature Requests

If you have a request for a feature, open an issue about it and it will be seen.

Happy Diffusing!

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
assets		assets
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
nodes.py		nodes.py
requirements.txt		requirements.txt
util_config.py		util_config.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI-StableAudioSampler

Caveats

Installation

Download the Model and Config

With a HuggingFace Token

Current Features

Roadmap and Requested Features

Feature Requests

About

Releases

Packages

Languages

kft334/ComfyUI-StableAudioSampler

Folders and files

Latest commit

History

Repository files navigation

ComfyUI-StableAudioSampler

Caveats

Installation

Download the Model and Config

With a HuggingFace Token

Current Features

Roadmap and Requested Features

Feature Requests

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages