Apart from some weirdness where the models downloaded into a temp folder and I had to move them manually, the GitHub install went fine, with no errors.
I initially just downloaded the simple all-in-one safetensors file and the basic workflow shared by Comfy for use in their nightly UI update, ran some tests, and was pretty blown away by how well it worked.
Then I saw all the additional options like retake/edit etc. in the Gradio app and installed from GitHub. I get completely different results, mostly ones that seem to follow the lyrics as written but ignore the tags and music styles.
So the question is, what are the right settings here? Comfy uses a simple KSampler with the default res_multistep sampler + 4.0 CFG and an SD3 flow shifter in the workflow. Whatever the settings are under the hood there, how do they translate to this app?
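For anyone comparing the two, here is a minimal sketch of what I understand an SD3-style flow shift to do to the sigma schedule, using the commonly cited formula `shift * sigma / (1 + (shift - 1) * sigma)`. The function names and the shift value of 3.0 are my own placeholders for illustration, not values read from the Comfy workflow or from this app, and I'm not claiming the Gradio app exposes the same parameter.

```python
# Sketch of an SD3-style flow (time) shift applied to a linear sigma schedule.
# Assumption: the shift follows sigma' = shift * sigma / (1 + (shift - 1) * sigma).
# Names and the example shift value are hypothetical, for illustration only.

def shift_sigma(sigma: float, shift: float) -> float:
    """Remap one sigma in [0, 1]; shift > 1 pushes it toward the noisy end."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list[float]:
    """Linear schedule from 1.0 down to 0.0, with the shift applied to each value."""
    linear = [1.0 - i / steps for i in range(steps + 1)]
    return [shift_sigma(s, shift) for s in linear]


if __name__ == "__main__":
    # Compare an unshifted schedule (shift=1.0) with a shifted one (shift=3.0).
    for plain, shifted in zip(shifted_schedule(4, 1.0), shifted_schedule(4, 3.0)):
        print(f"{plain:.3f} -> {shifted:.3f}")
```

If the two front ends use different shift values (or one applies no shift at all), the sampler spends its steps at different noise levels, which could plausibly account for some of the difference in how closely the output follows the tags.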
I haven't tried every combo yet (I've tried changing to cfg/star and messed with the guidance scales), but I can clearly say the default settings aren't working the same. Either it's bugged, or can someone enlighten me on the right settings to use?
Edit: just to clarify, I do get music outputs (so no functional issues on my Windows, CUDA 12.6 setup). It's just that prompt adherence seems to barely work, if at all, when it comes to music type, whereas the Comfy setup, as limited as it is, follows my tags much more closely, to the point where I wasn't really questioning whether it was working. If I had tried Gradio as my first experience with this, I'd probably have been unaware and discarded it.