Colab dreambooth notebook fail #252
Comments
Here's the requirement cell that works.
returns an error
Still returns "Exception: CUDA SETUP: Setup Failed!"
I was able to run it successfully with the modified code. However, when I tried to use the generated model with Stable Diffusion, I got the following error: *** Error verifying pickled file from D:\l......*.ckpt. Adding the recommended command-line argument allowed Stable Diffusion to load the model.
Changing this line: To this: Got it working for me again.
You need to save the checkpoint as safetensors.
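For anyone unsure what saving as safetensors involves, here is a minimal sketch of converting an existing .ckpt file. It assumes `torch` and `safetensors` are installed and that the checkpoint is either a flat dict of tensors or wraps them under a "state_dict" key (an assumption about the file layout, not something confirmed in this thread). The file names are hypothetical. Note that `torch.load` unpickles the file, so only run this on checkpoints you trust.

```python
def extract_state_dict(checkpoint):
    """Return the nested "state_dict" entry if present, else the dict itself."""
    return checkpoint.get("state_dict", checkpoint)


def convert_ckpt_to_safetensors(ckpt_path, out_path):
    """Load a pickled checkpoint on the CPU and re-save its tensors as safetensors."""
    # Imported here so the helper above can be used without these packages.
    import torch
    from safetensors.torch import save_file

    checkpoint = torch.load(ckpt_path, map_location="cpu")
    state_dict = extract_state_dict(checkpoint)

    # Keep only tensor entries; clone so no two entries share storage.
    tensors = {
        key: value.detach().clone().contiguous()
        for key, value in state_dict.items()
        if isinstance(value, torch.Tensor)
    }
    save_file(tensors, out_path)


# Hypothetical usage:
# convert_ckpt_to_safetensors("model.ckpt", "model.safetensors")
```

Non-tensor entries in the checkpoint (optimizer state, step counters, and so on) are dropped by this sketch, which is usually what you want for inference.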
Still returns CUDA SETUP: Setup Failed.
Has anyone gotten it to work now?
Same issue here after trying all the suggestions: CUDA SETUP: Setup Failed.
I have the same issue. Can you please fix it?
I’m not sure, but I think this is a problem with the bitsandbytes library. I have opened a ticket here: bitsandbytes-foundation/bitsandbytes#950
Has anyone managed to launch it?
Same issue for me.
Yeah, it works with the fix at bitsandbytes-foundation/bitsandbytes#950
Good day. It broke again. If someone has a working version, please share it.
Yes please
I'd even settle for someone just explaining to me why it's broken so I can try to fix it. I've gotten it to work several times by installing different versions of torch with CUDA, xformers, triton, torchtext, torchaudio, torchvision, and torchdata, and also by installing kaleido, pycairo, tiktoken, and openai. The problem is that I was just throwing shit at a wall and hoping it would stick: I fundamentally don't understand what's happening and just happened to hit pay dirt, so replicating it has proven difficult. Impossible, actually, in the last week specifically. Not sure if another update screwed the pooch on another module, but it's frustrating. I've tried to learn kohya_ss and I'm very, very bad at it; regardless of following tutorials, it never works, or maybe I'm just stupid. Either way, there's no user-friendly (in the lowest sense of the word) choice except this Shivam Colab, which in and of itself took an ungodly amount of trial and error to get to the point where it makes sense to me and gives me good results. Now, though, it seems like no one gives a crap because it's outdated technology, and with LoRA training, now Stable Cascade, and Stable Diffusion 3 right around the corner for a public release, I'm afraid we won't see a fix. It's just frustrating as someone who does this as a hobbyist and not professionally: all the talk about it doesn't provide clearly defined, easy-to-follow solutions. It's all assumptive and predicated on you knowing what everyone is talking about, not a step-by-step, idiot-proof type of guide, which I feel like so many of us need to get good results but are too embarrassed to ask for, or feel like we'll be made fun of or reprimanded somehow if we ask stupid questions. The whole thing is elitist, and it doesn't do anyone any good.
It turns regular quasi-nerds like myself off from diving into this world head first, and you never know, you could be turning off the next visionary who would've written code or developed something that could have changed the game. That's a long shot, but I think you get my point. It's frustrating that there's no information on this, and the solutions that are out there are half-assed, typed out with the assumption that you're already a Python developer, and we're not. We're regular dudes who use this for fun and hobbies, and some of us used to make money training models for people, or use it to get work done for our day jobs. And, look, I get it: it's forced evolution, right? Figure it the eff out, or stop complaining and stop using it. But for something that seems like it should be so damn easy to fix, I just don't understand the lack of anyone even wanting to try to help. It's disheartening. Sorry for the rant, tonight has been really frustrating and I'm no closer to getting pending work done. I've got new characters to train into a model that'll let me finish up a comic book series I've had to back-burner for the last 3 months because of this, and I promise if someone helps me fix it I'll never use the damn software again and stop bugging everyone. Thanks.
Bravo, great speech, but I'm afraid it won't bear fruit. In my opinion, this is not outdated technology: even alongside LoRA, Dreambooth gives very good results in terms of similarity, and so far I have not seen the same similarity from the newer tools. It seems the guy was just cut off from the Internet or taken into the army.
If anyone is interested, I successfully trained my model by installing the requirements as follows:
Thanks for the hint! But this option did not last long; it gives an error again. Maybe some expert will come along and fix it?
Someone on another site suggested adding this to the requirements and it's working for me. Hope it helps.
Describe the bug
The Dreambooth Colab notebook fails at the training stage. Seems to be an issue with bitsandbytes.
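Since the failure appears to come from mismatched package versions, it may help to record what is actually installed before and after changing the requirements cell. The following is a minimal diagnostic sketch using only the standard library. The package list is an assumption drawn from the names mentioned in this thread, not a confirmed list of the culprits.

```python
from importlib import metadata

# Packages named in this thread as possible sources of the mismatch (assumed list).
PACKAGES = ["torch", "bitsandbytes", "xformers", "triton", "torchvision", "torchaudio"]


def installed_versions(names=PACKAGES):
    """Return {package name: version string, or None if it is not installed}."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name:15} {version or 'not installed'}")
```

Pasting this output into a bug report alongside the error makes it easier for others to tell which combination of versions works and which does not.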
Reproduction
Run the Dreambooth Colab notebook. It fails at training.
https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examples/dreambooth/DreamBooth_Stable_Diffusion.ipynb
Logs
System Info
Google Colab