
@fenwang's 7 GB to 5.5 GB (512x512) #25

Open
ClashSAN opened this issue Mar 22, 2023 · 3 comments

@ClashSAN

https://github.com/fengwang/Stable-Diffusion-NCNN
Hi, it looks like this memory optimization makes the model workable on 8 GB laptops (no swap required).

Is this also applicable to Android mobile, specifically?
You mentioned a while ago that the fp16 512x512 version was working on your 12 GB RAM Android device.
Also, is the 256x256 model a fixed-shape model, and is it quantized?

@EdVince
Owner

EdVince commented Mar 22, 2023

I have not updated some of those metrics for a while, so I suggest you actually test them yourself. All the models I use are fp16, not quantized ones.

@ClashSAN
Author

Thanks, my bad: the models were clearly labeled.
I used your diffusers conversion repository successfully. May I ask whether the VAE decoder currently provided via the cloud drive is the separate NAI VAE, or the one built into regular SD?
The int8 and/or uint8 quantization process is easy enough with ONNX, but I don't know how to do it with .pt (pnnx).

Your ncnn implementation is currently as fast as the OpenVINO implementation, and it supports multiple sizes.
I am interested in quantization because your current APK release supports 6 GB devices, and 4 GB devices might become supportable with a quantized model, or perhaps some other low-RAM optimization. I'm not sure.
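For context on the memory argument above, here is a minimal NumPy sketch of generic asymmetric uint8 weight quantization, the same basic affine scheme that ONNX-style dynamic quantization applies per tensor. The helper names are illustrative only and are not part of the ONNX, ncnn, or pnnx toolchains; this just shows why uint8 storage halves fp16 weight size at the cost of a bounded rounding error.

```python
import numpy as np

def quantize_uint8(w: np.ndarray):
    """Affine (asymmetric) uint8 quantization: w ~= scale * (q - zero_point)."""
    w_min, w_max = float(w.min()), float(w.max())
    # Map the full [w_min, w_max] range onto the 256 uint8 levels.
    scale = (w_max - w_min) / 255.0 if w_max > w_min else 1.0
    zero_point = int(round(-w_min / scale))
    q = np.clip(np.round(w / scale) + zero_point, 0, 255).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q: np.ndarray, scale: float, zero_point: int) -> np.ndarray:
    """Recover an approximation of the original float weights."""
    return (q.astype(np.float32) - zero_point) * scale

# fp16 stores 2 bytes per weight; uint8 halves that again,
# at the cost of a per-element error bounded by roughly one scale step.
w = np.random.randn(4, 4).astype(np.float32)
q, scale, zp = quantize_uint8(w)
w_hat = dequantize(q, scale, zp)
print("max abs error:", np.abs(w - w_hat).max())
```

The reconstruction error is bounded by about one quantization step (`scale`), which is why quantizing already-noisy diffusion activations is a separate, harder question than quantizing weights.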

@EdVince
Owner

EdVince commented Mar 24, 2023

But considering that diffusion is an inherently noisy computational process, I don't think quantizing such data will give good results.
