
Use Spandrel for upscaling and face restoration architectures #14425

Merged (7 commits, Dec 30, 2023)

Conversation

akx (Collaborator) commented Dec 25, 2023

Description

This PR yeets most of the copy-pasted or otherwise vendored model architectures in favor of just using Spandrel.

  • Converted models are:

    • CodeFormer
    • ESRGAN
    • GFPGAN
    • RealESRGAN
    • ScuNET
    • SwinIR
  • Not converted is LDSR; it doesn't exist in Spandrel.

  • There's still some more cleanup that could be done – there are multiple implementations of tiled inference right now, for one, and the model loading/downloading/... code is kind of a mess (should continue where I left off with Upscaler model loading cleanup #10823), but I'll hold off on that for this PR.

  • As an added bonus, this adds (experimental, works-on-my-machine) support for HAT upscaling models.

Screenshots/videos:

No visual changes. This seems to Work On My Machine but it'd be lovely if someone else tried this out too.


akx force-pushed the spandrel branch 2 times, most recently from b1a61e9 to e61c70b (December 25, 2023 13:45)
akx force-pushed the spandrel branch 5 times, most recently from ecee8df to 8bfe7cf (December 25, 2023 21:56)
akx marked this pull request as ready for review December 26, 2023 23:45
@@ -185,8 +180,7 @@ def on_ui_settings():

shared.opts.add_option("SWIN_tile", shared.OptionInfo(192, "Tile size for all SwinIR.", gr.Slider, {"minimum": 16, "maximum": 512, "step": 16}, section=('upscaling', "Upscaling")))
shared.opts.add_option("SWIN_tile_overlap", shared.OptionInfo(8, "Tile overlap, in pixels for SwinIR. Low values = visible seam.", gr.Slider, {"minimum": 0, "maximum": 48, "step": 1}, section=('upscaling', "Upscaling")))
if int(torch.__version__.split('.')[0]) >= 2 and platform.system() != "Windows": # torch.compile() require pytorch 2.0 or above, and not on Windows
akx (Collaborator, Author):

We're always torch >= 2.0, and we now just try compile without checking the platform.
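The "just try compile" approach can be sketched as a best-effort wrapper (an illustrative sketch, not the actual webui code; the fallback behavior here is an assumption):

```python
def maybe_compile(model):
    """Attempt torch.compile unconditionally and fall back to the
    uncompiled model if compilation isn't available (sketch only)."""
    try:
        import torch
        return torch.compile(model)
    except Exception:
        # e.g. torch missing a compile backend on this platform;
        # keep the eager model instead of failing hard
        return model
```

Compared with the old version check, this also covers platforms where PyTorch 2.x is installed but compilation support is incomplete.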

Comment on lines 54 to 55
env:
IGNORE_CMD_ARGS_ERRORS: "1"
akx (Collaborator, Author):

The new test will fail without this since a module attempts to read the pytest options as regular webui arguments.

img,
tile_size=opts.ESRGAN_tile,
tile_overlap=opts.ESRGAN_tile_overlap,
# TODO: `outscale`?
akx (Collaborator, Author):

Do we need to downscale too-large images here according to info.scale? AIUI, there might be some other process that also does that?

return Image.fromarray(output, 'RGB')


def upscale_with_model(model, img: Image.Image, *, tile_size: int, tile_overlap: int = 0):
akx (Collaborator, Author):

This is used by both ESRGAN and RealESRGAN.

Comment on lines +131 to +141
def inference(
img,
model,
*,
tile: int,
tile_overlap: int,
window_size: int,
scale: int,
device,
):
akx (Collaborator, Author):

This smells like tiled_upscale, but with tile overlap handled by weight scaling.
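The weight-scaling blend being described can be sketched in 1-D with numpy (an illustrative toy, not the actual SwinIR inference code): per-tile outputs and per-sample weights are accumulated separately, and dividing by the accumulated weights blends the overlapping regions.

```python
import numpy as np

def tiled_apply_1d(x, fn, tile, overlap):
    """Apply `fn` to overlapping tiles of a 1-D array, blending overlaps
    by accumulating outputs and per-sample weights (assumes overlap < tile)."""
    out = np.zeros_like(x, dtype=float)
    weights = np.zeros_like(x, dtype=float)
    step = tile - overlap
    for start in range(0, len(x), step):
        end = min(start + tile, len(x))
        out[start:end] += fn(x[start:end])   # accumulate tile output
        weights[start:end] += 1.0            # count coverage per sample
        if end == len(x):
            break
    return out / weights                     # overlapped samples are averaged
```

With an identity `fn`, the result equals the input exactly, which is the property that hides the seams between tiles.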

@akx akx force-pushed the spandrel branch 2 times, most recently from ce58f5d to 37458e6 Compare December 27, 2023 09:52
@akx akx marked this pull request as ready for review December 27, 2023 10:12
gel-crabs (Contributor):

Oh yeah, I've been using this PR for a couple days now; it works.

akx (Collaborator, Author) commented Dec 29, 2023

@gel-crabs Thanks for trying it out! I (force-)pushed this branch to update spandrel to a newer version, as well as add experimental support for HAT upscalers, if you want to try that out. (You'll need to bring your own models and put them in models/HAT/.)

gel-crabs (Contributor) commented Dec 29, 2023

> @gel-crabs Thanks for trying it out! I (force-)pushed this branch to update spandrel to a newer version, as well as add experimental support for HAT upscalers, if you want to try that out. (You'll need to bring your own models and put them in models/HAT/.)

It works! Admittedly it has issues with deepcache where it adds black splotches to the image during hires fix, but otherwise working.

I tried to hack in support for DAT as well by copying hat_model.py and replacing HAT with DAT, but it just made the image go full black.

Edit: It actually has nothing to do with deepcache, or any extensions at all. I'm going to try testing with different models.

I tried with a different 4x HAT upscaler and it gives full black images, so the HAT support doesn't seem to be working correctly.

AUTOMATIC1111 (Owner):

I'm generally not pumped about adding new dependencies, but this removes a lot of code we just copy pasted, so that seems nice.

Some questions:

  • what's with __init__.py?
  • what's with commented code in webui.py?
  • for tests, on the new machine (which is always the case for github servers), it looks to me like it will download the model. Maybe those tests could be disabled by default? Also since you're not actually checking any changes in faces, we could reuse the existing img2img_basic.png instead of adding a new pic.
  • what happens when you put a checkpoint in a wrong dir? Say, ESRGAN checkpoint into swinir dir. Or a codeformer model into ESRGAN dir?
  • did you test all models you converted to use spandrel?

akx (Collaborator, Author) commented Dec 30, 2023

> I'm generally not pumped about adding new dependencies, but this removes a lot of code we just copy pasted, so that seems nice.

I think this actually leads to fewer dependencies in total (I'll run the numbers later). The Spandrel folks seem nice and responsive too. :)

>   • what's with __init__.py?

Autogenerated by PyCharm when refactoring code. Will yeet, my bad.

>   • what's with commented code in webui.py?

Also accidentally added to this PR (since I was tired of having a gazillion WebUI tabs get auto-opened), my bad. Will yeet.

>   • for tests, on the new machine (which is always the case for github servers), it looks to me like it will download the model.

I can also add an actions/cache action so we cache the models/ directory (like Spandrel's tests do).

> Also since you're not actually checking any changes in faces, we could reuse the existing img2img_basic.png instead of adding a new pic.

Since we use facexlib to detect faces and only act on the face patches, using an image that doesn't have any faces would not exercise the code that actually runs the Spandrel model 😁

I'll add a simple "output image was different" check!

>   • what happens when you put a checkpoint in a wrong dir? Say, ESRGAN checkpoint into swinir dir. Or a codeformer model into ESRGAN dir?

Good question - since Spandrel auto-detects the model arch from the checkpoint, it'd happily load it, and maybe fail with a parameter error down the line when we try to call the architecture with kwargs it doesn't accept. I can add isinstance checks to verify we loaded the correct kind of model (and warn and fail if not) instead of just blindly forging ahead.
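The guard described here could look roughly like this (the architecture classes below are placeholders for illustration, not Spandrel's actual API):

```python
class ArchMismatchError(Exception):
    """Raised when a checkpoint's detected architecture doesn't match
    the directory it was placed in."""

class ESRGANModel: ...   # placeholder standing in for a detected ESRGAN network
class SwinIRModel: ...   # placeholder standing in for a detected SwinIR network

def check_architecture(model, expected_cls, path):
    """Fail fast with a clear message instead of hitting a confusing
    kwargs error deep inside the architecture later."""
    if not isinstance(model, expected_cls):
        raise ArchMismatchError(
            f"{path}: expected {expected_cls.__name__}, "
            f"got {type(model).__name__}"
        )
    return model
```

Each upscaler would call this with the class its directory implies right after loading, so a SwinIR checkpoint dropped into the ESRGAN folder is rejected up front.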

>   • did you test all models you converted to use spandrel?

I did, on my machine (Macbook).

@AUTOMATIC1111 AUTOMATIC1111 merged commit cd12c0e into AUTOMATIC1111:dev Dec 30, 2023
2 of 3 checks passed
@akx akx deleted the spandrel branch December 30, 2023 15:08
wcde commented Jan 3, 2024

Looks like SwinIR x2 is not working now. I get this with every model:

File "...\modules\images.py", line 286, in resize_image
  res = resize(im, width, height)
File "...\modules\images.py", line 278, in resize
  im = upscaler.scaler.upscale(im, scale, upscaler.data_path)
File "...\modules\upscaler.py", line 65, in upscale
  img = self.do_upscale(img, selected_model)
File "...\extensions-builtin\SwinIR\scripts\swinir_model.py", line 48, in do_upscale
  img = upscaler_utils.upscale_2(
File "...\modules\upscaler_utils.py", line 181, in upscale_2
  output = tiled_upscale_2(
File "...\modules\upscaler_utils.py", line 149, in tiled_upscale_2
  ].add_(out_patch)
RuntimeError: The size of tensor a (2560) must match the size of tensor b (1280) at non-singleton dimension 3
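A toy numpy reproduction of the assumed failure mode (the mechanics here are an inference from the traceback, not the actual webui code): the output buffer is allocated using a hard-coded scale of 4, while a 2x model returns half-size patches, matching the 2560-vs-1280 mismatch above.

```python
import numpy as np

hardcoded_scale, model_scale, width = 4, 2, 640
out_row = np.zeros(width * hardcoded_scale)   # buffer row: 2560 samples
out_patch = np.ones(width * model_scale)      # 2x model output: 1280 samples

try:
    out_row += out_patch   # analogous to `].add_(out_patch)` in tiled_upscale_2
except ValueError as exc:
    # numpy's equivalent of torch's size-mismatch RuntimeError
    print("size mismatch:", exc)
```

Sizing the buffer from the model's actual scale rather than a constant removes the mismatch.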

akx (Collaborator, Author) commented Jan 3, 2024

@wcde Thanks, I'll take a peek – what's your SwinIR tile size and overlap setting, and the size of the image you're trying to upscale?

wcde commented Jan 3, 2024

The code hard-codes the scale to 4. It should be something like this:

img = upscaler_utils.upscale_2(
    img,
    model,
    tile_size=shared.opts.SWIN_tile,
    tile_overlap=shared.opts.SWIN_tile_overlap,
    scale=model.scale,
    desc="SwinIR",
)

Second problem: the model is loaded with dtype devices.dtype, but in upscale_2 the input is cast to fp32:

tensor = pil_image_to_torch_bgr(img).float()

Which gives:

RuntimeError: Input type (float) and bias type (struct c10::Half) should be the same
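A minimal sketch of the dtype fix, with numpy standing in for torch (the assumption, based on the error above, is that the fix casts the input to the model weights' dtype instead of unconditionally to float32):

```python
import numpy as np

def cast_to_model_dtype(img_array, weight_array):
    """Match the input's dtype to the model's weights so a half-precision
    model never receives a float32 input (numpy stand-in for the torch fix)."""
    return img_array.astype(weight_array.dtype, copy=False)

img = np.ones((4, 4, 3), dtype=np.float32)   # input image, converted to float32
half_weight = np.zeros(8, dtype=np.float16)  # model loaded in half precision
out = cast_to_model_dtype(img, half_weight)  # now fp16, matching the model
```

The same idea in torch would read the dtype from a model parameter and cast the input tensor to it before inference.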

akx (Collaborator, Author) commented Jan 3, 2024

@wcde In fairness, scale has always been hard-coded to 4 unless I overlooked something.

I'll take a look at the half issue, thanks for pointing it out.

light-and-ray (Contributor):

I guess this will break a lot of extensions after updating. Maybe it should be mentioned in the changelog?
