I integrated BoxDiff into diffusers #14

zjysteven · 2024-05-14T21:29:11Z

Feel free to check out https://github.com/huggingface/diffusers/tree/main/examples/community#stable-diffusion-boxdiff

Example use case:

import torch
from PIL import Image, ImageDraw
from copy import deepcopy

from examples.community.pipeline_stable_diffusion_boxdiff import StableDiffusionBoxDiffPipeline

def draw_box_with_text(img, boxes, names):
    colors = ["red", "olive", "blue", "green", "orange", "brown", "cyan", "purple"]
    img_new = deepcopy(img)
    draw = ImageDraw.Draw(img_new)

    W, H = img.size
    for bid, box in enumerate(boxes):
        draw.rectangle([box[0] * W, box[1] * H, box[2] * W, box[3] * H], outline=colors[bid % len(colors)], width=4)
        draw.text((box[0] * W, box[1] * H), names[bid], fill=colors[bid % len(colors)])
    return img_new

pipe = StableDiffusionBoxDiffPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1-base",
    torch_dtype=torch.float16,
)
pipe.to("cuda")

# example 1
prompt = "as the aurora lights up the sky, a herd of reindeer leisurely wanders on the grassy meadow, admiring the breathtaking view, a serene lake quietly reflects the magnificent display, and in the distance, a snow-capped mountain stands majestically, fantasy, 8k, highly detailed"
phrases = [
    "aurora",
    "reindeer",
    "meadow",
    "lake",
    "mountain"
]
boxes = [[1,3,512,202], [75,344,421,495], [1,327,508,507], [2,217,507,341], [1,135,509,242]]

# example 2
# prompt = "A rabbit wearing sunglasses looks very proud"
# phrases = ["rabbit", "sunglasses"]
# boxes = [[67,87,366,512], [66,130,364,262]]

boxes = [[x / 512 for x in box] for box in boxes]

images = pipe(
    prompt,
    boxdiff_phrases=phrases,
    boxdiff_boxes=boxes,
    boxdiff_kwargs={
        "attention_res": 16,
        "normalize_eot": True
    },
    num_inference_steps=50,
    guidance_scale=7.5,
    generator=torch.manual_seed(42),
    safety_checker=None
).images

draw_box_with_text(images[0], boxes, phrases).save("output.png")

Sierkinhane · 2024-05-15T01:34:53Z

Great! Thanks for your efforts! :)

Qianvenh · 2024-07-20T07:50:30Z

Thanks for your efforts! Can it support the checkpoint of GLIGEN?

zjysteven · 2024-07-20T13:34:41Z

Unfortunately no. One would need to implement another pipeline by referencing the HF's gligen pipeline, which shouldn't be hard though.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I integrated BoxDiff into diffusers #14

I integrated BoxDiff into diffusers #14

zjysteven commented May 14, 2024

Sierkinhane commented May 15, 2024

Qianvenh commented Jul 20, 2024

zjysteven commented Jul 20, 2024

I integrated BoxDiff into diffusers #14

I integrated BoxDiff into diffusers #14

Comments

zjysteven commented May 14, 2024

Sierkinhane commented May 15, 2024

Qianvenh commented Jul 20, 2024

zjysteven commented Jul 20, 2024