From 225dcc87a263ac2172e4a396392a89d2dadcb920 Mon Sep 17 00:00:00 2001 From: Hoe Jiun Tian Date: Mon, 25 Nov 2024 13:39:03 +0800 Subject: [PATCH] Update README.md --- README.md | 9 +++++++++ 1 file changed, 9 insertions(+) diff --git a/README.md b/README.md index e9fd2a0..ff1c088 100644 --- a/README.md +++ b/README.md @@ -71,6 +71,13 @@ 17.32 0.00585 + + XLv1.0 + -- + 35.60 + 16.65 + 0.00560 + @@ -84,12 +91,14 @@ We provide three checkpoints with different training strategies. | v1.0 | HICO-DET | v1.4| [HF Hub](https://huggingface.co/jiuntian/interactiondiffusion-weight/blob/main/interact-diffusion-v1.pth) | | v1.1 | HICO-DET | v1.5| [HF Hub](https://huggingface.co/jiuntian/interactiondiffusion-weight/blob/main/interact-diffusion-v1-1.pth) | | v1.2 | HICO-DET + VisualGenome | v1.5| [HF Hub](https://huggingface.co/jiuntian/interactiondiffusion-weight/blob/main/interact-diffusion-v1-2.pth) | +| XLv1.0 | HICO-DET | XL | coming soon | Note that the experimental results in our paper is referring to v1.0. - v1.0 is based on Stable Diffusion v1.4 and GLIGEN. We train at batch size of 16 for 250k steps on HICO-DET. **Our paper is based on this.** - v1.1 is based on Stable Diffusion v1.5 and GLIGEN. We train at batch size of 32 for 250k steps on HICO-DET. - v1.1 is based on InteractDiffusion v1.1. We train further at batch size of 32 for 172.5k steps on HICO-DET and VisualGenome. +- XLv1.0 is based on StableDiffusion XL v1.0 and GLIGEN-XL (which we have trained it). We train InteractDiffusion XL at batch size of 32 for 250k steps on HICO-DET, at 512x512 resolution. More details is coming soon. ## Extension for AutomaticA111's Stable Diffusion WebUI