ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis

Fangshuo Zhou^† · Huaxia Li^† · Rui Hu · Sensen Wu

Hailin Feng · Zhenhong Du · Liuchang Xu^*

^*Corresponding author ^†Equal contribution

📢 News

2024-09-26: Added the preprint to arXiv.
2024-09-24: Uploaded sample data so that visitors can run inference with the model.
2024-09-19: Uploaded the model to Hugging Face.
2024-09-13: Created the official ControlCity GitHub repository.

📦 Repository

Clone the repository (requires git) and install the dependencies:

git clone https://github.com/fangshuoz/ControlCity.git
cd ControlCity
pip install -r requirements.txt

🚀 Quickstart

from PIL import Image
from controlcity import (
    OSMControlNetModel,
    DiffusionOSMControlnetPipeline,
    metadata_normalize,
    convert_binary,
)
from diffusers import UniPCMultistepScheduler
import torch

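# load pipeline
# NOTE: trained_controlnet_model_path, sdxl_model_path and trained_lora_model_path
# are not defined in this snippet; set them to the locations of your weights first.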
controlnet = OSMControlNetModel.from_pretrained(
    trained_controlnet_model_path,
    torch_dtype=torch.float16, use_safetensors=True,
    low_cpu_mem_usage=False, device_map=None
)
pipe = DiffusionOSMControlnetPipeline.from_pretrained(
    sdxl_model_path,
    controlnet=controlnet,
    torch_dtype=torch.float16,
    use_safetensors=True,
)
pipe.load_lora_weights(
    trained_lora_model_path,
)
pipe.scheduler = UniPCMultistepScheduler.from_config(pipe.scheduler.config)
pipe.to('cuda:1')  # second GPU; use 'cuda' or 'cuda:0' on a single-GPU machine

# load conditions (text, metadata, condition images, etc.)
metadata = [-122.3382568359375, 47.61727258456622]  # [longitude, latitude]
prompt = "A black and white map of city buildings, Located in Seattle, Mostly urban area with numerous buildings, parking lots, ..."
image_road = Image.open('road/15/Seattle/5248_11443.png').convert("RGB")
image_landuse = Image.open('landuse/15/Seattle/5248_11443.png').convert("RGB")

# normalize the coordinates before passing them to the pipeline
metadata = metadata_normalize(metadata).tolist()

# inference
image = pipe(
    prompt=prompt,
    metadata=metadata,
    negative_prompt="Low quality.",
    image_road=image_road,
    image_landuse=image_landuse,
    guidance_scale=5.0,
    num_inference_steps=25,
    generator=torch.manual_seed(42)
).images[0]

# binarize the generated image (thr is the threshold)
image_bin = convert_binary(image, thr=60, mode="RGB", image_landuse=image_landuse)[0]
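
The last step above turns the generated image into a binary building mask using a threshold (`thr=60`). To make the idea concrete, here is a minimal pure-Python sketch of thresholding. It is not the implementation of `convert_binary`: the `binarize` helper is hypothetical, whether the real function compares with `>` or `>=` is an assumption, and how it uses `image_landuse` is not covered here.

```python
from typing import List


def binarize(gray: List[List[int]], thr: int = 60) -> List[List[int]]:
    """Map each 0-255 grayscale value to 255 if it exceeds thr, else 0.

    Hypothetical helper for illustration only; the strict '>' comparison
    is an assumption, not taken from the ControlCity code.
    """
    return [[255 if value > thr else 0 for value in row] for row in gray]


# A 2x3 toy "image": values above 60 become white (255), the rest black (0).
sample = [[0, 59, 60], [61, 128, 255]]
binary = binarize(sample, thr=60)
print(binary)
```

A lower threshold keeps fainter pixels as foreground, while a higher one keeps only the brightest pixels.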

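The Quickstart loads the road and land-use condition images from paths that differ only in their first component, so both should point at the same tile. A small helper can keep the two in sync. This is a sketch under stated assumptions: the layout `<layer>/<zoom>/<city>/<x>_<y>.png` is inferred from the example paths (reading `15` as a zoom level and `5248_11443` as tile indices is a guess), and `tile_path` and `condition_paths` are hypothetical names that are not part of the `controlcity` package.

```python
from pathlib import PurePosixPath
from typing import Dict, Sequence


def tile_path(layer: str, zoom: int, city: str, x: int, y: int) -> str:
    """Build '<layer>/<zoom>/<city>/<x>_<y>.png' (assumed layout)."""
    return str(PurePosixPath(layer) / str(zoom) / city / f"{x}_{y}.png")


def condition_paths(
    zoom: int,
    city: str,
    x: int,
    y: int,
    layers: Sequence[str] = ("road", "landuse"),
) -> Dict[str, str]:
    """Return one path per condition layer, all for the same tile."""
    return {layer: tile_path(layer, zoom, city, x, y) for layer in layers}


# Same tile as in the Quickstart example.
paths = condition_paths(15, "Seattle", 5248, 11443)
print(paths["road"])
print(paths["landuse"])
```

The resulting strings can be passed to `Image.open(...)` exactly as in the Quickstart.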