Denis Zavadski* · Damjan Kalšan* · Carsten Rother
Computer Vision and Learning Lab,
IWR, Heidelberg University
*equal contribution
ACCV 2024
PrimeDepth is a diffusion-based monocular depth estimator which leverages the rich representation of the visual world stored within Stable Diffusion. This representation, termed *preimage*, is extracted in a single diffusion step from frozen Stable Diffusion 2.1 and adjusted towards depth prediction. PrimeDepth yields detailed predictions while simultaneously being fast at inference time due to the single-step approach.
This is an inference codebase for PrimeDepth based on Stable Diffusion 2.1. Further details and visual examples can be found on the project page.
- Create and activate a virtual environment:

  ```bash
  conda create -n PrimeDepth python=3.9
  conda activate PrimeDepth
  ```

- Install dependencies:

  ```bash
  pip3 install -r requirements.txt
  ```

- Download the weights.

- Adjust the attribute `ckpt_path` in `configs/inference.yaml` to point to the downloaded weights from the previous step (a quick sanity check for this path is sketched below).
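Before running inference, it can be worth verifying that the config actually points to the downloaded checkpoint. The following is a minimal sketch under the assumption that `configs/inference.yaml` is plain YAML with `ckpt_path` readable as a top-level key; the real config may nest it differently, in which case the lookup needs to be adjusted:

```python
import os
import yaml  # provided by PyYAML

# Assumption: ckpt_path is a top-level key in configs/inference.yaml;
# adjust the lookup if the config nests it under another section.
with open("./configs/inference.yaml") as f:
    cfg = yaml.safe_load(f) or {}

ckpt_path = cfg.get("ckpt_path", "")
if not os.path.isfile(ckpt_path):
    raise FileNotFoundError(f"ckpt_path does not point to the downloaded weights: {ckpt_path}")
print(f"Using checkpoint: {ckpt_path}")
```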
Inference can then be run as follows:

```python
from scripts.utils import InferenceEngine

config_path = "./configs/inference.yaml"
image_path = "./images/comparisons/vertical_resized/goodBoy.png"

ie = InferenceEngine(pd_config_path=config_path, device="cuda")

depth_ssi, depth_color = ie.predict(image_path)
```
PrimeDepth predicts in inverse space. The raw model predictions are stored in `depth_ssi`, while a colorized prediction `depth_color` is precomputed for visualization convenience:

```python
depth_color.save("goodBoy_primedepth.png")
```
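If the raw prediction is needed beyond the precomputed visualization, something along the following lines may be useful. It is only a sketch and assumes `depth_ssi` is (or can be converted to) a 2D NumPy array, which this snippet of the API does not guarantee:

```python
import numpy as np
from PIL import Image

# Assumption: depth_ssi is a 2D NumPy array of relative inverse depth
# (if it is a torch tensor, convert it first, e.g. depth_ssi.squeeze().cpu().numpy()).
ssi = np.asarray(depth_ssi, dtype=np.float32)

# Save the raw prediction losslessly for later evaluation or alignment.
np.save("goodBoy_ssi.npy", ssi)

# Simple min-max normalization to 8 bit for a quick grayscale preview;
# larger values correspond to closer regions because the prediction is in inverse space.
ssi_norm = (ssi - ssi.min()) / (ssi.max() - ssi.min() + 1e-8)
Image.fromarray((ssi_norm * 255).astype(np.uint8)).save("goodBoy_ssi_gray.png")
```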
If you find PrimeDepth useful, please cite:

```bibtex
@misc{zavadski2024primedepth,
      title={PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage},
      author={Denis Zavadski and Damjan Kalšan and Carsten Rother},
      year={2024},
      eprint={2409.09144},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2409.09144},
}
```