PROJECT NOT UNDER ACTIVE MANAGEMENT

This project will no longer be maintained by Intel.
Intel has ceased development and contributions including, but not limited to, maintenance, bug fixes, new releases, or updates, to this project.
Intel no longer accepts patches to this project.
If you have an ongoing need to use this project, are interested in independently developing it, or would like to maintain patches for the open source software community, please create your own fork of this project.

Vision Transformers for Dense Prediction

This repository contains code and models for our paper:

Vision Transformers for Dense Prediction
René Ranftl, Alexey Bochkovskiy, Vladlen Koltun

Changelog

[March 2021] Initial release of inference code and models

Setup

Download the model weights and place them in the weights folder:

Monodepth:

Segmentation:

Set up dependencies:
```
pip install -r requirements.txt
```
The code was tested with Python 3.7, PyTorch 1.8.0, OpenCV 4.5.1, and timm 0.4.5.
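
To compare a local environment against the tested versions listed above, a small check along the following lines may help. This is only a sketch: the distribution names (`torch`, `opencv-python`, `timm`) are assumptions about how the packages were installed, `importlib.metadata` requires Python 3.8 or newer, and a mismatch does not necessarily mean the code will fail.

```python
from importlib import metadata  # standard library in Python 3.8+

# Versions the README reports as tested. The distribution names are assumptions;
# OpenCV in particular may be installed under a different name.
TESTED_VERSIONS = {
    "torch": "1.8.0",
    "opencv-python": "4.5.1",
    "timm": "0.4.5",
}


def installed_versions(names):
    """Return {name: version string, or None if the package is not installed}."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def find_mismatches(installed, tested=TESTED_VERSIONS):
    """Return {name: (installed, tested)} for packages that differ from the tested version.

    Only the leading components are compared, so build suffixes such as
    "1.8.0+cu111" or "4.5.1.48" still count as a match.
    """
    mismatches = {}
    for name, wanted in tested.items():
        have = installed.get(name)
        wanted_parts = wanted.split(".")
        have_parts = have.split("+")[0].split(".") if have else []
        if have_parts[: len(wanted_parts)] != wanted_parts:
            mismatches[name] = (have, wanted)
    return mismatches


if __name__ == "__main__":
    report = find_mismatches(installed_versions(TESTED_VERSIONS))
    for name, (have, wanted) in report.items():
        print(f"{name}: installed {have}, tested with {wanted}")
```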

Usage

Place one or more input images in the folder input.
Run a monocular depth estimation model:
```
python run_monodepth.py
```
Or run a semantic segmentation model:
```
python run_segmentation.py
```
The results are written to the folders output_monodepth and output_semseg, respectively.

Use the flag -t to switch between different models. Possible options are dpt_hybrid (default) and dpt_large.
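
To run a script once per model type, the invocation can be wrapped in a few lines of Python. The sketch below uses only the `-t` flag and the two model names mentioned above; the helper names `build_command` and `run_all` are hypothetical and not part of this repository. Since every run writes to the same output folder, results from an earlier run may be overwritten by a later one.

```python
import subprocess
import sys

# Model types named in this README for the -t flag.
MODEL_TYPES = ("dpt_hybrid", "dpt_large")


def build_command(script, model_type):
    """Build the argument list for one run, e.g. python run_monodepth.py -t dpt_large."""
    if model_type not in MODEL_TYPES:
        raise ValueError(f"unknown model type: {model_type!r}")
    return [sys.executable, script, "-t", model_type]


def run_all(script="run_monodepth.py", dry_run=True):
    """Run the script once per model type.

    With dry_run=True (the default) nothing is executed and the
    commands are only returned, which is useful for inspection.
    """
    commands = [build_command(script, m) for m in MODEL_TYPES]
    if not dry_run:
        for cmd in commands:
            # check=True raises CalledProcessError if a run fails.
            subprocess.run(cmd, check=True)
    return commands
```

Calling `run_all("run_segmentation.py", dry_run=False)` from the repository root would run the segmentation script with both model types in turn.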

Additional models:

Monodepth finetuned on KITTI: dpt_hybrid_kitti-cb926ef4.pt (mirror available)
Monodepth finetuned on NYUv2: dpt_hybrid_nyu-2ce69ec7.pt (mirror available)

Run with:

python run_monodepth.py -t [dpt_hybrid_kitti|dpt_hybrid_nyu]
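
Before running one of the finetuned models, it can be useful to confirm that the corresponding weight file has been downloaded. The following is a minimal sketch that assumes the files keep the names listed above and are placed in the weights folder described under Setup; `missing_weights` is a hypothetical helper, not part of this repository.

```python
from pathlib import Path

# File names as listed in this README for the additional models.
ADDITIONAL_WEIGHTS = {
    "dpt_hybrid_kitti": "dpt_hybrid_kitti-cb926ef4.pt",
    "dpt_hybrid_nyu": "dpt_hybrid_nyu-2ce69ec7.pt",
}


def missing_weights(weights_dir="weights", wanted=ADDITIONAL_WEIGHTS):
    """Return the model types whose weight file is not present in weights_dir."""
    root = Path(weights_dir)
    return sorted(
        model for model, filename in wanted.items()
        if not (root / filename).is_file()
    )


if __name__ == "__main__":
    for model in missing_weights():
        print(f"weights for {model} not found: {ADDITIONAL_WEIGHTS[model]}")
```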

Evaluation

Hints on how to evaluate monodepth models can be found here: https://github.com/intel-isl/DPT/blob/main/EVALUATION.md

Citation

Please cite our papers if you use this code or any of the models.

@article{Ranftl2021,
	author    = {Ren\'{e} Ranftl and Alexey Bochkovskiy and Vladlen Koltun},
	title     = {Vision Transformers for Dense Prediction},
	journal   = {ArXiv preprint},
	year      = {2021},
}

@article{Ranftl2020,
	author    = {Ren\'{e} Ranftl and Katrin Lasinger and David Hafner and Konrad Schindler and Vladlen Koltun},
	title     = {Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer},
	journal   = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
	year      = {2020},
}

Acknowledgements

Our work builds on and uses code from timm and PyTorch-Encoding. We'd like to thank the authors for making these libraries available.

License

MIT License

