PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

[paper]

⭐️ Highlights

TL;DR: We propose PointOBB-v2, a simpler, faster, and stronger method to generate pseudo rotated boxes from points without relying on any other prior. Compared to the previous state-of-the-art, PointOBB, it trains 15.58x faster and improves accuracy by 11.60%/25.15%/21.19% on the DOTA-v1.0/v1.5/v2.0 datasets.

[Figure: overall pipeline of PointOBB-v2]

🛠️ Installation

Please refer to the MMRotate installation guide; the steps are copied here for convenience.

conda create -n open-mmlab python=3.7 pytorch==1.7.0 cudatoolkit=10.1 torchvision -c pytorch -y
conda activate open-mmlab
pip install openmim
mim install mmcv-full
mim install mmdet
git clone https://github.com/taugeren/PointOBB-v2.git
cd mmrotate
pip install -r requirements/build.txt
pip install -v -e .

📊 Data Preparation

Please follow data_preparation to prepare the data in the required format.

🏋️ Train CPM

If you want to visualize the CPM results during training, set visualize=True in the training config.

Please also modify the visualization output directory path in the config.

With n GPUs and a batch size of m per GPU, set the learning rate to 0.0125 * n * m (see the sketch below).
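
A minimal sketch of the relevant config fields, assuming 2 GPUs and 2 samples per GPU; the keys follow the standard MMDetection/MMRotate config convention, the visualize flag is the repo-specific option mentioned above, and the exact layout in train_cpm_dotav10.py may differ:

# 2 GPUs x 2 samples per GPU -> lr = 0.0125 * 2 * 2 = 0.05
optimizer = dict(type='SGD', lr=0.05, momentum=0.9, weight_decay=0.0001)
data = dict(samples_per_gpu=2, workers_per_gpu=2)
# Enable CPM visualization during training; the directory below is illustrative
visualize = True
visualize_dir = 'work_dirs/cpm_dotav10/vis'  # hypothetical key name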

For single GPU

# Basic format: python tools/train.py ${CONFIG_FILE} [optional arguments]
python tools/train.py configs/pointobbv2/train_cpm_dotav10.py --work-dir work_dirs/cpm_dotav10 --gpu-ids 0

For multiple GPU

# Basic format: ./tools/dist_train.sh ${CONFIG_FILE} ${GPU_NUM} [optional arguments]
CUDA_VISIBLE_DEVICES=0,1 PORT=29801 ./tools/dist_train.sh configs/pointobbv2/train_cpm_dotav10.py 2

🏋️ Generate Pseudo Label

Please modify the directory paths in the config (an illustrative sketch is given below).
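
A purely illustrative sketch of the kind of path to adjust; the key name below is hypothetical, so match it to whatever field generate_pseudo_label_dotav10.py actually exposes:

# Hypothetical key: directory where the generated pseudo labels are written
pseudo_label_dir = 'work_dirs/cpm_dotav10/pseudo_labels'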

For single GPU

# Basic format: python tools/train.py ${CONFIG_FILE} --resume-from ${CPM_CHECKPOINT_FILE} [other arguments]
python tools/train.py configs/pointobbv2/generate_pseudo_label_dotav10.py --resume-from work_dirs/cpm_dotav10/epoch_6.pth --work-dir work_dirs/cpm_dotav10 --gpu-ids 0

For multiple GPU

# Basic format: ./tools/dist_train_resume.sh ${CONFIG_FILE} ${CPM_CHECKPOINT_FILE} ${GPU_NUM} [optional arguments]
CUDA_VISIBLE_DEVICES=0,1 PORT=29801 ./tools/dist_train_resume.sh configs/pointobbv2/generate_pseudo_label_dotav10.py work_dirs/cpm_dotav10/epoch_6.pth 2

🏋️ Train Detector

You can train any oriented object detector available in MMRotate by using the corresponding config.

Please modify the pseudo-label path in the config file (see the sketch below).
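
A minimal sketch of what to change, assuming the pseudo labels were written under work_dirs/cpm_dotav10; the keys follow the usual MMRotate dataset config layout, and the paths are illustrative, so match them to the actual fields in redet_dotav10.py:

# Illustrative paths in configs/pointobbv2/redet_dotav10.py
data = dict(
    train=dict(
        ann_file='work_dirs/cpm_dotav10/pseudo_labels/',  # generated pseudo rotated boxes
        img_prefix='data/split_ss_dota/trainval/images/'))  # original DOTA training images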

For example, using ReDet, the training command is:

# single GPU
python tools/train.py configs/pointobbv2/redet_dotav10.py --work-dir work_dirs/redet_dotav10 --gpu-ids 0

# multiple GPU
CUDA_VISIBLE_DEVICES=0,1 PORT=29801 ./tools/dist_train.sh configs/pointobbv2/redet_dotav10.py 2

The testing command is:

# single GPU
python tools/test.py work_dirs/redet_dotav10/redet_dotav10.py work_dirs/redet_dotav10/epoch_12.pth --gpu-ids 0 --format-only --eval-options submission_dir=testmodel/redet_dotav10_epoch12

# multiple GPU
CUDA_VISIBLE_DEVICES=0,1,2,3 PORT=29816 tools/dist_test1.sh work_dirs/redet_dotav10/redet_dotav10.py work_dirs/redet_dotav10/epoch_12.pth 4

🚀 Released Models

| Dataset | Config | Log | Checkpoint | mAP |
| --- | --- | --- | --- | --- |
| DOTA-v1.0 | config | quark / hugging face | quark / hugging face | 44.85 |
| DOTA-v1.5 | config | quark / hugging face | quark / hugging face | 36.39 |
| DOTA-v2.0 | config | quark / hugging face | quark / hugging face | 27.22 |

🖊️ Citation

If you find this work helpful for your research, please consider giving this repo a star ⭐ and citing our papers:

@article{pointobbv2,
  title={PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection},
  author={Ren, Botao and Yang, Xue and Yu, Yi and Luo, Junwei and Deng, Zhidong},
  journal={arXiv preprint arXiv:2410.08210},
  year={2024}
}

@inproceedings{pointobb,
  title={PointOBB: Learning Oriented Object Detection via Single Point Supervision},
  author={Luo, Junwei and Yang, Xue and Yu, Yi and Li, Qingyun and Yan, Junchi and Li, Yansheng},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={16730--16740},
  year={2024}
}

@inproceedings{point2rbox,
  title={Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision},
  author={Yu, Yi and Yang, Xue and Li, Qingyun and Da, Feipeng and Dai, Jifeng and Qiao, Yu and Yan, Junchi},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={16783--16793},
  year={2024}
}

📃 License

This project is released under the Apache License 2.0.
