
[ICCV'25 Oral] The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion


happyw1nd/ScoreLiDAR


🔥ScoreLiDAR🔥

by Shengyuan Zhang1, An Zhao1, Ling Yang2, Zejian Li*1, Chenye Meng1, Haoran Xu3, Tianrun Chen1, AnYang Wei3, Perry Pengyun GU3, Lingyun Sun1

1Zhejiang University 2Peking University 3Zhejiang Green Zhixing Technology Co., Ltd

Abstract

Diffusion models have been applied to 3D LiDAR scene completion due to their strong training stability and high completion quality. However, the slow sampling speed limits the practical application of diffusion-based scene completion models since autonomous vehicles require an efficient perception of surrounding environments. This paper proposes a novel distillation method tailored for 3D LiDAR scene completion models, dubbed ScoreLiDAR, which achieves efficient yet high-quality scene completion. ScoreLiDAR enables the distilled model to sample in significantly fewer steps after distillation. To improve completion quality, we also introduce a novel Structural Loss, which encourages the distilled model to capture the geometric structure of the 3D LiDAR scene. The loss contains a scene-wise term constraining the holistic structure and a point-wise term constraining the key landmark points and their relative configuration. Extensive experiments demonstrate that ScoreLiDAR significantly accelerates the completion time from 30.55 to 5.37 seconds per frame (>5×) on SemanticKITTI and achieves superior performance compared to state-of-the-art 3D LiDAR scene completion models.

Environment setup

The following commands are tested with Python 3.8 and CUDA 11.1.

Install required packages:

```shell
sudo apt install build-essential python3-dev libopenblas-dev
pip3 install -r requirements.txt
```

Install MinkowskiEngine for sparse tensor processing:

```shell
pip3 install -U MinkowskiEngine==0.5.4 --install-option="--blas=openblas" -v --no-deps
```

Set up the code from the repository's main directory:

```shell
pip3 install -U -e .
```
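A quick sanity check that the key dependencies import correctly can save debugging time later. This sketch assumes the package names `torch` and `MinkowskiEngine` as installed by the steps above; adjust the list if your setup differs:

```python
import importlib.util

def check_modules(names):
    """Return a dict mapping each module name to whether it can be imported."""
    return {name: importlib.util.find_spec(name) is not None for name in names}

# Modules the pipeline depends on per the installation steps above.
for name, ok in check_modules(["torch", "MinkowskiEngine"]).items():
    print(f"{name}: {'OK' if ok else 'MISSING'}")
```

If MinkowskiEngine reports MISSING, re-check that openblas headers were available when it was built.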

Inference

First, download the models `ScoreLiDAR_diff_net.ckpt` and `refine_net.ckpt` from here and place them in the following directory:

```
checkpoints/*.ckpt
```

Alternatively, you can download them from our Hugging Face repo with:

```shell
huggingface-cli download happywind/ScoreLiDAR ScoreLiDAR_diff_net.ckpt --local-dir checkpoints
huggingface-cli download happywind/ScoreLiDAR refine_net.ckpt --local-dir checkpoints
```

Then run the inference script with the following command:

```shell
python3 tools/diff_completion_pipeline.py --denoising_steps 8 --cond_weight 3.5
```
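The `--cond_weight` flag suggests a conditioning scale applied during denoising. As an illustration only (assuming it behaves like a classifier-free-guidance-style weight, which this README does not confirm), the blending of conditional and unconditional score estimates can be sketched as:

```python
def guided_score(uncond, cond, weight):
    """Blend per-point score estimates: weight=0 gives the unconditional
    estimate, weight=1 the conditional one, and weight>1 extrapolates
    toward the condition (here, the input LiDAR scan)."""
    return [u + weight * (c - u) for u, c in zip(uncond, cond)]

# Toy example with scalar "scores" per point and weight 3.5 as in the command above.
print(guided_score([0.0, 1.0], [1.0, 3.0], 3.5))  # -> [3.5, 8.0]
```

A larger weight pushes completions to follow the partial input scan more closely, at the cost of diversity.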

This script reads all .ply files under the scorelidar/Datasets/test/ directory as model input and saves the results under scorelidar/results/. We provide a sample .ply file for inference.

You can visualize the result with the following command:

```shell
python3 vis_pcd.py -p <path_to_.ply_file>
```
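If you want to inspect a result without a viewer, an ASCII-format .ply point cloud can be parsed with a few lines of plain Python. This is a minimal sketch that handles only ASCII PLY files whose first three vertex properties are x/y/z (binary PLY files need a proper library):

```python
def read_ascii_ply(path):
    """Parse an ASCII PLY file and return a list of (x, y, z) tuples."""
    with open(path) as f:
        lines = f.read().splitlines()
    n_vertices = 0
    body = []
    for i, line in enumerate(lines):
        if line.startswith("element vertex"):
            n_vertices = int(line.split()[-1])  # vertex count from the header
        if line.strip() == "end_header":
            body = lines[i + 1 : i + 1 + n_vertices]  # vertex rows follow the header
            break
    return [tuple(float(v) for v in row.split()[:3]) for row in body]
```

For example, `len(read_ascii_ply("scorelidar/results/sample.ply"))` gives the number of completed points (the path is illustrative).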

Training

We used the SemanticKITTI dataset for training. It has to be downloaded from the official site and extracted into the following structure:

```
./scorelidar/
└── Datasets/
    └── SemanticKITTI/
        └── dataset/
            └── sequences/
                ├── 00/
                │   ├── velodyne/
                │   │   ├── 000000.bin
                │   │   ├── 000001.bin
                │   │   └── ...
                │   └── labels/
                │       ├── 000000.label
                │       ├── 000001.label
                │       └── ...
                ├── 08/ # for validation
                ├── 11/ # 11-21 for testing
                └── 21/
                    └── ...
```

Ground truth scenes are not provided explicitly in SemanticKITTI. To generate the complete ground truth scenes, run the map_from_scans.py script. It uses the dataset scans and poses to generate the sequence map used as ground truth during training:

```shell
python3 map_from_scans.py --path Datasets/SemanticKITTI/dataset/sequences/
```
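Conceptually, this aggregation transforms each scan into a shared map frame using its pose and concatenates the results. A minimal sketch of that step in plain Python (using 4x4 row-major homogeneous poses; the actual script's implementation details may differ):

```python
def transform_points(points, pose):
    """Apply a 4x4 homogeneous pose (row-major nested lists) to (x, y, z) points."""
    out = []
    for x, y, z in points:
        p = (x, y, z, 1.0)  # homogeneous coordinates
        out.append(tuple(sum(pose[r][c] * p[c] for c in range(4)) for r in range(3)))
    return out

def aggregate_scans(scans, poses):
    """Concatenate all scans after mapping each into the shared map frame."""
    merged = []
    for scan, pose in zip(scans, poses):
        merged.extend(transform_points(scan, pose))
    return merged
```

The merged cloud then serves as the dense "complete" scene that the sparse single-scan inputs are trained to match.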

We use the pre-trained model of LiDiff as the teacher. Download the teacher model weights `diff_net.ckpt` from here or from the official LiDiff release and place them at `checkpoints/diff_net.ckpt`.

Alternatively, you can download it from our Hugging Face repo with:

```shell
huggingface-cli download happywind/ScoreLiDAR diff_net.ckpt --local-dir checkpoints
```

Once the sequence map is generated and the teacher model is ready, you can train the model.

The training configuration is defined in config/config.yaml, and training can be started with:

```shell
python3 train.py
```

Citation

If you find our paper useful or relevant to your research, please cite it:

```bibtex
@article{zhang2024distillingdiffusionmodels,
    title={Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion},
    author={Shengyuan Zhang and An Zhao and Ling Yang and Zejian Li and Chenye Meng and Haoran Xu and Tianrun Chen and AnYang Wei and Perry Pengyun GU and Lingyun Sun},
    journal={arXiv preprint arXiv:2412.03515},
    year={2024}
}
```

Credits

ScoreLiDAR builds heavily on the following open-source project:

LiDiff: Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion
