Is there a config for the large occupancy model? #17

Open
WuDianQiBian opened this issue Feb 25, 2023 · 3 comments

Comments

@WuDianQiBian

WuDianQiBian commented Feb 25, 2023

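Hello authors, first of all, thank you very much for this excellent work.
The paper says that the TPV resolution used when training occupancy is 200x200x16 and the dim is 128. However, in tpv04_occupancy.py the TPV resolution is 100x100x8 and the dim is 256:
[image]

Could you upload a new config, ideally one consistent with the paper, so that it is easier for everyone to reproduce the results?
Many thanks!

(By the way, the newly uploaded visualization code is in the visualization folder, but some package imports use the name visualize. It is a minor bug, just so you know.)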

@huang-yh
Collaborator

huang-yh commented Feb 25, 2023

Sorry for the confusion. In the paper we actually use a 100x100x8 TPV resolution with a 256-dimensional feature for 3D semantic occupancy prediction, and a 200x200x16 TPV resolution with a 128-dimensional feature for LiDAR segmentation. Note that the resolution of the TPV planes is not tied to the voxel resolution used for visualization, since we can upsample the TPV planes at test time, as shown in Fig. 6.
That said, finer details could be expected if higher-resolution TPV planes are used.
Also, thanks for reporting the bug.
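
For readers comparing the two settings, here is a rough sketch of the arithmetic. It is not taken from the repository: the helper names and the plane naming (HW, ZH, WZ) are assumptions made for illustration, and nearest-neighbor repetition is used as a simple stand-in for whatever interpolation the actual code applies when upsampling the planes.

```python
# Illustrative only: helper names and plane naming are assumptions,
# not identifiers from the TPVFormer code base.

def tpv_plane_shapes(h, w, z):
    """Shapes of the three orthogonal planes for an h x w x z TPV resolution."""
    return {"HW": (h, w), "ZH": (z, h), "WZ": (w, z)}

def feature_count(h, w, z, dim):
    """Total number of feature values stored across the three planes."""
    cells = sum(a * b for a, b in tpv_plane_shapes(h, w, z).values())
    return cells * dim

def upsample_nearest(plane, scale):
    """Upsample a 2D plane (list of rows) by repeating each cell scale times
    along both axes. Nearest-neighbor is an assumed, simplified choice."""
    out = []
    for row in plane:
        wide = [v for v in row for _ in range(scale)]
        out.extend(list(wide) for _ in range(scale))
    return out

# The two settings mentioned in the reply above.
occupancy = feature_count(100, 100, 8, 256)   # 3D semantic occupancy prediction
lidarseg = feature_count(200, 200, 16, 128)   # LiDAR segmentation

print(occupancy, lidarseg)
print(upsample_nearest([[1, 2], [3, 4]], 2))
```

Under these assumptions, the 200x200x16 setting with 128 channels stores twice as many plane feature values as the 100x100x8 setting with 256 channels, and a 2x upsample of a 100x100 plane reaches the 200x200 grid.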

@WuDianQiBian
Author

I see. Thanks for the reply.
I understand that the TPV planes can be upsampled at test time. But is there any reason not to use 200x200x16 for training in your paper? Did you observe only minor improvements when going from 100x100x8 (with 2x upsampling at test time) to 200x200x16 (no upsampling at test time)?

@huang-yh
Collaborator

In fact, when training with a resolution of 200x200x16, we did not notice a substantial qualitative improvement in the visualizations. We think this might be due to the sparse nature of the LiDAR supervision, which becomes even sparser at higher resolution.
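
The sparsity argument can be illustrated with a small toy experiment. This is only a sketch under loud assumptions: the points are uniformly random in a unit cube, which is not how real LiDAR returns are distributed, and the point count and function name are made up for the example. It only shows that with a fixed set of points, the fraction of voxels that receive any supervision drops as the grid gets finer.

```python
import random

def occupied_fraction(points, grid):
    """Fraction of voxels in a unit cube that contain at least one point.

    points: iterable of (x, y, z) with coordinates in [0, 1)
    grid:   (nx, ny, nz) voxel resolution
    """
    nx, ny, nz = grid
    occupied = set()
    for x, y, z in points:
        # Clamp the index so a coordinate at the upper boundary stays in range.
        ix = min(int(x * nx), nx - 1)
        iy = min(int(y * ny), ny - 1)
        iz = min(int(z * nz), nz - 1)
        occupied.add((ix, iy, iz))
    return len(occupied) / (nx * ny * nz)

# Synthetic stand-in for a LiDAR sweep (assumption: uniform random points).
rng = random.Random(0)
points = [(rng.random(), rng.random(), rng.random()) for _ in range(20000)]

coarse = occupied_fraction(points, (100, 100, 8))
fine = occupied_fraction(points, (200, 200, 16))
print(f"coarse: {coarse:.4f}, fine: {fine:.4f}")
```

The fine grid has 8 times as many voxels but the same number of points, so a smaller share of its voxels is supervised. Real LiDAR points cluster on surfaces, so the actual numbers would differ, but the same trend would be expected.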
