
Solution for Fast Voxelization of Large Point Clouds in Single Frames in FUTR3D #63

Open
Small-NengNeng opened this issue Dec 19, 2024 · 0 comments


Dear Author,

Thank you for providing this fascinating framework. I am currently attempting to adapt FUTR3D for a new dataset, but I have encountered some challenges, particularly during the voxelization process of the point cloud.

So far, we have aligned our data format with nuScenes and can train the model successfully. However, training our LiDAR-only setup takes over a month (35 days for 24 epochs), roughly 8x longer than training on the nuScenes dataset (4 days for 24 epochs). We are using 4x A100 80 GB GPUs.

Our single-frame point clouds contain over 1M points, whereas a single nuScenes frame has only about 260k points. After debugging, we identified the main bottleneck in the following function:
futr3d/plugin/futr3d/models/detectors/futr3d.py

    def voxelize(self, points):
        voxels, coors, num_points = [], [], []
        for res in points:
            res_voxels, res_coors, res_num_points = self.pts_voxel_layer(res)  # <-- bottleneck here
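
To confirm where the time goes, here is a minimal standalone timing sketch of the hard-voxelization op, assuming mmcv 1.6.2's mmcv.ops.Voxelization and a CUDA tensor. The voxel_size values below are placeholders for whatever the config defines, and the synthetic point cloud only mimics our frame size:

    import time
    import torch
    from mmcv.ops import Voxelization

    # Mirror the pts_voxel_layer settings from the config quoted below;
    # voxel_size here is an assumed placeholder, not FUTR3D's actual value.
    voxel_layer = Voxelization(
        voxel_size=[0.075, 0.075, 0.2],
        point_cloud_range=[-54, -54, -5.0, 54, 54, 3.0],
        max_num_points=10,
        max_voxels=120000)

    # Synthetic frame with ~1M points spread over the detection range.
    points = torch.rand(1_000_000, 5, device='cuda')
    points[:, 0:2] = points[:, 0:2] * 108 - 54  # x, y in [-54, 54)
    points[:, 2] = points[:, 2] * 8 - 5         # z in [-5, 3)

    torch.cuda.synchronize()
    t0 = time.perf_counter()
    voxels, coors, num_points = voxel_layer(points)
    torch.cuda.synchronize()  # the op launches CUDA kernels asynchronously
    print(f'voxelize: {(time.perf_counter() - t0) * 1e3:.1f} ms, '
          f'{voxels.shape[0]} voxels kept')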

Upon further investigation into mmcv-1.6.2/mmcv/ops/voxelize.py, we traced the primary delay to this section of forward_train:

    def forward_train(self, points=None, img_metas=None, gt_bboxes_3d=None,
                      gt_labels_3d=None, gt_labels=None, gt_bboxes=None,
                      img=None, radar=None, proposals=None,
                      gt_bboxes_ignore=None):
        img_feats, pts_feats, radar_feats = self.extract_feat(
            points=points, img=img, radar=radar, img_metas=img_metas)  # <-- and here

        losses = dict()
        losses_pts = self.forward_pts_train(pts_feats, img_feats, radar_feats,
                                            gt_bboxes_3d, gt_labels_3d,
                                            img_metas, gt_bboxes_ignore)
        losses.update(losses_pts)

        return losses
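
To split that time between voxelization and the later encoder stages, one option is to temporarily wrap the extract_feat call in PyTorch's built-in profiler (a debugging-only edit inside forward_train; assumes torch.profiler is available, i.e. PyTorch >= 1.8.1):

    import torch.profiler as profiler

    with profiler.profile(activities=[profiler.ProfilerActivity.CPU,
                                      profiler.ProfilerActivity.CUDA]) as prof:
        img_feats, pts_feats, radar_feats = self.extract_feat(
            points=points, img=img, radar=radar, img_metas=img_metas)
    print(prof.key_averages().table(sort_by='cuda_time_total', row_limit=15))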

After debugging the related variables, we observed the following outputs:

    >>> voxels_out.shape
    torch.Size([120000, 10, 5])
    >>> coors_out.shape
    torch.Size([120000, 3])
    >>> num_points_per_voxel_out.shape
    torch.Size([120000])

Here, 120000 is defined by max_voxels in the configuration file:

model = dict(
    type='FUTR3D',
    aux_weight=0.5,
    pts_voxel_layer=dict(
        max_num_points=10, 
        voxel_size=voxel_size, 
        max_voxels=(120000, 160000),
        point_cloud_range=point_cloud_range),
)

We suspect that some voxels are being discarded because of this limit; a quick way to verify that is sketched below. However, with my limited experience, I am unsure how to optimize this process further. As an initial attempt, I tried cropping the original point cloud to reduce the number of points and shrink the effective detection range (while keeping point_cloud_range = [-54, -54, -5.0, 54, 54, 3.0] unchanged) for training.
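
A cap-free occupancy check (a rough helper of our own, not FUTR3D or mmcv code) can confirm whether the 120000 limit is actually hit, by quantizing points to the voxel grid and counting unique cells:

    import torch

    def count_occupied_voxels(points, voxel_size, pc_range):
        """Count how many voxels a frame would occupy with no max_voxels cap."""
        device = points.device
        vsize = torch.tensor(voxel_size, device=device)
        pc_min = torch.tensor(pc_range[:3], device=device)
        pc_max = torch.tensor(pc_range[3:], device=device)
        # Keep only in-range points, mirroring what the voxel layer does.
        mask = ((points[:, :3] >= pc_min) & (points[:, :3] < pc_max)).all(dim=1)
        coors = ((points[mask, :3] - pc_min) / vsize).long()
        return torch.unique(coors, dim=0).shape[0]

If this returns well above 120000 on our frames, voxels are indeed being truncated; raising max_voxels (e.g. roughly 4x, in line with our point-count ratio versus nuScenes) would avoid the loss, though it would not by itself make the kernel faster.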

Nevertheless, I believe this is not an ideal solution; for reference, the downsampling we tried is sketched after this paragraph. If you have any insights or suggestions on how to accelerate this step, I would greatly appreciate your advice.
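
Concretely, the stopgap looks roughly like the following pipeline fragment. PointsRangeFilter and PointSample are standard mmdetection3d transforms, but whether they drop into FUTR3D's pipeline unchanged is an assumption, and the 400k point budget is an arbitrary choice of ours:

    train_pipeline = [
        # ... loading and augmentation steps unchanged ...
        dict(type='PointsRangeFilter', point_cloud_range=point_cloud_range),
        # Randomly subsample to ~400k points per frame before voxelization
        # (hypothetical budget; tune against accuracy).
        dict(type='PointSample', num_points=400000),
        # ... remaining steps unchanged ...
    ]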

Thank you very much for your time and consideration!