Skip to content
This repository has been archived by the owner on Jan 1, 2025. It is now read-only.

Latest commit

 

History

History
44 lines (38 loc) · 1.72 KB

README.md

File metadata and controls

44 lines (38 loc) · 1.72 KB

KIITI VO scene 6 - frame skip k=3

The videos below show RidgeSfM reconstructions for KITTI VO scene 6 (using every 3rd frame).

We trained a depth prediction network on the KITTI depth prediction training set. We then processed scene 6 from the KITTI Visual Odometry dataset. We used the 'camera 2' image sequences, cropping the input to RGB images of size 1216x320. We used R2D2 as the keypoint detector.

For each scene, we use the reconstructed depth and camera parameters to reproject the pixels to form a point cloud. Each point in the cloud has the form (x,y,z,r,g,b) ∈ ℝ6. To simplify the point-cloud, we use K-Means to extract 1,000,000 centroids.

For each video
Top left: the rendered point cloud.
Top right: The focal-plane trajectory for the predicted camera locations.
Bottom left: The input video. Bottom right: The focal-plane trajectory of the ground truth camera locations.
RidgeSfm - KITTI VO scene 6 frameskip k=3
Using MeshLab to display the point cloud:
KITTI reconstruction