KIITI VO scene 6 - frame skip k=3

The videos below show RidgeSfM reconstructions for KITTI VO scene 6 (using every 3rd frame).

We trained a depth prediction network on the KITTI depth prediction training set. We then processed scene 6 from the KITTI Visual Odometry dataset. We used the 'camera 2' image sequences, cropping the input to RGB images of size 1216x320. We used R2D2 as the keypoint detector.

For each scene, we use the reconstructed depth and camera parameters to reproject the pixels to form a point cloud. Each point in the cloud has the form (x,y,z,r,g,b) ∈ ℝ⁶. To simplify the point-cloud, we use K-Means to extract 1,000,000 centroids.

For each video
Top left: the rendered point cloud.	Top right: The focal-plane trajectory for the predicted camera locations.
Bottom left: The input video.	Bottom right: The focal-plane trajectory of the ground truth camera locations.

Using MeshLab to display the point cloud:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

KIITI VO scene 6 - frame skip k=3

Files

README.md

Latest commit

History

README.md

File metadata and controls

KIITI VO scene 6 - frame skip k=3