Add SelectByIndex and make minor improvements to SelectByMask #5150

yuecideng · 2022-06-06T06:00:51Z

Rename SelectPoints -> SelectByMask in the tensor-based PointCloud.
Remove the tensor clone inside SelectByMask.
Add SelectByIndex (with unit test and pybind) to the tensor-based PointCloud.

update-docs · 2022-06-06T06:00:54Z

Thanks for submitting this pull request! The maintainers of this repository would appreciate it if you could update the CHANGELOG.md based on your changes.

yuecideng · 2022-06-06T10:05:29Z

I also added the legacy Transform and SelectByIndex to the benchmark.
Transform performs well in the tensor-based version. (The legacy version could also be parallelized with OpenMP.)

SelectByIndex is slower in the tensor-based version than in the legacy version (the tensor-based version wraps SelectByMask, formerly SelectPoints).

yxlao

Looks good in general, with some comments. After the boolean type change, the tensor-based version should see some performance gains.

Reviewed 3 of 4 files at r1, 1 of 1 files at r3, all commit messages.
Reviewable status: 4 of 5 files reviewed, 6 unresolved discussions (waiting on @yuecideng and @yxlao)

cpp/open3d/t/geometry/PointCloud.h line 327 at r3 (raw file):

    /// output point cloud.
    ///

Remove blank line.

cpp/open3d/t/geometry/PointCloud.h line 328 at r3 (raw file):

    ///

    /// \param indices Int32 indexing tensor of shape {n,} containing

Int64? Also mention how we handle duplicated indices.

cpp/open3d/t/geometry/PointCloud.h line 332 at r3 (raw file):

    /// \param invert Set to `True` to invert the selection of indices.
    PointCloud SelectByIndex(const core::Tensor &indices,
                             bool invert = false) const;

It might not always be desirable to remove duplicated indices automatically, e.g., people may expect the output point cloud length to be the same as the input point cloud length.

Shall we add an additional parameter called remove_duplicates = false, and by default, we do not remove the duplicates? This can also improve the default performance.
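
To make the proposal concrete, here is a minimal sketch of the two behaviours in plain C++. It uses std::vector stand-ins rather than core::Tensor, and `Point3` and `SelectByIndexSketch` are hypothetical names for illustration, not Open3D API. It also assumes that, when duplicates are removed, the first occurrence of each index is kept in the order given; the actual implementation may order the output differently.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a point; not an Open3D type.
struct Point3 {
    double x, y, z;
};

// Gather points by index.
// - remove_duplicates == false: every index yields one output point, so the
//   output has exactly indices.size() points (duplicates are repeated).
// - remove_duplicates == true: each index is used at most once; the first
//   occurrence wins (an assumption of this sketch).
std::vector<Point3> SelectByIndexSketch(const std::vector<Point3>& points,
                                        const std::vector<int64_t>& indices,
                                        bool remove_duplicates = false) {
    std::vector<Point3> out;
    out.reserve(indices.size());
    std::vector<bool> seen(points.size(), false);
    for (int64_t idx : indices) {
        assert(idx >= 0 && static_cast<std::size_t>(idx) < points.size());
        const std::size_t i = static_cast<std::size_t>(idx);
        if (remove_duplicates) {
            if (seen[i]) continue;  // Skip indices that were already used.
            seen[i] = true;
        }
        out.push_back(points[i]);
    }
    return out;
}
```

With four input points and indices {0, 3, 3, 0}, the default call returns four points, while passing `true` for remove_duplicates returns two. The default path also skips the bookkeeping for seen indices, which is where the performance benefit would come from.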

cpp/open3d/t/geometry/PointCloud.cpp line 256 at r3 (raw file):

                                     bool invert /* = false */) const {
    const int64_t length = GetPointPositions().GetLength();
    core::AssertTensorDtype(indices, core::Dtype::Int64);

Use core::Int64 instead of core::Dtype::xxx. Same for the rest.

cpp/open3d/t/geometry/PointCloud.cpp line 265 at r3 (raw file):

            core::Tensor::Zeros({length}, core::Dtype::Int16, GetDevice());
    mask.SetItem(core::TensorKey::IndexTensor(indices),
                 core::Tensor(std::vector<int16_t>{1}, {}, core::Dtype::Int16,

We need some code changes to allow the Bool dtype.

diff --git a/cpp/open3d/core/kernel/IndexGetSetCPU.cpp b/cpp/open3d/core/kernel/IndexGetSetCPU.cpp
index 329f78091..ca35e3aff 100644
--- a/cpp/open3d/core/kernel/IndexGetSetCPU.cpp
+++ b/cpp/open3d/core/kernel/IndexGetSetCPU.cpp
@@ -71,7 +71,7 @@ void IndexGetCPU(const Tensor& src,
             CPUCopyObjectElementKernel(src, dst, object_byte_size);
         });
     } else {
-        DISPATCH_DTYPE_TO_TEMPLATE(dtype, [&]() {
+        DISPATCH_DTYPE_TO_TEMPLATE_WITH_BOOL(dtype, [&]() {
             LaunchAdvancedIndexerKernel(ai, CPUCopyElementKernel<scalar_t>);
         });
     }
@@ -91,7 +91,7 @@ void IndexSetCPU(const Tensor& src,
             CPUCopyObjectElementKernel(src, dst, object_byte_size);
         });
     } else {
-        DISPATCH_DTYPE_TO_TEMPLATE(dtype, [&]() {
+        DISPATCH_DTYPE_TO_TEMPLATE_WITH_BOOL(dtype, [&]() {
             LaunchAdvancedIndexerKernel(ai, CPUCopyElementKernel<scalar_t>);
         });
     }
diff --git a/cpp/open3d/core/kernel/IndexGetSetCUDA.cu b/cpp/open3d/core/kernel/IndexGetSetCUDA.cu
index ef0de0c1a..547e943ca 100644
--- a/cpp/open3d/core/kernel/IndexGetSetCUDA.cu
+++ b/cpp/open3d/core/kernel/IndexGetSetCUDA.cu
@@ -80,7 +80,7 @@ void IndexGetCUDA(const Tensor& src,
                     CUDACopyObjectElementKernel(src, dst, object_byte_size);
                 });
     } else {
-        DISPATCH_DTYPE_TO_TEMPLATE(dtype, [&]() {
+        DISPATCH_DTYPE_TO_TEMPLATE_WITH_BOOL(dtype, [&]() {
             LaunchAdvancedIndexerKernel(
                     src.GetDevice(), ai,
                     // Need to wrap as extended CUDA lambda function
@@ -108,7 +108,7 @@ void IndexSetCUDA(const Tensor& src,
                     CUDACopyObjectElementKernel(src, dst, object_byte_size);
                 });
     } else {
-        DISPATCH_DTYPE_TO_TEMPLATE(dtype, [&]() {
+        DISPATCH_DTYPE_TO_TEMPLATE_WITH_BOOL(dtype, [&]() {
             LaunchAdvancedIndexerKernel(
                     src.GetDevice(), ai,
                     // Need to wrap as extended CUDA lambda function
diff --git a/cpp/open3d/t/geometry/PointCloud.cpp b/cpp/open3d/t/geometry/PointCloud.cpp
index 96547c01a..04fa4f158 100644
--- a/cpp/open3d/t/geometry/PointCloud.cpp
+++ b/cpp/open3d/t/geometry/PointCloud.cpp
@@ -259,13 +259,11 @@ PointCloud PointCloud::SelectByIndex(const core::Tensor &indices,
     // The indices may have duplicate index value and will result in identity
     // point cloud attributes. We convert indices Tensor into mask Tensor to and
     // call SelectByMask to avoid this situation.
-    core::Tensor mask =
-            core::Tensor::Zeros({length}, core::Dtype::Int16, GetDevice());
+    core::Tensor mask = core::Tensor::Zeros({length}, core::Bool, GetDevice());
     mask.SetItem(core::TensorKey::IndexTensor(indices),
-                 core::Tensor(std::vector<int16_t>{1}, {}, core::Dtype::Int16,
-                              GetDevice()));
+                 core::Tensor::Init<bool>(true, GetDevice()));
 
-    PointCloud pcd = SelectByMask(mask.To(core::Dtype::Bool), invert);
+    PointCloud pcd = SelectByMask(mask, invert);
 
     utility::LogDebug("Pointcloud down sampled from {} points to {} points.",
                       length, pcd.GetPointPositions().GetLength());

After this, we avoid some copies and type conversions, although it is still 2x slower than the legacy version on my machine.
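
For reference, the index-to-mask approach in the patch can be sketched in plain C++ as below. This is only an illustration with std::vector in place of core::Tensor; `IndicesToMask` and `SelectByMaskSketch` are hypothetical helper names, not functions in Open3D.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Build a boolean mask of size `length` with true at every listed index.
// A duplicated index writes the same slot twice, so duplicates collapse.
std::vector<bool> IndicesToMask(const std::vector<int64_t>& indices,
                                std::size_t length) {
    std::vector<bool> mask(length, false);
    for (int64_t idx : indices) {
        assert(idx >= 0 && static_cast<std::size_t>(idx) < length);
        mask[static_cast<std::size_t>(idx)] = true;
    }
    return mask;
}

// Keep the values whose mask entry is true, or false when `invert` is set.
// The output follows the order of `values`, not the order of the indices.
std::vector<double> SelectByMaskSketch(const std::vector<double>& values,
                                       const std::vector<bool>& mask,
                                       bool invert = false) {
    assert(values.size() == mask.size());
    std::vector<double> out;
    for (std::size_t i = 0; i < values.size(); ++i) {
        if (mask[i] != invert) out.push_back(values[i]);
    }
    return out;
}
```

Two properties of this route are worth noting: the mask costs one pass over all points regardless of how few indices are requested, and the result is always deduplicated and sorted by position. Both may contribute to the gap with the legacy version, though that is a guess and has not been measured here.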

cpp/tests/t/geometry/PointCloud.cpp line 707 at r3 (raw file):

                                       {0.2, 0.4, 0.2}},
                                      device));
    const core::Tensor indices = core::Tensor::Init<int64_t>({0, 3}, device);

Add another test case to test duplicated indices, and test the remove_duplicates=xxx flag.

yuecideng

Reviewable status: 4 of 5 files reviewed, 6 unresolved discussions (waiting on @yuecideng and @yxlao)

cpp/open3d/t/geometry/PointCloud.h line 327 at r3 (raw file):