Euclidean distance transform: fix bad choice of block parameters #393
Conversation
Thanks Greg! 🙏
Had a couple of comments below.
def _lcm(a, b):
    return abs(b * (a // math.gcd(a, b)))
NumPy also has an lcm implementation, FWIW.
Okay, I didn't know that. This way is not quite as fast as math.lcm for scalar a, b, but it is still faster than numpy.lcm in that case.
@functools.lru_cache()
def lcm(*args):
Curious what the motivation is for caching here
When using math.lcm directly in Python 3.9+ it is much faster (~150 ns vs. 2.8 µs for this fallback), so the cache was just to avoid that overhead prior to kernel launch. I think in practice it is not very important and can remove it if you prefer.
I figured the (m1, m2, m3) block parameters would usually be the same across calls. This function is not used at all when block_params is left at the default of None, though.
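One hedged sketch of what a cached, variadic lcm helper along these lines could look like; using functools.reduce to fold the pairwise LCM is an assumption for illustration, not necessarily the actual implementation:

```python
import functools
import math

@functools.lru_cache()
def lcm(*args):
    # fold the pairwise LCM over all arguments; the cache means repeated
    # (m1, m2, m3) tuples skip recomputation before kernel launch
    return functools.reduce(lambda a, b: abs(b * (a // math.gcd(a, b))), args)

print(lcm(4, 6, 10))  # 60
```

Note that lru_cache requires the arguments to be hashable, which holds here since the block parameters are plain integers.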
m1, m2, m3 = block_params
if any(p < 1 for p in block_params):
    raise ValueError("(m1, m2, m3) in blockparams must be >= 1")
m1, m2, m3 = map(int, block_params)
Are these allowed to be floats or something? Should we do a type check?
No, they definitely should be integers. I am not sure why I had that explicit map call. I can just remove it.
For non-integer m1, m2, m3 the user will get an error on kernel launch, as CUDA will not accept non-integer block/grid sizes. I think that is fine. In general the recommendation is to use the default block_params=None and the automated m1, m2, m3 that get chosen in that case.
Looks good to me besides John's comments and a minor question.
# m2 must also be a power of two
m2 = 2**math.floor(math.log2(m2))
if padded_size % m2 != 0:
    raise RuntimeError("error in setting default m2")

-# should be <= 64. texture size must be a multiple of m3
+# should be <= 64. image size must be a multiple of m3
Do we need a statement for checking the value is <= 64 somewhere?
I think that was an outdated/stray comment and will just remove it. The check should be relative to block_size, as in the check shortly below that:

if m3 > padded_size // block_size:
    raise ValueError("m3 too large. must be <= arr.shape[1] // 32")

I will update that string to use block_size instead of 32 in case we change block_size in the future.
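A rough, standalone illustration of the two default-parameter checks being discussed; the block_size and padded_size values here are made-up stand-ins for the real variables, not taken from the implementation:

```python
import math

block_size = 32    # stand-in for the implementation's block size
padded_size = 256  # stand-in for the padded image width

# round a candidate m2 down to the nearest power of two
m2 = 2 ** math.floor(math.log2(24))  # 24 -> 16
if padded_size % m2 != 0:
    raise RuntimeError("error in setting default m2")

# m3 must satisfy m3 <= padded_size // block_size
m3 = 4
if m3 > padded_size // block_size:
    raise ValueError(f"m3 too large. must be <= arr.shape[1] // {block_size}")
```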
Thanks for reviewing, I think I have addressed the comments.
In case of user-provided block_params, automatically pad the shape to an appropriate multiple.
closes #392
This PR fixes #392 and also makes it more friendly for use with user-provided block_params. In general, most users should not be providing that argument, but it can be used to compare different settings for performance optimization. In the case of user-provided block_params, the implementation now automatically pads the shape to an appropriate least common multiple of the warp_size and the m1, m2 and m3 block parameters.

More extensive unit tests over a range of image sizes and block_params settings are now implemented.
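The padding behaviour described above can be sketched as follows; warp_size=32 and the helper name pad_size are assumptions for illustration (and math.lcm requires Python 3.9+):

```python
import math

def pad_size(size, warp_size, m1, m2, m3):
    # pad `size` up to the next multiple of lcm(warp_size, m1, m2, m3)
    multiple = math.lcm(warp_size, m1, m2, m3)
    return math.ceil(size / multiple) * multiple

print(pad_size(250, 32, 1, 4, 2))  # 256: next multiple of lcm(32, 1, 4, 2) = 32
```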