Add max threads checking for Metal #5588

xndcn · 2020-12-22T01:47:37Z

Originally, this checking will be asserted by Metal API Validation
in Xcode, otherwise the program will crash or output wrong results.

xndcn · 2020-12-22T01:48:59Z

Xcode will do some checking like this:

steven-johnson · 2020-12-22T02:16:45Z

Please look at (and fix) the clang-tidy errors.

Originally, this checking will be asserted by Metal API Validation in Xcode, otherwise the program will crash or output wrong results.

xndcn · 2020-12-22T02:42:28Z

Please look at (and fix) the clang-tidy errors.

Thanks, fixed.

shoaibkamil · 2020-12-22T15:45:17Z

test/error/metal_threads_too_large.cpp

+    f.realize(output);
+    output.copy_to_host();
+
+    for (int32_t i = 0; i < output.width(); i++) {


Is this loop necessary if we expect an error?

It's not necessary indeed. Leave it just for code integrity...

shoaibkamil · 2020-12-22T15:45:28Z

Thanks for the PR! In principle, I'm not opposed to adding this check, though we don't do similar checks on any other backend. It might be better to only do the check in a debug target.

I'd like to move towards a standard set of checks we do in non-debug and debug targets across all the GPU runtimes. Curious to hear what @zvookin thinks.

shoaibkamil · 2020-12-22T18:02:34Z

test/error/metal_threads_too_large.cpp

+    Var x("x"), y("y");
+
+    f(x, y) = im(x, y) + 42;
+    f.gpu_blocks(y).gpu_threads(x, DeviceAPI::Metal);


What happens if the machine doesn't support Metal? I think this needs a check for whether the JIT target supports Metal, otherwise it should be skipped.

Yeah, even if the machine supports metal, we should skip if the target doesn't support it, eg

if (!get_jit_target_from_environment().has_feature(Target::Metal)) { printf("[SKIP] error/metal_threads_too_large ignored for targets without Metal enabled.\n"); _halide_user_assert(0); }

Thank you, since now it only checks in debug runtime, so I add a specific target, like "metal_vector_too_large" did.

xndcn · 2020-12-23T07:08:23Z

@shoaibkamil Thanks, it's reasonable to only do this check in debug runtime, so I have moved it.

For Cuda and OpenCL backends, they seem to do this checking in library. "cuLuanchKernel" and "clEnqueueNDRangeKernel" both have a return value to indicate this error: "cudaErrorLaunchOutOfResources" and "CL_INVALID_WORK_ITEM_SIZE".

For OpenGL Compute backend, after "glDispatchCompute" is called, "GL_INVALID_VALUE" is generated to indicate this error, too.

For D3D12Compute backend, since the threadgroups are encoded in the shader source, "D3DCompile" will failed when it exceeds the limit.

So it seems the Metal backend is an exception. Not sure what behavior the Vulkan is.

steven-johnson

Clone is green, LGTM

steven-johnson · 2021-01-07T22:46:46Z

@shoaibkamil Do you also want to review before this lands?

Add max threads checking for Metal

63b21dd

Originally, this checking will be asserted by Metal API Validation in Xcode, otherwise the program will crash or output wrong results.

xndcn force-pushed the fix-metal branch from 2fb72ab to 63b21dd Compare December 22, 2020 02:24

shoaibkamil reviewed Dec 22, 2020

View reviewed changes

Disable the max threads checking for Metal in non-debug runtime

025520d

Disable error/metal_threads_too_large test for non-OSX target

b63ad8d

alexreinking added this to the v12.0.0 milestone Jan 2, 2021

Merge branch 'master' into pr/5588

5e6162b

steven-johnson approved these changes Jan 7, 2021

View reviewed changes

Merge branch 'master' into pr/5588

c3fa79c

steven-johnson merged commit 2b3aaa8 into halide:master Jan 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add max threads checking for Metal #5588

Add max threads checking for Metal #5588

xndcn commented Dec 22, 2020

xndcn commented Dec 22, 2020

steven-johnson commented Dec 22, 2020

xndcn commented Dec 22, 2020

shoaibkamil Dec 22, 2020

xndcn Dec 23, 2020

shoaibkamil commented Dec 22, 2020 •

edited

Loading

shoaibkamil Dec 22, 2020

steven-johnson Dec 22, 2020

xndcn Dec 23, 2020

xndcn commented Dec 23, 2020

steven-johnson left a comment

steven-johnson commented Jan 7, 2021

Add max threads checking for Metal #5588

Add max threads checking for Metal #5588

Conversation

xndcn commented Dec 22, 2020

xndcn commented Dec 22, 2020

steven-johnson commented Dec 22, 2020

xndcn commented Dec 22, 2020

shoaibkamil Dec 22, 2020

Choose a reason for hiding this comment

xndcn Dec 23, 2020

Choose a reason for hiding this comment

shoaibkamil commented Dec 22, 2020 • edited Loading

shoaibkamil Dec 22, 2020

Choose a reason for hiding this comment

steven-johnson Dec 22, 2020

Choose a reason for hiding this comment

xndcn Dec 23, 2020

Choose a reason for hiding this comment

xndcn commented Dec 23, 2020

steven-johnson left a comment

Choose a reason for hiding this comment

steven-johnson commented Jan 7, 2021

shoaibkamil commented Dec 22, 2020 •

edited

Loading