_halide_buffer_crop() needs to check for runtime failures (v2) #6403

steven-johnson · 2021-11-09T23:29:27Z

(Alternate to #6402)

We currently assume that _halide_buffer_crop() will never fail. This is a bad assumption: it can call device_crop(), which can fail because of an unexpected runtime error, or because a backend simply leaves the device_crop field at its default (unimplemented) value, as the OpenGLCompute (OGLC) backend currently does.

When this happens, the dst buffer is left in an inconsistent, invalid state (which is what led to the crashes fixed by #6401).

This change modifies _halide_buffer_crop() to return nullptr in the event of an error, and ensures that every cropped buffer is checked for null at the right point. (This is not optimal, of course, since the specific error returned by device_crop is dropped on the floor, but the existence of an error is no longer ignored.)
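For readers who want the shape of the fix without opening the diff, here is a minimal, self-contained sketch of the pattern. It is not the actual Halide runtime code: `Buffer`, `DeviceInterface`, `buffer_crop`, `run_on_crop`, and the error constants are hypothetical stand-ins invented for illustration. Only the idea (a crop helper that returns nullptr when the device crop fails, and a caller that checks for null) comes from the description above.

```cpp
#include <cstdint>

// Hypothetical stand-ins for illustration only; these are NOT the real
// Halide runtime types or signatures.
struct Buffer;

struct DeviceInterface {
    // Returns 0 on success, nonzero on failure.
    int (*device_crop)(const Buffer *src, Buffer *dst);
};

struct Buffer {
    const DeviceInterface *device_interface;  // nullptr for host-only buffers
    uint64_t device;                          // nonzero if a device allocation exists
    int32_t min;
    int32_t extent;
};

constexpr int kErrorUnimplemented = -1;  // hypothetical error code
constexpr int kErrorCropFailed = -2;     // hypothetical generic error code

// What a backend gets if it never fills in device_crop: always fails.
int default_device_crop(const Buffer *, Buffer *) {
    return kErrorUnimplemented;
}

// A backend that does implement device_crop: the crop aliases the
// parent's device allocation.
int working_device_crop(const Buffer *src, Buffer *dst) {
    dst->device = src->device;
    dst->device_interface = src->device_interface;
    return 0;
}

// Returns dst on success, nullptr if the device crop failed.
// The specific error code is dropped, as noted in the description.
Buffer *buffer_crop(Buffer *dst, const Buffer *src, int32_t min, int32_t extent) {
    *dst = *src;
    dst->min = min;
    dst->extent = extent;
    if (src->device_interface != nullptr) {
        // Clear the device fields first so a failed crop never leaves dst
        // pointing at the parent's allocation.
        dst->device = 0;
        dst->device_interface = nullptr;
        if (src->device_interface->device_crop(src, dst) != 0) {
            return nullptr;
        }
    }
    return dst;
}

// Caller side: the crop result is checked before it is used.
int run_on_crop(const Buffer *src) {
    Buffer storage{};
    Buffer *cropped = buffer_crop(&storage, src, 2, 4);
    if (cropped == nullptr) {
        return kErrorCropFailed;
    }
    // ... use *cropped here ...
    return 0;
}
```

With this sketch, a buffer whose interface uses `default_device_crop` makes `run_on_crop` return a nonzero error instead of continuing with a half-initialized crop, while host-only buffers and backends with a working crop succeed.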

This addresses at least some of the failure issues we are seeing in performance_async_gpu with the OpenGLCompute backend.

(Also: drive-by whitespace fix in CodegenC)


steven-johnson · 2021-11-09T23:52:04Z

(For the record, I only noticed this because of the earlier work we did to try to check the results of all runtime calls, internally-called or otherwise...)

steven-johnson requested review from shoaibkamil and abadams November 9, 2021 23:29

steven-johnson mentioned this pull request Nov 9, 2021

_halide_buffer_crop() needs to check for runtime failures #6402

Closed

Oops

92ba5f2

abadams approved these changes Nov 9, 2021


steven-johnson merged commit 9ff87ce into master Nov 11, 2021

steven-johnson deleted the srj/halide_buffer_crop_2 branch November 11, 2021 18:04
