Clarify the casting behavior from floating-point / signed integers <-> unsigned integers #489
Per the last comment from @wchao1115, could this be resolved with a note saying as much?
Could we add something like the following to the definition of cast()? I believe this is correct given the discussion above, but corrections are welcome. Also, we may want to explain more, perhaps in a non-normative note. ... Casting between data types is specified for some cases and implementation-defined in others.
@inexorabletash The table looks both helpful and accurate (with regard to observed known CPU/GPU/NPU behavior) for all 4 cast category directions: float->float, int->int, float->int, int->float.
* Clarify the cast() op behavior between different data types. Inspired by the ONNX documentation [1], outline which inference-time casts are well-defined and which are implementation-defined for out-of-range values. Fixes webmachinelearning#489.
* Add close TR/TH/TD tags.

[1]: https://onnx.ai/onnx/operators/onnx__Cast.html
Opening this issue to track the discussion for the cast operator in PR #478 (raised by @wacky6 in Chromium CL-5056249 review).
Casting fp <-> fp or fp <-> signed int is well understood. But what about the casting behavior for fp / signed int <-> uint?
Would -1.23 fp32 cast to 0 or 255 uint8?
The spec PR says cast is implementation-defined, which isn't ideal. We should at least specify what callers should expect.
@fdwr commented
That's a trickier case for conformance, casting from a wider range into a narrower range, as different hardware gives differing results. On CPU, using SSE vs. the classic FPU could return different results. On GPU, you could get different results depending on whether your GPU supported typed UAVs or native structured UAVs. On NPU, I don't even know yet.
I can say that locally for -1.0f -> uint8, I get 255 on CPU via C++ static_cast<uint8_t>, but I get 0 on my GPU (because -1.0f is mapped to int32_t -1, which then clamps to [0,255] when written out to the typed UAV, since uint8_t is not a natively supported type within HLSL). Then for -1.0f -> uint16_t, it's a similar story: 0xFFFF on CPU but 0 on GPU. Though, if I tried this on a GPU with D3D12_FEATURE_DATA_D3D12_OPTIONS4::Native16BitShaderOpsSupported true, then I might well get 0xFFFF instead. So, if we require an always consistent answer in the spec for negative float -> uint cases, then we'd need some intermediate casts. Surprisingly though, this issue evidently hasn't come up so far in the DML EP. Want to open an issue for it?
@wchao1115 mentioned:
Type casting from floating-point to signed/unsigned integer is a complex process, and one without a clear industry standard, because the cast is both lossy and range-dependent. It is generally understood simply as "undefined" behavior whose outcome may depend on a number of runtime factors, including hardware support. This is part of the reason why no framework to date attempts to define it concretely, leaving part of it implementation-dependent. The same applies to WebNN.
Another open question: if the behavior depends on hardware internals, could it introduce fingerprinting issues?