
ArgMax/Min selectLastIndex is not supported on CoreML #652

Closed
philloooo opened this issue Apr 23, 2024 · 11 comments · Fixed by #722

@philloooo
Contributor

For argmax/min on CoreML: In case of ties, the identity of the return value is not guaranteed.

The current spec seems to indicate that if selectLastIndex is not passed, the first index will be chosen. This can't be satisfied by the CoreML backend.

How much do models actually depend on this? Do we have examples where selectLastIndex is useful?

If not, I'd suggest removing this parameter.

@fdwr
Collaborator

fdwr commented Apr 26, 2024

For element value ties, PyTorch chooses the last index whereas Tensorflow chooses the first index, and because WebNN can be called by any front end, WebNN being able to accommodate either is valuable. It sounds like for CoreML, it either chooses:

  • (a) semiarbitrarily between ties (which would be most unfortunate 😐), or
  • (b) one consistently, but just doesn't state which it is.

Can you experiment to see which happens? e.g. Given input [0, 1, -5, 3, -5, 4] and axis = 0, does the argMin operation pick the -5 at index 2 or the one at index 4?

With that info (assuming it is deterministic, case (b)), it should be possible (less efficiently, but possible) to use a reversal along the active dimension followed by an inversion of the resulting indices. Say CoreML selects index 2 with selectLastIndex = false; then selectLastIndex = true is equivalent to passing the reversed input [4, -5, 3, -5, 1, 0] to get index 1, followed by a sub(5, indices) to map the index back to 4.
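That reverse-then-invert emulation can be sketched in a few lines of NumPy, used here only as a stand-in for a backend that (like case (b)) deterministically returns the first tied index; the function name is hypothetical:

```python
import numpy as np

def argmin_select_last(x: np.ndarray) -> int:
    # Emulate selectLastIndex = true on a backend that breaks ties
    # toward the FIRST index (np.argmin does): reverse along the
    # active axis, take argmin, then map the index back.
    n = len(x)
    idx_in_reversed = int(np.argmin(x[::-1]))
    return (n - 1) - idx_in_reversed

x = np.array([0, 1, -5, 3, -5, 4])
print(int(np.argmin(x)))      # 2: first tied index (selectLastIndex = false)
print(argmin_select_last(x))  # 4: last tied index (selectLastIndex = true)
```

The sub(5, indices) step in the comment above corresponds to the (n - 1) - idx mapping here, with n = 6.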

How much do models actually depend on this?

🤔 That I'm unsure about, but it could make the difference, in a list of equal probabilities, between presenting one label to the user vs another. Granted, floating-point math is imprecise anyway, and even with the same version of the same library you can't expect the exact same labels across different computers, but it would make a bigger difference on integer inputs.

@philloooo
Contributor Author

During my local testing, argmax always returns the smaller index, although I don't think we should rely on that behavior, since the documentation clearly states it's not guaranteed. @mwyrzykowski, do you know if argmax is guaranteed to return the smaller index?

@mwyrzykowski

@philloooo It is not guaranteed and may change

@fdwr
Collaborator

fdwr commented May 2, 2024

@philloooo It is not guaranteed and may change

Shoot, so it sounds like CoreML's tie-breaker behavior is undocumented and nondeterministic. Some options include:

  • Request that the CoreML behavior be codified one way or another. 😉 (or request some future version of CoreML add this tie breaker flag 📅)
  • See if any of the other Apple ML libraries (BNNS, MPS, MLX) support this operator? (I'm really unsure how interoperable each of these libraries is, but I presume CoreML is implemented in terms of these lower-level layers and there is some way to interop between them?)
  • Defeat the nondeterminism of lowest index vs highest index by using more operators. For example, one could use min(argMin(x), sub(axisLength − 1, argMin(reverse(x)))) or max(argMin(x), sub(axisLength − 1, argMin(reverse(x)))) to respectively select the lowest or the highest index. This degree of overhead for an uncommon case warrants a potential third enum value of "don't care": rather than the current boolean choice between select-first-index-upon-ties and select-last-index-upon-ties, you'd also have select-indeterminate-index-upon-ties, which should probably just be the default unless the caller needs one over the other. The quandary here is what happens for a three-way tie: if you have argMin([2,1,1,1,3]), does it return index 1, 2, or 3? If 1 or 3, the above code would work, but if ever 2 (the middle index), then I see no way to even emulate it.
  • Keep the tiebreaker flag so backends which do support it will return desired results, with the caveat that different backends may diverge. :/
  • Remove the tiebreaker flag and say in WebNN all ties are undocumented behavior. 😶

Any others come to mind?
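The min/max combination in the third bullet does cancel out an unknown tie-break direction, provided the backend always lands on either the first or the last tied index (never a middle one). A minimal NumPy sketch, with the two possible backend behaviors simulated and all function names hypothetical:

```python
import numpy as np

def argmin_first(x):
    # Simulated backend that breaks ties toward the FIRST index.
    return int(np.argmin(x))

def argmin_last(x):
    # Simulated backend that breaks ties toward the LAST index.
    return len(x) - 1 - int(np.argmin(x[::-1]))

def argmin_lowest_tie(backend_argmin, x):
    # Combine the result on the input with the index-inverted result on
    # the reversed input. Whichever direction the backend breaks ties,
    # one of (a, b) is the lowest tied index, so min() recovers it;
    # max() would recover the highest tied index instead.
    n = len(x)
    a = backend_argmin(x)
    b = (n - 1) - backend_argmin(x[::-1])
    return min(a, b)

x = np.array([0, 1, -5, 3, -5, 4])
print(argmin_lowest_tie(argmin_first, x))  # 2, same answer either way
print(argmin_lowest_tie(argmin_last, x))   # 2, same answer either way
```

Note this doubles the argMin work plus a reversal per call, which is why a "don't care" default is attractive.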

@philloooo
Contributor Author

Given the feedback from @mwyrzykowski that CoreML won't be able to guarantee deterministic behavior, I think the practical options from @fdwr 's list are:

  • Keep the tiebreaker flag, so backends which do support it will return the desired results, with the caveat that different backends may diverge.
  • Remove the tiebreaker flag and say that in WebNN all ties are implementation-defined.

Neither is great, but the second option seems more consistent to me?

@mwyrzykowski

Neither is great, but the second option seems more consistent to me?

I would also prefer the second option, otherwise we end up with a situation where something may work on one platform and fail on another, which is not great for portability.

@fdwr
Collaborator

fdwr commented Jul 10, 2024

I think the practical options from @fdwr 's list are ...
I would also prefer the second option

Well, notice that even with the second option, we still end up in a situation where something may work on one platform and not work as expected on another: someone may test on Chromium atop XNNPack/TfLite (which chooses the first tied index and may match their expectations) and then test atop Safari via CoreML (which might choose the opposite). It's just that with the second option, we call out that it's "implementation defined" to better set expectations.

However, I agree that of those two options, it's better to remove the flag and document that ties are indeterminate than to have a flag which is ignored because the backend cannot implement it. Additionally, I'm not aware of any models (yet) where it matters, and we could re-add a tie-breaker-first/last enum in the future if we find a case that matters (assuming we also find a way to emulate it in CoreML). So then...?

dictionary MLArgMinMaxOptions {
  sequence<[EnforceRange] unsigned long> axes;
  boolean keepDimensions = false;
- boolean selectLastIndex = false;
+ MLOperandDataType outputDataType; // https://github.com/webmachinelearning/webnn/issues/653
};

Another decision is what the default tie index should be for backends that support both directions like the DML backend:

  1. first index (TensorFlow, and possibly CoreML too albeit undocumented and not guaranteed)
  2. last index (PyTorch) *update - or maybe not anymore, per Ningxin's comment below

The existing DML backend argMin/argMax implementation would need to pick one or the other. 🤔

@philloooo
Contributor Author

Assuming the TFLite backend behaves the same way as TensorFlow, shall we go with 1?

@inexorabletash
Member

The existing DML backend argMin/argMax implementation would need to pick one or the other. 🤔

We could also flip a coin on each call to pick between the options, so that even on a given platform the behavior is indeterminate. I think we've actually done that for some APIs to try to prevent burn-in. But I'm not seriously suggesting it here. 😉

@huningxin
Contributor

@fdwr

2. last index (PyTorch)

If I read it correctly, torch.argmax would return the first index?

NOTE
If there are multiple maximal values then the indices of the first maximal value are returned.

A simple experiment appears to align with that:

>>> a = torch.tensor([1, 2, 5, 3, 4, 5])
>>> torch.argmax(a)
tensor(2)

Same for torch.argmin.

@fdwr
Collaborator

fdwr commented Jul 11, 2024

If I read it correctly, torch.argmax would return the first index?

@huningxin - Interesting. I wonder if behavior changed at some point, given Lara's comment here: onnx/onnx#2461
Well that makes the default for the DML backend clearer.

Aha, PyTorch breaking change: pytorch/pytorch#42004
