Support output tensor in native memory format #173
Comments
Do we have enough info here to close this issue, given what has been discussed in the related thread?
I suppose this is proposed as a v2 feature. I'm fine with closing it for now. Or could we mark it as v2 and treat it as lower priority for v1? @anssiko, any thoughts?
I created a new label "v2" and tagged this issue with it. This allows us to "park" an issue we feel is not making progress toward consensus at this point of development, to be revisited later if/when new information emerges.
"MLTensor" can be useful in the model loader API (it best matches the concepts in backends like TFLite, where input/output are tensors). So to make model loader and WebNN APIs consistent, maybe we should consider using "MLTensor" in WebNN too. Besides, "MLInput", in its current definition (data+dimensions), are exactly tensors. So "MLTensor" is a more accurate name because such a structure is not necessarily an input. |
I think the core problem we're trying to summarize in this issue is how WebNN can support a block format, or a data blob that is optimized toward certain classes of hardware but not publicly defined in a way that can fully interoperate in a web scenario. I don't think it necessarily implies that it's going to be compatible with …
Yeah, I understood MLTensor was proposed for another reason. I just want to point out that "MLTensor" may be a more accurate name than "MLInput" for this structure, which can have other use cases, e.g., in the model loader API.
Seems appropriate to close, but I'd like to check with @huningxin to make sure all relevant information from this issue and proposals linked to it have been ported over to the active MLBuffer discussion.
@huningxin do you mind checking whether we can close this? See also above.
This issue is to track the ongoing discussion in PR #166 for a potential v2 feature.
Requirement
Allow the compiled graph to produce its result in a native (platform-specific) memory format, to accommodate frameworks that use WebNN in an op-by-op calling pattern.
Background
Some native ML APIs, such as oneDNN, use optimized (platform-specific) memory layouts for better vectorization and cache reuse on different platforms. These APIs require not only the constants (such as weights) but also the input and output tensors to be in the optimized memory layout for best performance. So when JS frameworks call WebNN compiled graphs in an op-by-op pattern without accessing the intermediate outputs, the memory conversions for those intermediate outputs are unnecessary and could be avoided if WebNN supported output tensors in the native memory format.
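To make that calling pattern concrete, here is a minimal sketch of how a framework might drive WebNN op by op. The WebNN names (`createContext`, `MLGraphBuilder`, `compute`) follow the API at a high level, but the exact signatures are approximate and have changed across spec revisions; treat this as an illustration, not spec-accurate code.

```ts
// Sketch of the op-by-op pattern (approximate WebNN signatures; see note above).
// Ambient WebNN typings are assumed to be unavailable, so `any` is used for them.
declare const MLGraphBuilder: any;

async function opByOp(input: Float32Array, weights: Float32Array): Promise<Float32Array> {
  const context = await (navigator as any).ml.createContext();

  // Each framework op becomes its own single-node WebNN graph.
  const b1 = new MLGraphBuilder(context);
  const x = b1.input("x", { dataType: "float32", dimensions: [1, 256] });
  const w = b1.constant({ dataType: "float32", dimensions: [256, 256] }, weights);
  const matmulGraph = await b1.build({ y: b1.matmul(x, w) });

  const b2 = new MLGraphBuilder(context);
  const y = b2.input("y", { dataType: "float32", dimensions: [1, 256] });
  const reluGraph = await b2.build({ z: b2.relu(y) });

  // The intermediate result `y` is only ever fed into the next graph, yet it must be
  // materialized in the standard (row-major) layout. A backend such as oneDNN therefore
  // converts out of its optimized (blocked) layout after the first compute and back into
  // it before the second; that round trip is what this issue wants to avoid.
  const intermediate = new Float32Array(256);
  await context.compute(matmulGraph, { x: input }, { y: intermediate });

  const output = new Float32Array(256);
  await context.compute(reluGraph, { y: intermediate }, { z: output });
  return output;
}
```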
Proposals
An MLTensor interface that supports converting the output data layout asynchronously.
MLGraphBuilder.convertToStandardFormat by @wchao1115, which allows native-memory-format graph output as a special case through an optional flag in the graph builder, coupled with an explicit conversion API (see the sketch after this list).
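A hypothetical sketch of how the second proposal might be used, continuing the op-by-op example above. The graph-builder flag name (`nativeOutput`) is made up for illustration; `convertToStandardFormat` is the method name from the proposal, and the compute signatures remain approximate, so none of this should be read as spec text.

```ts
// Hypothetical usage of proposal 2 (illustrative names; not part of the WebNN spec).
declare const MLGraphBuilder: any;

async function opByOpNative(context: any, matmulGraph: any, reluGraph: any,
                            input: Float32Array): Promise<Float32Array> {
  // Both graphs are assumed to have been built with the proposed opt-in flag, e.g.
  //   new MLGraphBuilder(context, { nativeOutput: true })  // flag name is illustrative
  // so their outputs stay in the backend's optimized (e.g. blocked) memory format.

  // No layout round trip: the opaque native-format blob flows straight from one
  // single-op graph into the next.
  const intermediate = await context.compute(matmulGraph, { x: input });
  const nativeResult = await context.compute(reluGraph, { y: intermediate });

  // Only when the framework actually needs to read the values does it pay for a
  // conversion back to the standard (row-major) layout via the explicit API.
  const builder = new MLGraphBuilder(context);
  return await builder.convertToStandardFormat(nativeResult);
}
```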