
OperandType of gemm / matmul return #84

Closed
anssiko opened this issue Aug 27, 2020 · 2 comments

anssiko (Member) commented Aug 27, 2020

[This issue was originally posted at https://github.com/w3c/machine-learning-workshop/issues/86 ]

@kpu wrote:

The spec says gemm returns "an Operand" (and the same thing for matmul).

If both arguments are tensor-quant8-asymm, what is the OperandType of the return? I can see use cases for tensor-int32, which is how it will actually be generated by existing hardware; tensor-quant8-asymm, for a fully quantized model; or even tensor-float32, for people who have only partly quantized their model.

This matters because the spec doesn't appear to have, for example, a requantization operator to convert int32 to int8, and in any case one would need the ability to set the scaling factor, which is typically determined by running the model in advance and measuring an appropriate value.
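
For illustration only, a minimal numpy sketch (not WebNN API code) of the three possible return types described above for a quant8-asymm matmul; all scales, zero points, and shapes here are made-up values:

```python
import numpy as np

a = np.random.randint(0, 255, (2, 3), dtype=np.uint8)   # tensor-quant8-asymm
b = np.random.randint(0, 255, (3, 4), dtype=np.uint8)
a_scale, a_zp = 0.02, 128   # hypothetical quantization parameters
b_scale, b_zp = 0.05, 110

# 1) tensor-int32: the raw accumulator, as existing hardware produces it.
acc = (a.astype(np.int32) - a_zp) @ (b.astype(np.int32) - b_zp)

# 2) tensor-float32: dequantized result, for a partly quantized model.
y_f32 = (acc * (a_scale * b_scale)).astype(np.float32)

# 3) tensor-quant8-asymm: requantized result, which needs an output scale and
#    zero point chosen in advance (e.g. measured on calibration data).
y_scale, y_zp = 0.1, 120
y_u8 = np.clip(np.rint(acc * (a_scale * b_scale / y_scale)) + y_zp,
               0, 255).astype(np.uint8)
```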

kpu commented Sep 25, 2020

Proposal: follow ONNX and define separate operators:

  • MatMul that doesn't allow int8
  • MatMulInteger that does int8 * int8 -> int32
  • QLinearMatMul that does int8 * int8 -> int8 using a rescaling factor

This is consistent with #17.
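
As a rough sketch of the proposed split, the ONNX operator semantics can be emulated with numpy roughly as below (this is not the ONNX reference implementation; zero points and scales are per-tensor for simplicity):

```python
import numpy as np

def matmul_integer(a, a_zero_point, b, b_zero_point):
    # MatMulInteger: int8/uint8 inputs, int32 output, no rescaling.
    return ((a.astype(np.int32) - a_zero_point)
            @ (b.astype(np.int32) - b_zero_point))

def qlinear_matmul(a, a_scale, a_zp, b, b_scale, b_zp, y_scale, y_zp):
    # QLinearMatMul: int8/uint8 inputs and output; the caller supplies the
    # output scale and zero point used for requantization.
    acc = matmul_integer(a, a_zp, b, b_zp)
    q = np.rint(acc * (a_scale * b_scale / y_scale)) + y_zp
    return np.clip(q, 0, 255).astype(np.uint8)
```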

inexorabletash (Member) commented
This seems obsolete or fixed. Can we close, @anssiko?

anssiko closed this as completed Feb 26, 2024