[ETHOSN] Get buffer sizes from the compiled network #12160

lhutton1 · 2022-07-22T08:43:21Z

The NPU support library compiler sometimes adds padding to input tensors which means the buffer sizes calculated at runtime can sometimes be smaller than necessary. Instead, buffer sizes are now collected at compile time and passed to the runtime so that they match the sizes expected by the compiled network. This was seen when running a fully connected operation with an input that is not a multiple of 1024, so testing has been added to cover this case.

Additionally changed the fully connected test case to use pytest parameterization as part of a general cleanup, and fixed the fully connected testing to support output channels > 1.

cc @Leo-arm @manupa-arm @leandron

The NPU support library compiler sometimes adds padding to input tensors which means the buffer sizes calculated at runtime can sometimes be smaller than necessary. Instead, buffer sizes are now collected at compile time and passed to the runtime so that they match the sizes expected by the compiled network. This was seen when running a fully connected operation with an input that is not a multiple of 1024, so testing has been added to cover this case. Additionally changed the fully connected test case to use pytest parameterization as part of a general cleanup, and fixed an issue with specifying a different output shape and weights with more than 1 output channel. Change-Id: Iad319d75326b9ac41950de982603660a084dc27b

lhutton1 · 2022-08-04T10:39:07Z

friendly ping for review

manupak

LGTM!

manupak · 2022-08-04T13:35:33Z

Thanks @lhutton1 @NicolaLancellotti!

The NPU support library compiler sometimes adds padding to input tensors which means the buffer sizes calculated at runtime can sometimes be smaller than necessary. Instead, buffer sizes are now collected at compile time and passed to the runtime so that they match the sizes expected by the compiled network. This was seen when running a fully connected operation with an input that is not a multiple of 1024, so testing has been added to cover this case. Additionally changed the fully connected test case to use pytest parameterization as part of a general cleanup, and fixed an issue with specifying a different output shape and weights with more than 1 output channel. Change-Id: Iad319d75326b9ac41950de982603660a084dc27b

github-actions bot requested review from leandron and manupak July 22, 2022 08:48

lhutton1 force-pushed the fix-buffer-size branch from 76d6411 to 7af2390 Compare August 4, 2022 10:27

lhutton1 force-pushed the fix-buffer-size branch from 7af2390 to 6ccae1d Compare August 4, 2022 10:28

NicolaLancellotti approved these changes Aug 4, 2022

View reviewed changes

manupak approved these changes Aug 4, 2022

View reviewed changes

manupak merged commit 3731a8c into apache:main Aug 4, 2022

lhutton1 deleted the fix-buffer-size branch August 4, 2022 13:35

AndrewZhaoLuo mentioned this pull request Oct 4, 2022

TVM v0.10.0.rc0 Release Candidate Notes #12979

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ETHOSN] Get buffer sizes from the compiled network #12160

[ETHOSN] Get buffer sizes from the compiled network #12160

lhutton1 commented Jul 22, 2022

lhutton1 commented Aug 4, 2022

manupak left a comment

manupak commented Aug 4, 2022

[ETHOSN] Get buffer sizes from the compiled network #12160

[ETHOSN] Get buffer sizes from the compiled network #12160

Conversation

lhutton1 commented Jul 22, 2022

lhutton1 commented Aug 4, 2022

manupak left a comment

Choose a reason for hiding this comment

manupak commented Aug 4, 2022