Forbid inefficient TensorDescriptor initialization #3393

CAHEK7 · 2024-11-17T22:43:26Z

Initializing TensorDescriptor from std::vector<int> is very inefficient: because std::vector<size_t> is used internally, it requires extra checks and multiple intermediate vectors.

Changed all initializations to use the native size_t, removed the constructors taking std::vector<int>, and added workarounds for legacy descriptor initializations that use ints.

This improved performance of the current RNN implementation by a few percent.
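To illustrate the idea in the description, here is a minimal sketch. It assumes a simplified, hypothetical `Descriptor` class and a hypothetical `MakeLegacyDescriptor` helper; neither is the actual `TensorDescriptor` code from this PR. The native `size_t` path moves the lengths straight into storage, the `std::vector<int>` constructor is deleted so such calls fail at compile time, and legacy call sites pay for the checked conversion only where they ask for it explicitly.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <utility>
#include <vector>

// Hypothetical stand-in for a tensor descriptor (an assumption for
// illustration, not the real TensorDescriptor from the PR).
class Descriptor
{
public:
    // Native path: lengths are moved into storage, no conversion needed.
    explicit Descriptor(std::vector<std::size_t> lens) : lens_(std::move(lens)) {}

    // Forbid initialization from std::vector<int> at compile time.
    explicit Descriptor(const std::vector<int>&) = delete;

    const std::vector<std::size_t>& GetLengths() const { return lens_; }

private:
    std::vector<std::size_t> lens_;
};

// Hypothetical workaround for legacy call sites that still hold ints.
// The sign check and the element-wise copy into a second vector are the
// kind of extra work that the native path avoids.
inline Descriptor MakeLegacyDescriptor(const std::vector<int>& lens)
{
    std::vector<std::size_t> converted;
    converted.reserve(lens.size());
    for(int len : lens)
    {
        if(len < 0)
            throw std::invalid_argument("negative tensor length");
        converted.push_back(static_cast<std::size_t>(len));
    }
    return Descriptor(std::move(converted));
}

// Usage sketch: both paths end up storing the same lengths.
inline void Example()
{
    const Descriptor native(std::vector<std::size_t>{2, 3, 4});
    const Descriptor legacy = MakeLegacyDescriptor({2, 3, 4});
    assert(native.GetLengths() == legacy.GetLengths());
}
```

With the deleted overload in place, a call such as `Descriptor(std::vector<int>{2, 3, 4})` no longer compiles, so any remaining int-based call sites surface at build time rather than silently taking the slow path.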

averinevg

LGTM, although there are a couple of minor comments.

src/tensor.cpp

CAHEK7 · 2024-11-21T22:25:57Z

The majority of tests rely heavily on vector<int>.

forbid 'vector<int>' descriptor initialization

b5e4971

CAHEK7 requested review from BrianHarrisonAMD, junliume and BradPepersAMD as code owners November 17, 2024 22:43

CAHEK7 added performance quality complexity_low labels Nov 17, 2024

averinevg reviewed Nov 18, 2024

src/tensor.cpp (two review threads, both resolved)

averinevg approved these changes Nov 18, 2024

CAHEK7 marked this pull request as draft November 21, 2024 22:25
