Fixing and improving indexing type handling #2522

mmigdal-nv · 2023-02-26T14:53:02Z

Fixed issues:

Recompiling kernel if KernelArgumentHolder's indexing mode changes.
Taking into account the output tensors to update indexing mode.
Current indexing type is appended to kernelName() so we can use KernelDb with the key kernel_code_. Currently KernelDb ignores the wrapped code (#defines, runtime library, ...) and relies only on the kernel. Without changing the kernel name we would be getting back the wrong cubins.

Improvements:

Allowing to change tensor indexing mode in KernelArgumentHolder retroactively.
The -1 in collectIndexMode is misleading. In the case of a 1D tensor, having a type that holds the tensor's index is not enough - we need to be able to hold the bound itself (so we can compare index to the bound without overflows).

Changes:

cparams.index_type is not set to DataType::Index so the kernel can be lowered once and we update/set nvfuser_index_t after, as required.

mmigdal-nv · 2023-02-27T02:09:33Z

In the case of matmuls, this happens to fix the cases where:

MNK = 65536, 65536, 128, as the output shape was never taken into account (and overflowed nvfuser_index_t)
Crash when problem launched with small input tensors, followed by large input tensors -(overflow in nvfuser_index_t as we don't recompile even if we compute the right size in that case.
Perf impact of running a large problem (that required 64b indexing), followed by smalls, as we will be running 64b kernels for small problems

csarofeen

Generally this makes sense to me, but I'm concerned about recompilation of the kernel. It doesn't seem good to retrigger non-cached recompilation. With thread recompilation it seemed okay to me since we would just retrigger for high water mark, but if we're going to enable an option to go back from int64 indexing to compile int32 indexing, we should cache both options somehow.

I'm not really sure what we want to do with the caching here. I wonder if it even makes sense to do this on the register side. CCing @naoyam and @jjsjann123 for opinions.

third_party/nvfuser/csrc/executor_kernel_arg.cpp

third_party/nvfuser/csrc/executor.cpp

third_party/nvfuser/csrc/executor.h

third_party/nvfuser/csrc/executor_kernel_arg.cpp

third_party/nvfuser/csrc/executor_kernel_arg.h

third_party/nvfuser/csrc/executor.cpp

naoyam · 2023-03-01T00:42:41Z

As I mentioned to @mmigdal-nv, I think the fix of this PR is sufficient. As long as a fusion is executed through FusionExecutorCache, we should not see back-and-forth recompilations due to index mode changes. The only request I have for @mmigdal-nv is to add a simple C++ test that verifies this behavior. #2522 (comment)

third_party/nvfuser/test/test_gpu3.cpp

naoyam

LGTM. Thanks for the fix and improving the PR.

approved by naoya, and caching is not a problem

This reverts commit 3b85308.

mmigdal-nv force-pushed the rebuild_index_change branch 2 times, most recently from 436a745 to 311563a Compare February 27, 2023 01:37

mmigdal-nv marked this pull request as ready for review February 27, 2023 01:52

mmigdal-nv changed the title ~~Recompiling kernel when nvfuser_index_t changes~~ Fixing and improving indexing type handling Feb 27, 2023

mmigdal-nv force-pushed the rebuild_index_change branch from 311563a to 5a7cfb4 Compare February 27, 2023 02:03

mmigdal-nv force-pushed the rebuild_index_change branch 2 times, most recently from f47a0cc to 87c71a4 Compare February 27, 2023 09:25

mmigdal-nv requested a review from zasdfgbnm February 27, 2023 09:37

mmigdal-nv force-pushed the rebuild_index_change branch from 87c71a4 to c4796f8 Compare February 27, 2023 14:29

Fixing and improving indexing type handling

b09834c

mmigdal-nv force-pushed the rebuild_index_change branch from c4796f8 to b09834c Compare February 27, 2023 14:51

csarofeen previously requested changes Feb 27, 2023

View reviewed changes

third_party/nvfuser/csrc/executor_kernel_arg.cpp Outdated Show resolved Hide resolved

third_party/nvfuser/csrc/executor.cpp Show resolved Hide resolved

naoyam reviewed Mar 1, 2023

View reviewed changes

mmigdal-nv requested review from naoyam, Michoumichmich and csarofeen and removed request for Michoumichmich March 1, 2023 18:18

Addressing reviews

8fd6b0c

mmigdal-nv force-pushed the rebuild_index_change branch from 7438f68 to 8fd6b0c Compare March 1, 2023 18:20

naoyam reviewed Mar 1, 2023

View reviewed changes

third_party/nvfuser/test/test_gpu3.cpp Outdated Show resolved Hide resolved

mmigdal-nv requested review from jjsjann123 and naoyam March 7, 2023 23:57

naoyam approved these changes Mar 8, 2023

View reviewed changes

Addressing reviews and formatting

b7124e5

mmigdal-nv force-pushed the rebuild_index_change branch from abcd5d0 to b7124e5 Compare March 8, 2023 10:45

mmigdal-nv mentioned this pull request Mar 8, 2023

Take internal buffer size into account to decide on indexing type size #2558

Open

mmigdal-nv merged commit 3b85308 into csarofeen:devel Mar 8, 2023

mmigdal-nv deleted the rebuild_index_change branch March 8, 2023 17:16

naoyam added a commit that referenced this pull request Mar 9, 2023

Revert "Fixing and improving indexing type handling (#2522)"

947e353

This reverts commit 3b85308.

naoyam mentioned this pull request Mar 9, 2023

Revert "Fixing and improving indexing type handling" #2568

Merged

naoyam added a commit that referenced this pull request Mar 9, 2023

Revert "Fixing and improving indexing type handling (#2522)" (#2568)

e0c1786

This reverts commit 3b85308.

naoyam mentioned this pull request Mar 10, 2023

Clean up index type handling #2570

Merged

naoyam mentioned this pull request Mar 27, 2023

index type is not computed correctly because output is not considered NVIDIA/Fuser#79

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing and improving indexing type handling #2522

Fixing and improving indexing type handling #2522

mmigdal-nv commented Feb 26, 2023 •

edited

Loading

mmigdal-nv commented Feb 27, 2023

csarofeen left a comment

naoyam commented Mar 1, 2023

naoyam left a comment

Fixing and improving indexing type handling #2522

Fixing and improving indexing type handling #2522

Conversation

mmigdal-nv commented Feb 26, 2023 • edited Loading

mmigdal-nv commented Feb 27, 2023

csarofeen left a comment

Choose a reason for hiding this comment

naoyam commented Mar 1, 2023

naoyam left a comment

Choose a reason for hiding this comment

mmigdal-nv commented Feb 26, 2023 •

edited

Loading