Skip to content

Commit 2434431

Browse files
[Stride] Set up DenseTensorIterator And Support Stride Kernel For Elementwise_Add (#74637)
* add densetensor_iterator * add HIP config * set flag to true * fix stride kernel bug * add strided input test * change flag name and add standard kernel defination * refine * fix codestyle
1 parent 66d7f98 commit 2434431

File tree

8 files changed

+1025
-1
lines changed

8 files changed

+1025
-1
lines changed

paddle/common/flags.cc

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2180,3 +2180,15 @@ PHI_DEFINE_EXPORTED_bool(check_cuda_error,
21802180
PHI_DEFINE_EXPORTED_bool(use_default_stream,
21812181
false,
21822182
"Whether use default stream.");
2183+
2184+
/**
2185+
* Stride_Compute_Kernel related FLAG
2186+
* Name: FLAGS_use_stride_compute_kernel
2187+
* Since Version: 3.2
2188+
* Value Range: bool, default=false
2189+
* Example:
2190+
* Note: Whether use Stride_Compute_Kernel.
2191+
*/
2192+
PHI_DEFINE_EXPORTED_bool(use_stride_compute_kernel,
2193+
false,
2194+
"Whether use Stride_Compute_Kernel.");

paddle/phi/kernels/CMakeLists.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,7 @@ file(
3131
RELATIVE "${CMAKE_CURRENT_SOURCE_DIR}"
3232
"gpudnn/*.cu"
3333
"kps/*.cu"
34+
"stride/*.cu"
3435
"legacy/kps/*.cu"
3536
"legacy/gpu/*.cu"
3637
"selected_rows/gpu/*.cu"

0 commit comments

Comments
 (0)