[mlir] [linalg] Fix bufferize error in tensor.parallel_insert_slice op #98312

cxy-1993 · 2024-07-10T13:07:11Z

tensor.parallel_insert_slice op has implicit inplace behavior. In the "copy-before-write" bufferize mode, the resolveConflict function will generate bufferize.copy, making the result incorrect. This patch fixes this issue.

llvmbot · 2024-07-10T13:07:45Z

@llvm/pr-subscribers-mlir-tensor

@llvm/pr-subscribers-mlir

Author: donald chen (cxy-1993)

Changes

tensor.parallel op has implicit inplace behavior. In the "copy-before-write" bufferize mode, the resolveConflict function will generate bufferize.copy, making the result incorrect. This patch fixes this issue.

Full diff: https://github.com/llvm/llvm-project/pull/98312.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp (+5)
(modified) mlir/test/Dialect/Tensor/bufferize.mlir (+20)

diff --git a/mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp b/mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp
index d078a575f40dd..eabcff33df98a 100644
--- a/mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp
+++ b/mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp
@@ -997,6 +997,11 @@ struct ParallelInsertSliceOpInterface
     rewriter.eraseOp(op);
     return success();
   }
+
+  LogicalResult resolveConflicts(Operation *op, RewriterBase &rewriter,
+                                 const AnalysisState &state) const {
+    return success();
+  }
 };
 
 /// Bufferization of tensor.splat. Bufferizes to a new allocation that is filled
diff --git a/mlir/test/Dialect/Tensor/bufferize.mlir b/mlir/test/Dialect/Tensor/bufferize.mlir
index e85d9e740adf4..3a3c8af15e6e4 100644
--- a/mlir/test/Dialect/Tensor/bufferize.mlir
+++ b/mlir/test/Dialect/Tensor/bufferize.mlir
@@ -626,3 +626,23 @@ func.func @tensor.splat_dynamic(%f: f32, %m: index, %n: index) -> tensor<?x3x?xf
   return %0 : tensor<?x3x?xf32>
 }
 
+// -----
+
+// CHECK-LABEL: func.func @parallel_insert_slice_copy_before_write
+func.func @parallel_insert_slice_copy_before_write(%in: tensor<4xf32>, %out: tensor<4xf32>) {
+  %c1 = arith.constant 1 : index
+  %num_threads = arith.constant 4 : index
+
+  // CHECK: scf.forall {{.*}} {
+  %result = scf.forall (%thread_idx) in (%num_threads) shared_outs (%o = %out) -> tensor<4xf32> {
+      %1 = tensor.extract_slice %in[%thread_idx][1][1] : tensor<4xf32> to tensor<1xf32>
+      scf.forall.in_parallel {
+        // CHECK: memref.subview %{{.*}}[%{{.*}}] [1] [1] : memref<4xf32> to memref<1xf32, strided<[1], offset: ?>>
+        // CHECK: memref.subview %{{.*}}[%{{.*}}] [1] [1] : memref<4xf32> to memref<1xf32, strided<[1], offset: ?>>
+        tensor.parallel_insert_slice %1 into %o[%thread_idx][1][1] :
+          tensor<1xf32> into tensor<4xf32>
+      }
+  }
+  // CHECK: }
+  return
+}

mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp

mlir/test/Dialect/Tensor/bufferize.mlir

need to take a closer look

mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp

tensor.parallel_insert_slice op has implicit inplace behavior. In the "copy-before-write" bufferize mode, the resolveConflict function will generate bufferize.copy making the result incorrect. This patch fixes this issue.

cxy-1993 · 2024-07-11T12:15:36Z

The only chang is add comments after last CI passed. It seems we don't have enough test resources, so we don't wait ci to merge.

llvm#98312) tensor.parallel_insert_slice op has implicit inplace behavior. In the "copy-before-write" bufferize mode, the resolveConflict function will generate bufferize.copy, making the result incorrect. This patch fixes this issue.

cxy-1993 requested a review from matthias-springer July 10, 2024 13:07

cxy-1993 requested review from hanhanW and nicolasvasilache as code owners July 10, 2024 13:07

llvmbot added mlir mlir:tensor labels Jul 10, 2024

matthias-springer previously requested changes Jul 10, 2024

View reviewed changes

mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp Show resolved Hide resolved

mlir/test/Dialect/Tensor/bufferize.mlir Show resolved Hide resolved

cxy-1993 force-pushed the fix-bufferize branch from e865860 to da1243d Compare July 10, 2024 15:26

matthias-springer approved these changes Jul 11, 2024

View reviewed changes

matthias-springer reviewed Jul 11, 2024

View reviewed changes

mlir/lib/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp Show resolved Hide resolved

cxy-1993 force-pushed the fix-bufferize branch from da1243d to 85196de Compare July 11, 2024 09:54

cxy-1993 force-pushed the fix-bufferize branch from 85196de to 871c741 Compare July 11, 2024 09:58

cxy-1993 merged commit d69e949 into llvm:main Jul 11, 2024
5 of 6 checks passed

cxy-1993 deleted the fix-bufferize branch July 11, 2024 12:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[mlir] [linalg] Fix bufferize error in tensor.parallel_insert_slice op #98312

[mlir] [linalg] Fix bufferize error in tensor.parallel_insert_slice op #98312

cxy-1993 commented Jul 10, 2024 •

edited

Loading

llvmbot commented Jul 10, 2024 •

edited

Loading

cxy-1993 commented Jul 11, 2024 •

edited

Loading

[mlir] [linalg] Fix bufferize error in tensor.parallel_insert_slice op #98312

[mlir] [linalg] Fix bufferize error in tensor.parallel_insert_slice op #98312

Conversation

cxy-1993 commented Jul 10, 2024 • edited Loading

llvmbot commented Jul 10, 2024 • edited Loading

cxy-1993 commented Jul 11, 2024 • edited Loading

cxy-1993 commented Jul 10, 2024 •

edited

Loading

llvmbot commented Jul 10, 2024 •

edited

Loading

cxy-1993 commented Jul 11, 2024 •

edited

Loading