
[Tcp] Add boilerplate for TCP dialect #1375

Merged 2 commits into llvm:mlir-tcp on Sep 26, 2022

Conversation

@navahgar (Collaborator) commented Sep 15, 2022

This PR adds the initial boilerplate necessary for the TCP dialect. It includes:

  • a dummy op in TCP.
  • a compilation flag, -DTORCH_MLIR_DIALECTS_ENABLE_TCP, to enable TCP. This is OFF by default.

@sjain-stanford sjain-stanford changed the base branch from main to mlir-tcp September 16, 2022 00:45
@sjain-stanford (Member):

Changed base branch from main to mlir-tcp based on @stellaraccident 's suggestion.

@silvasean (Contributor):

Posting this here for persistence: #1223 -- here is an example of adding the e2e framework for mhlo, we should do something similar for TCP. See rationale here for the critical importance of e2e testing in this space: https://github.com/llvm/torch-mlir/blob/main/docs/architecture.md#why-so-much-end-to-end-testing

@sjarus (Collaborator) commented Sep 16, 2022

nit: I recommend renaming the commit prefix to remove the all caps, which looks a little "shouty". Just [tcp] as a required prefix should be sufficient to identify every push.

@navahgar navahgar changed the title [TORCH-MLIR-DIALECTS][TCP] Add boilerplate for TCP dialect [Tcp] Add boilerplate for TCP dialect Sep 16, 2022
);

let results = (outs
Tcp_Tensor:$out

What is the relationship between operands and results of unary and binary ops? Do we want them to be exactly the same or just compatible, for some definition of "compatible"?

In MHLO, we allow "compatible" operand and result types via isCompatibleReturnTypes and HLO_CompatibleOperandsAndResultType, both of which are implemented via isCompatibleForHloTypeInference, which roughly speaking allows things like:

func @dynamism(%arg0: tensor<?xf32>, %arg1: tensor<1xf32>) {
  %0 = "mhlo.add"(%arg0, %arg0) : (tensor<?xf32>, tensor<?xf32>) -> tensor<?xf32>
  %1 = "mhlo.add"(%arg0, %arg0) : (tensor<?xf32>, tensor<?xf32>) -> tensor<1xf32>
  %2 = "mhlo.add"(%arg0, %arg1) : (tensor<?xf32>, tensor<1xf32>) -> tensor<?xf32>
  %3 = "mhlo.add"(%arg0, %arg1) : (tensor<?xf32>, tensor<1xf32>) -> tensor<1xf32>
  %4 = "mhlo.add"(%arg1, %arg0) : (tensor<1xf32>, tensor<?xf32>) -> tensor<?xf32>
  %5 = "mhlo.add"(%arg1, %arg0) : (tensor<1xf32>, tensor<?xf32>) -> tensor<1xf32>
  %6 = "mhlo.add"(%arg1, %arg1) : (tensor<1xf32>, tensor<1xf32>) -> tensor<?xf32>
  %7 = "mhlo.add"(%arg1, %arg1) : (tensor<1xf32>, tensor<1xf32>) -> tensor<1xf32>
  return
}

I wasn't around when this functionality was added to MHLO, but I've been told that it was motivated by similar functionality in the TF dialect, which itself was motivated by the desire to support progressive shape refinement during --tf-shape-inference. If MHLO didn't support these relaxed compatibility checks, then --tf-shape-inference would need to introduce casts when doing shape refinement, which was deemed undesirable.
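
To make that tradeoff concrete (a hypothetical sketch, not IR from this PR): under exact-match semantics a shape-refinement pass cannot rewrite a result type in place and has to materialize a cast instead, whereas the relaxed compatibility check lets it update the type directly.

// Relaxed compatibility: refinement rewrites the result type in place.
%0 = "mhlo.add"(%arg0, %arg0) : (tensor<?xf32>, tensor<?xf32>) -> tensor<4xf32>

// Exact-match semantics: the op keeps its original result type, and the
// refined type is only visible through an explicit cast.
%1 = "mhlo.add"(%arg0, %arg0) : (tensor<?xf32>, tensor<?xf32>) -> tensor<?xf32>
%2 = tensor.cast %1 : tensor<?xf32> to tensor<4xf32>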

Are we planning to do shape refinement in TCP, or that's expected to be the job of higher-level layers? If it's the former (which I think could be the right choice), then does anyone have a specific shape inference facility in mind that we could use? In the recent ODM, I remember someone said that there are better tools than --tf-shape-inference (which also isn't available in upstream, so we cannot use it anyway in TCP), and I'm eager to learn more.

Contributor:

There are a variety of ways to perform shape inference. Linalg does a very good job despite the "exact match of the static type" requirement by having patterns that push the casts around to a fixed-point, which typically results in them being absorbed somewhere.
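
As an illustration of the cast-pushing approach (hypothetical IR, not from this PR):

// A refinement step materializes a cast in front of a consumer:
%0 = tensor.cast %arg0 : tensor<?x?xf32> to tensor<4x8xf32>
%1 = "some.consumer"(%0) : (tensor<4x8xf32>) -> tensor<4x8xf32>
// Canonicalization patterns then push casts across ops toward a fixed point;
// when a cast's source and result types end up equal, or two adjacent casts
// compose into an identity, the cast folds away entirely.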

I wrote up some more general shape inference thoughts elsewhere (link), though I think a lot of what is described there doesn't apply to TCP/MHLO-like design points.

Collaborator Author:

Thanks for those points. That is very useful to know.

Will address shape inference in a separate PR.

let assemblyFormat = "$in1 `,` $in2 attr-dict `:` type($in1) `,` type($in2) `->` type($out)";
}

def Tcp_MatmulOp : Tcp_Op<"matmul", [NoSideEffect]> {

I think it would be good to implement verification + type inference for this op right away. This will force us to face interesting questions, in addition to the 2D/3D item which is already discussed. E.g.:

  1. Do we allow result element type to be different from operand element type? I think we should, if we want to support quantization well.
  2. If yes, what do we do with type inference? Do we: a) say that matmul doesn't support it, b) add something like preferred_element_type attribute to enable type inference, c) infer result element type equal to operand element type and then work around via isCompatibleReturnTypes?
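
For example, option (b) might look like the following (hypothetical syntax; the attribute name is borrowed from XLA/MHLO's dot ops and is not part of this PR):

// i8 x i8 matmul accumulating into i32; the attribute states the result
// element type explicitly so type inference has something to work from.
%0 = tcp.matmul %lhs, %rhs {preferred_element_type = i32}
       : tensor<8x16xi8>, tensor<16x4xi8> -> tensor<8x4xi32>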

Contributor:

+1,

Also, why not have separate matmul and batch_matmul ops? My experience is that with these "variant" ops it's much easier to isa<BatchMatmulOp> than if (getOperand(0).cast<RankedTensorType>().getRank() == 3)
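
Sketching the contrast with hypothetical op names (nothing here is defined in this PR yet):

// Rank is encoded in the op name, so C++ code can simply isa<BatchMatmulOp>
// instead of inspecting the operand's ranked tensor type at every use site.
%0 = tcp.matmul %a, %b : tensor<8x16xf32>, tensor<16x4xf32> -> tensor<8x4xf32>
%1 = tcp.batch_matmul %c, %d
       : tensor<2x8x16xf32>, tensor<2x16x4xf32> -> tensor<2x8x4xf32>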

@navahgar (Collaborator Author) Sep 22, 2022:

Do we allow result element type to be different from operand element type? I think we should, if we want to support quantization well.

That's right. However, I don't want to make design choices w.r.t. quantization at this point, while that work is still in progress. We plan to have a separate discussion regarding quantization and can address these questions once it is finalized.

Collaborator Author:

why not have separate matmul and batch_matmul ops?

I'm working on a document to summarize the different design choices for Matmul (2 ops vs 1 op, 3D and 2D cases, etc.). So, I'm removing matmul op from this PR. Will send a separate PR for matmul later.

namespace torch {
namespace tcp {

std::unique_ptr<Pass> createConvertTcpToTosaPass();

Collaborator Author:

I tried removing this and it doesn't auto generate for me. Not sure if I'm missing something in cmake files to do this.


Can you check if other passes have this? Maybe you need to do something extra to put it in the torch::tcp namespace?

@navahgar (Collaborator Author) Sep 23, 2022:

I think what you pointed to only generates code that goes into the Passes.h.inc file. For example, that file contains the pass registration, like the following:

inline void registerConvertTcpToTosa() {
  ::mlir::registerPass([]() -> std::unique_ptr<::mlir::Pass> {
    return mlir::torch::tcp::createConvertTcpToTosaPass();
  });
}

I think that is what is being generated here.

Collaborator Author:

@silvasean Do you know if this file could be generated automatically?

Contributor:

createConvertTcpToTosaPass needs to be declared here. It is not autogenerated. See e.g. https://github.com/llvm/torch-mlir/blob/main/include/torch-mlir/Conversion/TorchToArith/TorchToArith.h

@sjain-stanford sjain-stanford changed the base branch from mlir-tcp to main September 22, 2022 06:15
@sjain-stanford sjain-stanford changed the base branch from main to mlir-tcp September 22, 2022 06:15
@sjain-stanford (Member):

In case anyone is wondering, I switched the base branch to main and back (to mlir-tcp) to flush the extraneous commit history that was showing earlier, likely caused by a rebase on main while the base branch was mlir-tcp. When rebasing, we may want to rebase mlir-tcp on main first, then this branch (tcp_1) on mlir-tcp, which should keep the history clean.

@navahgar (Collaborator Author) left a comment:

Thanks for the review. Addressed the comments. PTAL.

I'm moving some of the stuff discussed here to follow up PRs:

  • Linalg lowering
  • e2e tests
  • matmul op
  • shape inference

Thanks for pointers regarding those.

@navahgar navahgar requested review from sanjoy, silvasean, burmako and asaadaldien and removed request for burmako September 22, 2022 23:04
@navahgar navahgar marked this pull request as ready for review September 22, 2022 23:07
@sanjoy left a comment:

Minor nits inline, LGTM otherwise.


@@ -0,0 +1,13 @@
#ifndef TORCH_MLIR_DIALECTS_CONVERSION_PASSES_H
Contributor:

all these files need license headers

Collaborator Author:

Added



namespace mlir {

#define GEN_PASS_DECL_CONVERTTCPTOTOSA
Contributor:

Why do we need this GEN_PASS_DECL_CONVERTTCPTOTOSA? I don't think we do this in the main Torch-MLIR codebase. I think we just have a single PassDetail.h which only need to be included by the .cpp files.

Collaborator Author:

Aah okay. Thanks for pointing that out.


@silvasean GEN_PASS_CLASSES has been recently deprecated, and will be removed in the near future. GEN_PASS_DECL_PASSNAME is the recommended way to approach this moving forward. E.g. see this MLIR-HLO commit for an example.

By the way, functions like createConvertTcpToTosaPass can be autogenerated, but only if you don't specify let constructor in Passes.td, e.g. see this StableHLO PR for an example.

Contributor:

Oh neat. I wasn't aware of that.

Collaborator Author:

By the way, functions like createConvertTcpToTosaPass can be autogenerated, but only if you don't specify let constructor in Passes.td, e.g. see openxla/stablehlo#176 for an example.

Good to know that. Thanks.

@navahgar navahgar force-pushed the tcp_1 branch 2 times, most recently from b3e8a41 to e626771 Compare September 23, 2022 23:41
@navahgar navahgar requested review from silvasean and burmako and removed request for burmako September 23, 2022 23:41
@burmako left a comment:

Reviewed prose and .td files. Sean will undoubtedly have a better perspective on the conventions for .h/CMakeLists.txt files. At a glance, everything checks out.

// Tcp Type Definitions.
//===----------------------------------------------------------------------===//

def Tcp_Scalar : AnyTypeOf<[AnyFloat, AnySignlessInteger, AnyComplex]>;

The spec draft says AnySignlessIntegerOrIndex. I think AnySignlessInteger is a more recent development, so the spec needs to be updated?

Collaborator Author:

Yes, good point. I have updated the spec to reflect this (w/o index for now). When we add ops that need index, we can update it.


def Tcp_Dialect : Dialect {
let name = "tcp";
let cppNamespace = "::mlir::torch::tcp";

::mlir::tcp perhaps, given the plans to be applicable more widely than to PyTorch?

Collaborator Author:

Given that we are bootstrapping TCP under TorchMLIR externals, I assumed it has to use the ::mlir::torch::tcp namespace.

@silvasean Is it okay to use ::mlir::tcp for this instead?

Contributor:

Given the plans to make it more widely applicable, mlir::tcp seems fine.

Collaborator Author:

Thanks for clarifying. Updated it to mlir::tcp.

def Tcp_Tensor : RankedTensorOf<[Tcp_Scalar]>;

//===----------------------------------------------------------------------===//
// Tcp Operator.

"Tcp Operators"?

Collaborator Author:

Updated it to "Tcp Ops Base", which is more appropriate here.

@navahgar navahgar merged commit 05fd4a5 into llvm:mlir-tcp Sep 26, 2022
@burmako commented Sep 27, 2022

Super pumped to see this PR landing!! 🎉 🎉 🎉

qedawkins pushed a commit to nod-ai/torch-mlir that referenced this pull request Oct 3, 2022
navahgar added a commit that referenced this pull request Oct 6, 2022
* Initial boilerplate for TCP with a dummy op.

* Conditional flag to enable TCP
navahgar added commits with the same message that referenced this pull request on Nov 1, Nov 7, Nov 30 (×2), Dec 8, and Dec 12, 2022, and Jan 5, 2023.
7 participants