[PASS] Layout transform pass #233
  return false;
}

inline LayoutInfo GetLayout(const nnvm::OpMap<FTVMLayoutInfo>& layouts,
Could we have a function called `CombineLayout`? (`vector -> vector`)
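The suggestion above could look something like the following sketch. `CombineLayout` and its signature are hypothetical (the reviewer only proposed the name), and `LayoutInfo` is a minimal stand-in for the struct used in this pass:

```cpp
#include <cassert>
#include <string>
#include <vector>

// Minimal stand-in for the pass's LayoutInfo struct.
struct LayoutInfo {
  std::string src;
  std::string dst;
};

// Hypothetical CombineLayout: pair producer output layouts with consumer
// input layouts, emitting one LayoutInfo per edge that actually needs a
// transform (i.e. where the layouts differ).
std::vector<LayoutInfo> CombineLayout(const std::vector<std::string>& produced,
                                      const std::vector<std::string>& consumed) {
  assert(produced.size() == consumed.size());
  std::vector<LayoutInfo> transforms;
  for (size_t i = 0; i < produced.size(); ++i) {
    if (produced[i] != consumed[i]) {
      // Layouts disagree on this edge: record the needed transform.
      transforms.push_back({produced[i], consumed[i]});
    }
  }
  return transforms;
}
```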
LayoutInfo olayout = GetLayout(olayouts, in, e.index);
LayoutInfo ilayout = GetLayout(ilayouts, n, idx);
if (IsPair(olayout, ilayout)) {
  break;
What about other inputs that might need a layout change? Is `break` the right approach here?
Sorry, that should be `continue`.
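The difference the reviewers are pointing at can be shown with a small control-flow sketch (simplified stand-in logic, not the pass's actual code): with `break`, the first input whose layouts already pair up stops the scan entirely, so later inputs are never checked; with `continue`, every input is still visited.

```cpp
#include <cassert>
#include <vector>

// Count how many inputs still need a layout transform. `is_pair[i]`
// stands in for IsPair(olayout, ilayout) on input i.
int CountMismatches(const std::vector<bool>& is_pair) {
  int mismatches = 0;
  for (bool paired : is_pair) {
    if (paired) continue;  // layouts already match: skip this input,
                           // but keep scanning the remaining ones
    ++mismatches;          // this input still needs a transform
  }
  return mismatches;
}
```

Had the loop used `break` instead of `continue`, the two mismatched inputs after the first paired one would be silently skipped.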
if (olayouts.count(e.node->op())) {
  LayoutInfo layout = GetLayout(olayouts, e.node, e.index);
  nnvm::NodePtr tnode =
      CreateLayoutTransformNode(layout.src, layout.dst);
Always assert that the output is transformed back to NCHW.
This logic does not take advantage of the transform cache. Consider changing it so that the output transform node is always eagerly created (from its producer), and each consumer then chooses whether to consume it.
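The caching scheme suggested above could be sketched as follows. All names here (`TransformCache`, `GetOrCreate`) are hypothetical illustrations, and `Node` is a stand-in for `nnvm::Node`; the point is only that a transform node is created once per (producer, output index) and reused by every consumer:

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Stand-in for nnvm::Node.
struct Node { std::string op; };
using NodePtr = std::shared_ptr<Node>;

// Cache key: producer node plus output index, since each output of a
// node may carry its own layout and thus its own transform node.
using Key = std::pair<const Node*, int>;

struct TransformCache {
  std::map<Key, NodePtr> cache;
  int created = 0;  // for illustration: counts actual node creations

  // Return the (single) transform node for this producer output,
  // creating it eagerly on first request and reusing it afterwards.
  NodePtr GetOrCreate(const NodePtr& producer, int index) {
    Key key{producer.get(), index};
    auto it = cache.find(key);
    if (it != cache.end()) return it->second;  // reuse, don't duplicate
    NodePtr tnode = std::make_shared<Node>(Node{"__layout_transform__"});
    ++created;
    cache[key] = tnode;
    return tnode;
  }
};
```

With this shape, two consumers of the same producer output share one transform node instead of each inserting their own.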
tnode->inputs.emplace_back(e);
transformed.emplace(e, nnvm::NodeEntry{tnode, 0, 0});
}
new_node->inputs[idx] = transformed.at(e);
An if/else here might be faster and clearer.
CreateLayoutTransformNode(layout.src, layout.dst);
tnode->inputs.emplace_back(nnvm::NodeEntry{new_node, i, 0});
transformed.emplace(
    nnvm::NodeEntry{n, i, 0}, nnvm::NodeEntry{tnode, 0, 0});
One problem is the version field: if we create the transformed entry eagerly, we have to assume it is 0.
We don't have to cache the `NodeEntry`; only the `Node` is needed. You can still copy the node and copy the entry index over.
One node can have multiple output layouts, so a single node corresponds to multiple transform nodes. I therefore think it should be a mapping from `NodeEntry` to `NodeEntry`.
Never mind, I got it.
* add simplify
* remove simplify in auto complete
It may be useful for some passes to collapse chains of definitions, particularly after other compiler transformations that may reduce or simplify some expressions. This pass will take chains of definitions and replace references to later definitions to the original one. It works by checking `LookupBinding` for each var use-site and replacing the var with its definition if the definition was another var. (Note: This required updating `BlockBuilder` to also update its binding map for `MatchShape` nodes; that was arguably a bug.) Additionally, `MatchShape` bindings where the `LHS` and the `RHS` are guaranteed to match at compile time are canonicalized into ordinary `VarBinding`s.
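The chain-collapsing idea described above can be sketched with stand-in types (the real pass works on Relax vars via `BlockBuilder`'s `LookupBinding`; the `CollapseChain` name and the string-keyed binding map here are illustrative only). A var whose definition is itself another var is followed back until the earliest definition is reached:

```cpp
#include <cassert>
#include <map>
#include <string>

// Stand-in binding map: entries exist only for vars whose definition is
// itself another var (mirroring the "definition was another var" check).
// Follow the chain until we reach a var with no such binding, i.e. the
// original definition.
std::string CollapseChain(const std::map<std::string, std::string>& bindings,
                          std::string var) {
  auto it = bindings.find(var);
  while (it != bindings.end()) {  // definition is another var: follow it
    var = it->second;
    it = bindings.find(var);
  }
  return var;
}
```

So for a chain `b = a; c = b;`, every use of `c` is rewritten to refer to `a` directly.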
The current codegen output `(half4)*(device uint*)A` creates an `int32` value and then casts it to `half4`, which is not the expected behavior. Since Metal supports the `uchar4` and `char4` types, we can use them directly to solve this problem. (cherry picked from commit 6198c7f)
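The bug can be illustrated in plain C++ (not Metal; `uchar4` is redefined here as a stand-in for Metal's builtin): casting a loaded integer to a vector type converts its *value*, whereas the intended behavior is to reinterpret the four packed bytes as four separate lanes.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Stand-in for Metal's builtin uchar4 vector type.
struct uchar4 { uint8_t x, y, z, w; };

// What the buggy codegen effectively does: load one 32-bit integer and
// convert its value. All lane information is lost in one big scalar.
float value_cast(const uint8_t* a) {
  uint32_t v;
  std::memcpy(&v, a, 4);           // single 32-bit load
  return static_cast<float>(v);    // value conversion, not a lane split
}

// What is actually wanted: reinterpret the same four bytes as four
// independent lanes, which a uchar4 pointer load gives directly.
uchar4 reinterpret_lanes(const uint8_t* a) {
  uchar4 v;
  std::memcpy(&v, a, 4);  // same bits, now viewed as four lanes
  return v;
}
```

Switching the generated pointer type to `device uchar4*` gives the lane-wise view without the intermediate integer value.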
* Merge TL Update
* submodule update
* Re-implement macro with sub function.
* lint fix
* Refactor tensor core memory allocation in MatmulFineGrainScheduler
  - Adjusted the local fragment sizes for tensor core memory allocation in the MatmulFineGrainScheduler class.
  - Updated the allocation sizes for A_local, B_local, and C_local variables based on the new fragment sizes.
  - The changes ensure efficient memory utilization and improve performance.
* Refactor tensor core memory allocation in MatmulDequantizeFineGrainedScheduler
  - Modified the fragment sizes for tensor core memory allocation in the MatmulDequantizeFineGrainedScheduler class.
  - Updated the allocation sizes for A_frag, B_frag, and C_frag variables based on the new fragment sizes.
  - The changes optimize memory usage and enhance the efficiency of the dequantization process.
* Refactor tensor core memory allocation in MatmulDequantizeWeightPropagationScheduler
  - Adjusted the fragment sizes for tensor core memory allocation in the MatmulDequantizeWeightPropagationScheduler class.
  - Updated the allocation sizes for A_frag, B_frag, B_dequantize_frag, and C_frag variables based on the new fragment sizes.
  - The changes improve memory utilization and optimize the weight propagation process.
* Implement int4 tensorcore
* lint fix
* support uint2->uint4 fast dequantize
* Support int4 tensorcore decoding
* lint fix