[TE] reverse-mode autodiff without any optimization #5121
Conversation
// This case is relatively difficult because a reduction expression
// may use an arbitrary combiner.
// The resulting reduction expression will return a tuple containing
// both derivatives and the original results (in exactly this order).
Can you switch the order? Most AD code uses the original result first and the derivatives later.
Looking into it a bit more, the order actually makes a difference. When the original init value differs from the derivative init value, and the two depend on each other during the calculation, we must compute the derivative first (using the original's old value); switching the order in TVM causes the original value to be overwritten before it is used, producing incorrect results.
One example is in the test case:
import tvm
from tvm import te
# A0 and k come from the surrounding test: a (10, 10) input and a
# reduction axis of length 10 (consistent with the lowered IR below).
A0 = te.placeholder((10, 10), name='A0')
k = te.reduce_axis((0, 10), name='k')
def fcombine(x, y):
    return x * y
def fidentity(t0):
    return tvm.tir.const(1, t0)
prod = te.comm_reducer(fcombine, fidentity, name='prod')
B = te.compute((10, 10), lambda i, j: prod(A0[i, k] + A0[k, i], axis=k), name='B')
check_grad(B, A0)  # test helper defined in the test file
Correct result (derivative first):
produce B.jacobian {
for (i, 0, 10) {
for (j, 0, 10) {
for (jac_i0, 0, 10) {
for (jac_i1, 0, 10) {
B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = 0f
B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = 1f
for (k, 0, 10) {
B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = ((B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]*(A0[((i*10) + k)] + A0[((k*10) + i)])) + ((float32(((jac_i0 == i) && (jac_i1 == k))) + float32(((jac_i0 == k) && (jac_i1 == i))))*B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]))
B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = (B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]*(A0[((i*10) + k)] + A0[((k*10) + i)]))
}
}
}
}
}
}
Output B.jacobian.v0
Incorrect result (original first):
produce B.jacobian {
for (i, 0, 10) {
for (j, 0, 10) {
for (jac_i0, 0, 10) {
for (jac_i1, 0, 10) {
B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = 1f
B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = 0f
for (k, 0, 10) {
B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = (B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]*(A0[((i*10) + k)] + A0[((k*10) + i)]))
B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)] = ((B.jacobian.v1[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]*(A0[((i*10) + k)] + A0[((k*10) + i)])) + ((float32(((jac_i0 == i) && (jac_i1 == k))) + float32(((jac_i0 == k) && (jac_i1 == i))))*B.jacobian.v0[((((i*1000) + (j*100)) + (jac_i0*10)) + jac_i1)]))
}
}
}
}
}
}
Output B.jacobian.v1
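For illustration, here is a minimal scalar sketch (mine, not from the PR) of the same ordering issue: differentiating prod_k(a[k] + c) with respect to c while carrying the running product and its derivative through the loop, just as the tupled reduction does.

# Hypothetical scalar analog: differentiate prod_k(a[k] + c) w.r.t. c,
# carrying (derivative, product) through the loop like the tuple reduction.
a = [1.0, 2.0, 3.0]
c = 0.5
d, p = 0.0, 1.0              # init values: derivative = 0, product = 1 (fidentity)
for ak in a:
    term = ak + c
    # The derivative must be updated first, using the *old* value of p.
    # Swapping these two assignments overwrites p before it is used,
    # which is exactly the incorrect result shown above.
    d = d * term + 1.0 * p   # d(term)/dc == 1
    p = p * term
# p is the running product, d is its derivative w.r.t. c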
This looks more like a bug in the lowering of tupled reductions than intended behavior; it might deserve a separate bug report.
@tqchen can you take a quick look and see if it is a bug in tuple reductions?
PrimExpr VisitExpr_(const OrNode* op) NOT_IMPLEMENTED

PrimExpr VisitExpr_(const ReduceNode* op) {
  // This case is relatively difficult because a reduction expression
Is it possible to have a reduce inside another expression as well?
That seems a bit difficult. Do you have a concrete example in mind?
There is no concrete example. If you don't think it will happen, then leave it as is.
head : Tensor
    The adjoint of the output, in other words, some tensor, by which the Jacobians
    will be multiplied. Its shape must be of the form `prefix + output.shape`.
    If `None` is passed, the identity tensor of shape `output.shape + output.shape`
So the default behavior is to return a Jacobian instead of an adjoint, right?
That's right. More precisely, that's because the arguments are one output and multiple inputs, rather than one input and multiple outputs. If y is the only output, dy/dx is the Jacobian; it is also adjoint(x) for the previous layer. Which term you use depends on which aspect you want to emphasize.
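To make the distinction concrete, here is a sketch assuming the te.gradient entry point added by this PR, with signature gradient(output, inputs, head=None); the shapes are purely illustrative.

import tvm
from tvm import te

# Sketch only; assumes te.gradient(output, inputs, head=None) as added by this PR.
x = te.placeholder((3, 4), name='x')
w = te.placeholder((5, 4), name='w')
k = te.reduce_axis((0, 4), name='k')
y = te.compute((3, 5), lambda i, j: te.sum(x[i, k] * w[j, k], axis=k), name='y')

# head=None: the identity head is used, so the result is the full Jacobian
# dy/dw with shape y.shape + w.shape = (3, 5, 5, 4).
[dw_jac] = te.gradient(y, [w])

# Explicit head (the adjoint of y coming from a later layer): the result is
# head multiplied by the Jacobian, with shape prefix + w.shape; here
# head.shape == y.shape, so dw_adj has shape (5, 4), the usual adjoint of w.
head = te.placeholder((3, 5), name='head')
[dw_adj] = te.gradient(y, [w], head)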
Thanks for reviving this. The PR looks good to me, but I'm obviously partial.
@MarisaKirisame @tqchen @hzfan Could you review again?
@yzhliu can you do forward-mode automatic differentiation? It is easy given that you have the Jacobian: you only need to compute a Jacobian-vector product instead of a vector-Jacobian product. It is useful for higher-order derivatives, e.g. Hessian-vector products.
(of course, not in this PR)
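For reference, a small NumPy sketch (not from the PR) of the distinction: with a materialized Jacobian, reverse mode computes a vector-Jacobian product, while forward mode computes a Jacobian-vector product.

import numpy as np

# Hypothetical illustration: J is the Jacobian of a map from R^4 to R^3.
J = np.arange(12.0).reshape(3, 4)
v_out = np.ones(3)      # adjoint ("head") flowing backwards
v_in = np.ones(4)       # tangent vector flowing forwards
vjp = v_out @ J         # reverse mode: shape (4,), gradient w.r.t. the input
jvp = J @ v_in          # forward mode: shape (3,), directional derivative
print(vjp.shape, jvp.shape)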
Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
@MarisaKirisame sure, I will try.
Kind ping @tqchen: can we merge if it looks good?
* [TE] reverse-mode autodiff without any optimization
* address review comments
* add comments and retrigger CI
* move unittest to debug ci
* move test back and add seed

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
This is the first PR to bring in previously-implemented tensor-level autodiff.
This PR does not include any optimizations, so it produces poor performance. I will submit the optimization passes in another two or three PRs, so as not to put too much pressure on reviewers.
Credit also goes to @sgrechanik-h, as noted in the header of each file.
RFC: https://discuss.tvm.ai/t/rfc-bring-in-tensor-expression-autodiff
Please help review: @sgrechanik-h @MarisaKirisame @junrushao1994 @tqchen @hzfan
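For context, a minimal end-to-end sketch of the intended usage (flow assumed from the RFC and the te.gradient entry point introduced here; not a definitive recipe), showing that the adjoint lowers but, without the follow-up optimization passes, the IR is naive and slow:

import tvm
from tvm import te

# Sketch under the assumption that te.gradient is the entry point added here.
A = te.placeholder((32, 64), name='A')
k = te.reduce_axis((0, 64), name='k')
Y = te.compute((32,), lambda i: te.sum(A[i, k] * A[i, k], axis=k), name='Y')
head = te.placeholder((32,), name='head')   # adjoint of Y from a downstream op

[dA] = te.gradient(Y, [A], head)            # reverse-mode adjoint of A
s = te.create_schedule(dA.op)
# Without the optimization passes planned for follow-up PRs, the lowered IR
# is unoptimized, which is the "poor performance" noted above.
print(tvm.lower(s, [A, head, dA], simple_mode=True))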