-
Notifications
You must be signed in to change notification settings - Fork 58
[Tracking Issue] Relax Training M0 #413
Comments
Discussion from the Feb. 7, 2023 community meeting after this work was presented:
|
I would be curious to hear about the plans for further developing AD in Relax. We should be able to support all of the language's features by building up the "tape" within Relax (this could be accomplished through We could consider using the approach in Relay's general-purpose AD pass, where the "tape" is built up using a closure: https://github.com/apache/tvm/blob/main/src/relay/transforms/higher_order_gradient.cc (I think we should aim to make AD as general as we can, as this would allow us to target all manner of diverse models for training). Edit: One issue with using |
There has been increased interest from the community in using TVM for training. Relax, the next generation graph level IR of TVM, also faces the demand of training model.
We are building a training workflow on Relax, including:
The training APIs can serve many needs. You will be able to:
This work is mainly done by @SiriusNEO and @Ubospica, with the help from @tqchen @junrushao @MasterJH5574 @Hzfengsy @spectrometerHBH et al.
Further introduction of our work:
A jupyter notebook tutorial of the training APIs can be found here.
Detailed explanation of the AD pass, and its limitations can be found here.
Currently a large part of our work has been merged into the mlc repo. Now our work is tracked at this issue.
The APIs are still changing. We will update the tutorial within a period of time after the API is modified.
The text was updated successfully, but these errors were encountered: