
[RELAY][BACKEND] Enable PlanMemory in the graph runtime. #2120

Merged 4 commits on Nov 19, 2018

Conversation

@tqchen (Member) commented Nov 15, 2018

This PR implements PlanMemory for the graph runtime codegen backend of Relay. It also contains a few other improvements:

  • add a show_meta_data option to astext
    • helpful when the printed text would otherwise contain huge meta-data

The algorithm is basically the same as in NNVM. We do need to introduce a storage token, plus an initialization phase that propagates and calculates the expected reference count of each token, before we run the greedy allocation algorithm.
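For readers unfamiliar with the scheme, the two phases described above can be sketched as a toy standalone example. All names here (StorageToken fields included, and the GreedyPlanner class itself) are illustrative, not the actual Relay code: first, each consumer of a value bumps its token's expected reference count; then a greedy pass reuses released storage whenever a large-enough free token exists.

```cpp
#include <cassert>
#include <cstddef>
#include <list>
#include <vector>

// Illustrative storage token, loosely modeled on the PR description.
struct StorageToken {
  int ref_counter{0};   // expected remaining consumers of this value
  size_t bytes{0};      // size of the backing storage
  int storage_id{-1};   // identity of the underlying buffer
};

// Toy greedy planner: Request() prefers reusing a released token whose
// storage is large enough; CheckRelease() returns storage to the free
// list once the reference count drops to zero.
class GreedyPlanner {
 public:
  ~GreedyPlanner() {
    for (StorageToken* tok : all_) delete tok;
  }

  StorageToken* Request(size_t bytes, int refs) {
    for (auto it = free_.begin(); it != free_.end(); ++it) {
      if ((*it)->bytes >= bytes) {  // reuse an already-allocated buffer
        StorageToken* tok = *it;
        free_.erase(it);
        tok->ref_counter = refs;
        return tok;
      }
    }
    StorageToken* tok = new StorageToken{refs, bytes, next_id_};
    ++next_id_;
    all_.push_back(tok);
    return tok;
  }

  void CheckRelease(StorageToken* tok) {
    if (--tok->ref_counter == 0) free_.push_back(tok);
  }

  int num_storages() const { return next_id_; }

 private:
  std::list<StorageToken*> free_;   // released tokens available for reuse
  std::vector<StorageToken*> all_;  // owns every token ever created
  int next_id_{0};
};
```

For a chain a → b → c where a is dead once b is computed, this planner serves c from a's buffer, so three tensors share only two storages.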

@ajtulloch (Contributor):
@tqchen quick question - are there plans to allow dynamic memory allocation in the graph runtime, which would allow variable shapes? I believe that's not currently supported, but was curious if you had plans there.

@jroesch (Member) commented Nov 15, 2018

@ajtulloch yes, I think the plan is to write a new runtime system soon ™️. A few of us are working on a PLDI submission, and expect to ship a bunch of improvements/fixes/features post deadline.

@jroesch (Member) commented Nov 15, 2018

Overall looks good to me; I'm a bit tired from the PLDI push, so maybe someone else should do a pass.

@zhiics (Member) commented Nov 15, 2018

@tqchen Quite busy recently, but I will try my best to spend some time on a review pass tonight if it is not too late.

@tqchen (Member, Author) commented Nov 15, 2018

As we move into NNVMv2 (Relay), we have a clear separation of compiler and runtime. The migration is a two-phase process; we are in the first phase, which moves the compiler but keeps the old graph runtime.

I think we can expect the static graph runtime to exist for a while, but we can also explore new backends that break different assumptions (e.g. dynamic memory allocation, control flow). Luckily, the IR is expressive enough to represent all these workloads.

There is also a tradeoff here, depending on whether we want to allow JIT, how big the runtime is, etc., so I can imagine we might end up building several of them. @ajtulloch I think it is a good time to hear opinions from everyone on what we need.

@ajtulloch (Contributor) commented Nov 15, 2018

@tqchen, @jroesch that sounds great. Is there an existing RFC you'd like us to contribute feature suggestions to?

@tqchen (Member, Author) commented Nov 15, 2018

There is no existing RFC; how about we open a new one?

@tqchen (Member, Author) commented Nov 16, 2018

Opened in #2122.

@jroesch (Member) commented Nov 16, 2018

@ajtulloch An RFC seems like a great idea; I look forward to figuring out what everyone is interested in and what people are looking to do.

@zhiics (Member) commented Nov 16, 2018

@tqchen I only have some nit comments. Overall LGTM.

}

void VisitExpr_(const TupleNode* op) final {
// Do nothing.
Review comment (Member):

remove the comment?

std::unordered_map<const ExprNode*, std::vector<StorageToken*> > token_map_;

/*!
* \brief call get token to get the necessary token.
Review comment (Member):

Call token

}
// create token for the call node.
CreateToken(op, true);
// check if there is orphaned output that can be released immediately/
Review comment (Member):

immediately.

struct StorageToken {
/*! \brief Reference counter */
int ref_counter{0};
/*! \brief numbe of bytes */
Review comment (Member):

typo number

}

void VisitExpr_(const OpNode* op) final {
// Do nothing.
Review comment (Member):

Just trying to learn: what is the default behavior if such a function is not defined?

Reply (Member, Author):

By default, it will recursively visit, which is fine; the override just makes the intent explicit.
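The behavior discussed in this exchange can be illustrated with a minimal standalone visitor sketch (this is not TVM's actual ExprVisitor; the Node, Visitor, and SkipOps names are invented for illustration): the base case recurses into children by default, and a subclass overrides one case with an early return to make "do nothing" explicit.

```cpp
#include <cassert>
#include <vector>

// Toy expression node: just a list of children plus an "is this an op" flag.
// Names are illustrative, not TVM's IR.
struct Node {
  std::vector<Node*> children;
  bool is_op{false};
};

class Visitor {
 public:
  virtual ~Visitor() = default;
  // Default behavior: count this node, then recursively visit every child.
  // The recursive call dispatches virtually, so subclass overrides apply.
  virtual void Visit(Node* n) {
    ++visited;
    for (Node* c : n->children) Visit(c);
  }
  int visited{0};
};

class SkipOps : public Visitor {
 public:
  void Visit(Node* n) override {
    if (n->is_op) return;  // explicit no-op, like the empty VisitExpr_
    Visitor::Visit(n);
  }
};
```

Without the override, the op node is visited like any other; with it, the traversal skips op nodes, and the empty body documents that this is intentional rather than an oversight.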

<< ttype->shape;
size *= static_cast<size_t>(pval[0]);
}
size *= (ttype->dtype.bits() * ttype->dtype.lanes() + 7) / 8;
Review comment (Member):

add comments for magic number 7 & 8?

Reply (Contributor):

IMO this should be refactored into a round_up/div_round_up function.

Reply (@zhiics, Nov 18, 2018):

+1, it might be necessary to have an "alignment" function which takes byte, word, or dword, etc.
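For context on this thread: the 7 and 8 in (bits * lanes + 7) / 8 round a bit count up to whole bytes (7 being 8 - 1). A sketch of the helpers the reviewers propose (DivRoundUp, AlignUp, and BitsToBytes are hypothetical names for illustration, not existing TVM functions):

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: integer division rounding up instead of down.
inline size_t DivRoundUp(size_t value, size_t divisor) {
  return (value + divisor - 1) / divisor;
}

// Hypothetical helper: align `bytes` up to a multiple of `alignment`
// (byte, word, dword, ...), as the reviewer suggests.
inline size_t AlignUp(size_t bytes, size_t alignment) {
  return DivRoundUp(bytes, alignment) * alignment;
}

// The original expression (ttype->dtype.bits() * ttype->dtype.lanes() + 7) / 8
// then reads as: round the total bit width up to whole bytes.
inline size_t BitsToBytes(size_t bits, size_t lanes) {
  return DivRoundUp(bits * lanes, 8);
}
```

Naming the operation makes the intent self-documenting and removes the magic numbers from the allocation-size computation.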

@tqchen (Member, Author) commented Nov 19, 2018

Thanks, @ajtulloch @yzhliu @zhiics, I have addressed the comments.

@tqchen tqchen merged commit c113712 into apache:master Nov 19, 2018
@tqchen (Member, Author) commented Nov 19, 2018

Thanks @ajtulloch @yzhliu @zhiics, this is merged.

@tqchen tqchen deleted the relay branch December 15, 2018 02:12
FrozenGene pushed a commit to FrozenGene/tvm that referenced this pull request Dec 27, 2018
@ZihengJiang ZihengJiang mentioned this pull request Feb 1, 2019
wweic pushed a commit to neo-ai/tvm that referenced this pull request Feb 20, 2019

5 participants