[Relay][Text Format] Text Printer Refactor and Debug Printing #2605

joshpoll · 2019-02-15T22:08:33Z

Replaces existing text printer with one based on Section 1 of "A prettier printer" by Philip Wadler; however, the printer can be improved by adopting a version later in the paper.
Adds stream-like API for the DocAtom ADT.
Adds debug printing "pass" that includes more traditional pretty printing style.
Adds FunctionType support.

How the printer works

The pretty printer design works roughly as follows. Instead of creating a string directly, you create a Doc that consists of a vector of DocAtoms. The purpose of Doc is to separate the text's meaning (e.g. specifying where new lines and indentation should go) from its actual layout. For example, depending on the length of a line, you may want to print

f(a, b, c, d)

or

f(a,
  b,
  c,
  d)

This is easy to do with an abstract doc model. We can encode choices in our Doc specification and then let an algorithm choose the optimal layout later.

The current DocAtoms are Text and Line. Text stores a piece of text with no line breaks, and Line stores a line break and the amount of indentation directly after it. Unless you are working with the API, you should not need to think about DocAtoms, with one exception: add new lines to a Doc separately from text. Otherwise the line break and the text are treated as a single Text atom instead of a Line atom followed by a Text atom.

The main way to build a Doc is via the stream-like interface exposed by the following Doc constructor and member functions:

explicit Doc(const std::string& str)
Doc& operator<<(const Doc& right)
Doc& operator<<(const std::string& right)
template<typename T>
Doc& operator<<(const T& right)
std::string str()

The remaining functions you need to think about are:

friend Doc Indent(int indent, const Doc& doc): This function returns a new Doc that is like doc except all the Lines inside it have had their indentation extended by indent.
Doc PrintVec(const std::vector<Doc>& vec, const Doc& sep = Doc(", ")): This intersperses the sep between every doc in vec.
various specific Print functions
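To make the description above concrete, here is a minimal, self-contained sketch of the Doc model. It is not the code in this PR: the DocAtom layout, the Line() helper, and the ExampleCall() function are assumptions made for illustration, and the templated operator<< is omitted. Like the Section 1 printer, it only renders the layout it is given; it does not choose between alternative layouts.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

namespace doc_sketch {

// One atom: either a run of text with no line breaks, or a line break
// followed by `indent` spaces. (Hypothetical layout, for illustration.)
struct DocAtom {
  bool is_line;
  std::string text;  // used when !is_line
  int indent;        // used when is_line
};

class Doc {
 public:
  Doc() = default;
  explicit Doc(const std::string& str) { stream_.push_back({false, str, 0}); }

  // Append another Doc's atoms. Copying first keeps `d << d` well defined.
  Doc& operator<<(const Doc& right) {
    std::vector<DocAtom> copy = right.stream_;
    stream_.insert(stream_.end(), copy.begin(), copy.end());
    return *this;
  }
  Doc& operator<<(const std::string& right) { return *this << Doc(right); }

  // Render the atoms in order.
  std::string str() const {
    std::string out;
    for (const DocAtom& a : stream_) {
      if (a.is_line) {
        out += "\n" + std::string(static_cast<std::size_t>(a.indent), ' ');
      } else {
        out += a.text;
      }
    }
    return out;
  }

  friend Doc Indent(int indent, const Doc& doc);
  friend Doc Line();

 private:
  std::vector<DocAtom> stream_;
};

// Hypothetical helper: a Doc holding a single line break with no indentation.
Doc Line() {
  Doc d;
  d.stream_.push_back({true, "", 0});
  return d;
}

// Returns a copy of `doc` whose Line atoms are indented by `indent` more.
Doc Indent(int indent, const Doc& doc) {
  Doc ret = doc;
  for (DocAtom& a : ret.stream_) {
    if (a.is_line) a.indent += indent;
  }
  return ret;
}

// Intersperses `sep` between every doc in `vec`.
Doc PrintVec(const std::vector<Doc>& vec, const Doc& sep = Doc(", ")) {
  Doc ret;
  for (std::size_t i = 0; i < vec.size(); ++i) {
    if (i != 0) ret << sep;
    ret << vec[i];
  }
  return ret;
}

// Builds the broken layout of a two-argument call.
std::string ExampleCall() {
  Doc sep(",");
  sep << Line();  // the line break is added separately from the text
  Doc call("f(");
  call << Indent(2, PrintVec({Doc("a"), Doc("b")}, sep)) << ")";
  return call.str();
}

}  // namespace doc_sketch
```

In this sketch, PrintVec with the default separator yields the flat form (a, b), while a separator that ends in a Line atom, wrapped in Indent, yields one argument per line.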

merrymercy · 2019-02-18T16:06:03Z

Can this finally support attributes of operators (e.g. stride, padding) whose types are tvm::Expr, tvm::Array or string? They are not Relay expressions.

joshpoll · 2019-02-21T19:45:23Z

@merrymercy This PR is intentionally light on new printer functionality and focuses primarily on infrastructure. Once we get the scaffolding down it will be easier to work on the parser and printer. I'm a bit short on time right now so I can't promise that I will be able to add attribute printing, but my hope is that the Doc ADT is easy enough to use that people can extend the printer relatively easily.

merrymercy · 2019-02-22T05:28:31Z

Thanks. Currently, I cannot use the text format because I cannot write attributes in it, although I can hack it to support some constants.

tqchen · 2019-02-23T02:59:42Z

One note: move include/relay/doc.h into src/relay/ir, just to keep it internal and minimal.

MarisaKirisame · 2019-03-01T05:31:03Z

src/relay/ir/pretty_printer.cc

@@ -46,8 +46,7 @@ class PrettyPrinter :
    Doc PrintFinal(const NodeRef& node) {
      // TODO(@jmp): If these lines are combined it segfaults??


Fix the TODO.
doc_stack_.back() returns a reference, which gets invalidated because Print might push_back into doc_stack_.
Using a linked list might also fix the problem.
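The hazard described in this comment can be sketched as follows. This is an illustration under assumptions, not the PR's code: the Printer class, its string-based stack, and the PrintInto, Top, and Depth members are hypothetical stand-ins. The underlying rule is standard: a push_back that reallocates a std::vector invalidates every reference to its elements, including one obtained earlier from back().

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

namespace stack_sketch {

class Printer {
 public:
  Printer() : doc_stack_(1) {}  // start with one empty output slot

  // Safe shape: finish every call that may push before touching back().
  void PrintInto(int depth) {
    std::string body = Print(depth);  // may push_back and reallocate
    doc_stack_.back() += body;        // the reference is taken only now
  }

  // Fragile shape, left as a comment on purpose:
  //   std::string& top = doc_stack_.back();
  //   top += Print(depth);  // `top` may dangle once Print reallocates

  const std::string& Top() const { return doc_stack_.back(); }
  std::size_t Depth() const { return doc_stack_.size(); }

 private:
  // Hypothetical stand-in for Print: it grows the stack while it runs.
  std::string Print(int depth) {
    doc_stack_.push_back("");
    std::string body;
    if (depth == 0) {
      body = "leaf";
    } else {
      body = "(" + Print(depth - 1) + ")";
    }
    doc_stack_.pop_back();
    return body;
  }

  std::vector<std::string> doc_stack_;
};

}  // namespace stack_sketch
```

Keeping the two steps as separate statements makes the order explicit. A node-based container such as std::list keeps references to existing elements valid across insertions, which is why a linked list would also avoid the problem.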

joshpoll · 2019-03-07T23:25:57Z

This PR is close to feature complete. cc @wweic @merrymercy @MarisaKirisame @tqchen @jroesch.

tqchen · 2019-03-08T02:04:31Z

Is it possible to simply write test cases for now using the test infra, without introducing a hypothesis dependency in the CI? You can still use some of that to test locally and find failure cases to add to the CI.
I have seen ";" used as a separator. Can we remove the reliance on ";" to make the syntax look a bit more like Python?
The use of %var[index] vs. %var.index seems not to have been discussed, and perhaps needs some discussion.
The gnf flag of the printer should be removed, as the printer should support printing mixed GNF and ANF code automatically.
- We want one canonical form of the text format, and we should always print that by default. Adding additional flags creates unnecessary mental burden for the user and can also blow up the interface.

MarisaKirisame · 2019-03-08T17:25:40Z

src/relay/ir/pretty_printer.cc

+  }
+
+  // indent a new body
+  // TODO(jmp): indent should be an instance variable of the printer


Can you fix this? It is little work.

MarisaKirisame · 2019-03-08T17:26:14Z

src/relay/ir/pretty_printer.cc

+    * \param prefix The prefix of the name
+    * \return The returned name.
+    */
+  Doc GetUniqueName(std::string prefix) {


MarisaKirisame · 2019-03-08T17:27:56Z

tests/python/relay/test_ir_parser_roundtrip.py

+
+if __name__ == "__main__":
+    # for _ in range(10):
+    #     print(anf_print(exprs.example()))


uncomment or remove

MarisaKirisame · 2019-03-08T17:28:18Z

tests/python/relay/test_ir_parser_roundtrip.py

+    one = relay.const(1)
+    assert gnf_print(relay.Tuple([one, one])) == "v0.0.1\n(1, 1)\n"
+
+    # assert gnf_print(relay.If(relay.const(True), relay.TupleGetItem(relay.Tuple([one, one]), 0), relay.TupleGetItem(relay.Tuple([one, one, relay.const(1)]), 0))) == "v0.0.1\n%0 = True\nif (%0) {\n  %1 = 1\n  %2 = (%1, %1)\n  %2.0\n} else {\n  %1 = 1\n  %2 = 1\n  %3 = (%1, %1, %2)\n  %3.0\n}\n"


uncomment or remove

joshpoll · 2019-03-09T02:15:53Z

@tqchen

> Is it possible to simply write test cases for now using the test infra, without introducing a hypothesis dependency in the CI? You can still use some of that to test locally and find failure cases to add to the CI.

I will remove hypothesis for now, but it should be added back eventually.

> I have seen ";" used as a separator. Can we remove the reliance on ";" to make the syntax look a bit more like Python?

This is out of the scope of this PR. I think it should go in an RFC.

> The use of %var[index] vs. %var.index seems not to have been discussed, and perhaps needs some discussion.

This is fair. There should be a new RFC soon for features that have not been implemented in the parser yet, including tuple projection. I was maintaining the convention in the commented out parts of the parser and in the old text printer.

> The gnf flag of the printer should be removed, as the printer should support printing mixed GNF and ANF code automatically.
> We want one canonical form of the text format, and we should always print that by default. Adding additional flags creates unnecessary mental burden for the user and can also blow up the interface.

With the gnf flag enabled (this is the default in both Python and C++), the new printer behaves as the old text printer did. The opposite of gnf printing isn't anf, since let nodes are printable with gnf enabled. Rather, turning gnf off prevents the creation of temporary variables, which is the style used for anf components of gnf programs. I've received feedback from some developers that this is desirable, and it's useful to bridge the gap between ML and PL. Those who aren't interested in this format can ignore it entirely.

Also I disagree that additional flags are harmful. As long as they're optional they do not create unnecessary mental burden. Moreover, it is common for printers to have many flags:

tqchen · 2019-03-09T03:29:28Z

It might be fine to support a large number of options in a developer-private API, but it is not a good idea to do so in a user-facing API.

I used to be a big fan of lists of thousands of arguments (hey, who doesn't want new features?). However, having such new features means a few things:

Users have to understand these new options (e.g. what does gnf stand for? Imagine what a user will think when reading the documentation of astext and finding this additional option).
They bring a burden to maintain these features (technical debt).
Something can go wrong.

In particular, in the case of gnf=False, the semantics of the program are no longer equivalent to the original when there are shared paths (such as in ResNet). This can be very problematic, because astext is supposed to create a bidirectional format (with gnf=False this is no longer true).

Note that if the program itself is already in ANF, printing in the normal mode should work out of the box, so we do not need such an option there either.

So to make things simpler for the public-facing API, I think we should simply disable this to remove ambiguity. This is mainly because we have the graph node syntax in the program (unlike typical FP languages). This way, the user has one less concept to learn, and we have one precise syntax for the program.

If we really want to support some form of this folding in the long term (which I am not sure is really necessary), we need to run an analysis to detect the nodes that are referenced only once, and optionally use an algorithm to decide how best to fold intermediate nodes into an expression. At the same time, the parser has to be updated to support the folded expression. And if we want to switch such a printing option on, the flag could live in some global configuration env (something like relay.set_printer_option) that is used only by advanced developers, so users do not have to worry about the additional option.
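The analysis mentioned here (detect nodes referenced only once, fold those inline, and keep a temporary for anything shared) could look roughly like the sketch below. Everything in it is hypothetical: the Node type, the CountUses and Printer names, and the output syntax are illustrative stand-ins, not Relay's IR or text format.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <unordered_map>
#include <unordered_set>
#include <vector>

namespace inline_sketch {

// Hypothetical expression graph node; sharing is expressed by pointer identity.
struct Node {
  std::string op;
  std::vector<std::shared_ptr<Node>> args;
};
using NodePtr = std::shared_ptr<Node>;

// Counts how often each node appears as an argument, visiting each node once.
void CountUses(const NodePtr& n, std::unordered_map<const Node*, int>* uses,
               std::unordered_set<const Node*>* visited) {
  if (!visited->insert(n.get()).second) return;
  for (const NodePtr& a : n->args) {
    ++(*uses)[a.get()];
    CountUses(a, uses, visited);
  }
}

class Printer {
 public:
  // Emits one binding per shared node, then the root expression.
  std::string Print(const NodePtr& root) {
    std::unordered_set<const Node*> visited;
    CountUses(root, &uses_, &visited);
    std::string result = Expr(root);
    return bindings_ + result;
  }

 private:
  std::string Expr(const NodePtr& n) {
    auto it = names_.find(n.get());
    if (it != names_.end()) return it->second;  // already bound to a temporary
    std::string s = n->op;
    if (!n->args.empty()) {
      s += "(";
      for (std::size_t i = 0; i < n->args.size(); ++i) {
        if (i != 0) s += ", ";
        s += Expr(n->args[i]);
      }
      s += ")";
    }
    if (uses_[n.get()] > 1) {  // shared: bind once so sharing survives printing
      std::string name = "%" + std::to_string(names_.size());
      names_[n.get()] = name;
      bindings_ += name + " = " + s + "\n";
      return name;
    }
    return s;  // referenced at most once: safe to fold inline
  }

  std::unordered_map<const Node*, int> uses_;
  std::unordered_map<const Node*, std::string> names_;
  std::string bindings_;
};

// add(c, c) where c = conv(x) is shared between both arguments.
std::string Example() {
  NodePtr x = std::make_shared<Node>(Node{"x", {}});
  NodePtr c = std::make_shared<Node>(Node{"conv", {x}});
  NodePtr add = std::make_shared<Node>(Node{"add", {c, c}});
  return Printer().Print(add);
}

}  // namespace inline_sketch
```

Printing the same graph with no temporaries at all would give add(conv(x), conv(x)), which reads as two separate conv nodes. That is the loss of sharing raised above for programs with shared paths.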

tqchen · 2019-03-09T03:39:04Z

Back to the choice of tuple projection and separator: I think we should settle the discussion with an RFC before committing the code. It is even more dangerous to have a changing text format flying around while claiming that we have parser round-tripping (imagine a user starts to use the feature and then finds that things break afterwards).

A safe choice would be to keep a consistent behavior with the old text printer, (no separator, [] projection), and give up round-tripping in the PR, until we fully resolved and committed to a text format.

joshpoll · 2019-03-14T22:48:42Z

After discussions with @tqchen I have

removed roundtrip tests
scaled back over-aggressive inlining
reverted syntax changes
separated printing into two public-facing functions: one for the bidirectional text format and one for debugging

The PR is now feature complete. Inlining is currently less aggressive than the existing text printer; however, it is more correct. Here is the roadmap going forward before 0.6:

Introduce smart inlining.
Open an RFC for remaining text format design decisions including semicolons, generics, and tuple projection.
Establish roundtrip tests.
Optimize the printer using arenas.
Use more advanced Doc models from Wadler's paper (may happen earlier depending on need/interest).

cc @wweic @merrymercy @MarisaKirisame @tqchen @jroesch

MarisaKirisame · 2019-03-18T03:35:50Z

src/relay/ir/doc.cc

+// DSL function implementations
+
+Doc& Doc::operator<<(const Doc& right) {
+  this->stream_.insert(this->stream_.end(), right.stream_.begin(), right.stream_.end());


I don't think this is correct.
Use https://en.cppreference.com/w/cpp/iterator/back_insert_iterator instead.

Or just use a for loop.

Also check against this == &right.

https://stackoverflow.com/questions/201718/concatenating-two-stdvectors

You still have to guard against this == &right.

see latest commit
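For readers following this thread: appending one vector to a different vector with insert is well defined, but std::vector::insert(pos, first, last) has undefined behavior when first and last are iterators into the vector being modified, which is what happens when this == &right. A minimal sketch of one way to guard against that is shown below. The Stream alias and Append function are hypothetical stand-ins, and this is not necessarily how the latest commit resolved it.

```cpp
#include <cassert>
#include <string>
#include <vector>

namespace append_sketch {

// Hypothetical stand-in for a Doc's atom stream.
using Stream = std::vector<std::string>;

// Appends `right` to `left`, including when both name the same vector.
void Append(Stream& left, const Stream& right) {
  if (&left == &right) {
    // Snapshot first so the source range does not alias the destination.
    Stream copy = right;
    left.insert(left.end(), copy.begin(), copy.end());
  } else {
    left.insert(left.end(), right.begin(), right.end());
  }
}

}  // namespace append_sketch
```

A loop that calls push_back while iterating over the same vector has the same aliasing problem, since push_back may reallocate and invalidate the iterators, so the self-append case needs handling whichever form is used.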

tqchen · 2019-03-20T23:12:10Z

Thanks, @wweic @MarisaKirisame @joshpoll this is now merged.

joshpoll mentioned this pull request Feb 21, 2019

[RELAY][RFC] Modify repr to return a valid Python AST #2631

Closed

MarisaKirisame requested changes Mar 1, 2019

icemelon added the status: WIP label Mar 2, 2019

joshpoll changed the title ~~[Relay][Text Format] Parser-Printer Roundtrip Scaffolding~~ [Relay][Text Format] Text Printer Refactor Mar 4, 2019

joshpoll force-pushed the parser-roundtrip branch from df08054 to 1490d79 Compare March 6, 2019 06:27

joshpoll changed the title ~~[Relay][Text Format] Text Printer Refactor~~ [Relay][Text Format] Text Printer Refactor and ANF Printing Mar 7, 2019

joshpoll mentioned this pull request Mar 7, 2019

[Relay][CI] Add hypothesis to python dependencies #2745

Closed

joshpoll marked this pull request as ready for review March 7, 2019 23:24

MarisaKirisame reviewed Mar 8, 2019

joshpoll changed the title ~~[Relay][Text Format] Text Printer Refactor and ANF Printing~~ [Relay][Text Format] Text Printer Refactor and "PL" Printing Mar 9, 2019

joshpoll added 11 commits March 12, 2019 15:11

minor bug fix and beginnings of hypothesis testing

c13b2ae

bump semver. add inline_meta_data flag. implement tuples

aec4724

add simple wadler-style printer infrastructure

1595a46

make things build

7b6ce8b

remove stringstreams. add pretty printer class

c11260a

commit failing refactoring things. can't return abstract class. may need to rethink design

0a3210b

switch to tvm-style adts

9f56672

tuple projection and more testing

1f8ff51

revert text_printer changes

414fab4

derandomize

497f4eb

revert version number bump

c921cd0

joshpoll added 4 commits March 12, 2019 15:11

separate interfaces for interchange and debug

3a5d1de

bug fix

5807815

remove hacks

e7201fa

fix atr bug

3184de0

joshpoll force-pushed the parser-roundtrip branch from 9610766 to 3184de0 Compare March 13, 2019 00:26

joshpoll added 5 commits March 12, 2019 17:37

lint

f5081bd

lint

c8ab556

improve doc stream interface

da68638

further simplify interface. remove NOLINTs

c7c8a67

fix bugs and docs

81ea258

joshpoll changed the title ~~[Relay][Text Format] Text Printer Refactor and "PL" Printing~~ [Relay][Text Format] Text Printer Refactor and Debug Printing Mar 14, 2019

move str to doc.cc

fd43b91

remove stale documentation

2eb7f6e

wweic approved these changes Mar 16, 2019

Update src/relay/ir/pretty_printer.cc

c9e7d88

Co-Authored-By: joshpoll <joshpollock1997@gmail.com>

MarisaKirisame requested changes Mar 18, 2019

joshpoll added 2 commits March 17, 2019 21:02

address feedback

a528fe2

Merge branch 'parser-roundtrip' of https://github.com/joshpoll/tvm into parser-roundtrip

3a4e41d

MarisaKirisame approved these changes Mar 18, 2019

minor comment changes and bump ci

7f9921d

tqchen approved these changes Mar 20, 2019

tqchen merged commit db5bfa3 into apache:master Mar 20, 2019

tqchen added status: accepted and removed status: WIP labels Mar 20, 2019

Laurawly pushed a commit to Laurawly/tvm-1 that referenced this pull request Mar 22, 2019

[Relay][Text Format] Text Printer Refactor and Debug Printing (apache#2605)

71c6b2c

wweic pushed a commit to wweic/tvm that referenced this pull request Mar 24, 2019

[Relay][Text Format] Text Printer Refactor and Debug Printing (apache#2605)

495b52c

wweic pushed a commit to neo-ai/tvm that referenced this pull request Mar 24, 2019

[Relay][Text Format] Text Printer Refactor and Debug Printing (apache#2605)

45a97fc

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
