[Refactor] Remove scope attribute from Buffer class #8463

masahi · 2021-07-13T20:27:52Z

A follow up to #8366. Right now, storage scope information is spread across three components:

AttrStmt with attr::storage_scope key
PointerType
Buffer class

tvm/include/tvm/tir/buffer.h

Lines 70 to 71 in 2cca934

/*! \brief storage scope of the buffer, if other than global */

String scope;

After #8366, storage scopes associated with AttrStmt and PointerType are identical. To consolidate storage scope information into one place, I'm proposing to remove the storage scope from AttrStmt and the Buffer class.

This PR is for the latter refactoring. I removed the scope data member from the Buffer class and added an alternative way to access the storage scope through the buffer's associated buffer variable.
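
The accessor idea can be pictured with a small toy model. This is only a sketch in plain Python, not TVM's actual classes: `PointerType`, `Var`, and `Buffer` here are simplified stand-ins, and treating an empty scope as "global" is an assumption made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PointerType:
    """Toy stand-in for a pointer type that carries a storage scope."""
    element_dtype: str
    storage_scope: str = ""  # assumption: empty string means "global"


@dataclass(frozen=True)
class Var:
    """Toy stand-in for a buffer variable with a type annotation."""
    name: str
    type_annotation: object


@dataclass(frozen=True)
class Buffer:
    """Toy buffer that has no scope field of its own."""
    data: Var
    shape: tuple

    def scope(self) -> str:
        # Read the scope from the buffer variable's pointer type
        # instead of from a field stored on the buffer.
        ptr_type = self.data.type_annotation
        if not isinstance(ptr_type, PointerType):
            raise TypeError("Buffer variable is not of pointer type")
        return ptr_type.storage_scope or "global"


shared_buf = Buffer(Var("A", PointerType("float32", "shared")), (32,))
global_buf = Buffer(Var("B", PointerType("float32")), (32,))
print(shared_buf.scope(), global_buf.scope())
```

With this shape there is only one place where the scope is stored, so the buffer and its variable cannot disagree.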

@tqchen @vinx13 @kparzysz-quic @csullivan

also cc @Hzfengsy since the removed field is only used by TensorIR related code

tqchen · 2021-07-14T15:01:07Z

Thanks @masahi. Sorry for being late on this. I think it is worthwhile to think a bit more. In general, there are two kinds of information that we normally carry throughout the program:

K0: The information in the declaration/allocation site
K1: The information that is carried as part of the type (of the Var), which is used throughout the usage sites in the code base.

The Buffer object was intended for the declaration site information. There are in general two kinds of design choices here:

C0: On one hand, the information being present in the declaration site K0 can be duplicated with the information present in the allocation site, so we could remove the duplication (for example, dtype and scope are duplicated in the type atm).
C1: On the other hand, one could argue that the type annotation should be made independent from the type annotation that also shares with the usage site, so we could let the allocation site contain a duplicated set of information and check consistency during construction.

Right now this PR follows C0 partially. I wonder if we should also consider the choice between C0 and C1. In the particular case of Buffer, I feel that we should give C1 a serious consideration.

We can also use the text format to illustrate the potential differences between the two.

# Choice C0: the information was already duplicated on the lhs, so do not present on rhs
ptr : Pointer[float32, "gpu"] = allocate_buffer(32)

# Choice C1:
ptr : Pointer[float32, "gpu"] = allocate_buffer(32, "float32", "gpu")
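
To make the difference concrete, here is a minimal sketch of what the consistency check under C1 might look like, next to C0 where the rhs carries nothing to check. The names `Pointer`, `Allocation`, `allocate_buffer`, and `declare` are hypothetical helpers that mirror the text format above; they are not an existing API.

```python
from typing import NamedTuple, Optional


class Pointer(NamedTuple):
    """Hypothetical lhs type annotation: element dtype plus storage scope."""
    dtype: str
    scope: str


class Allocation(NamedTuple):
    """Hypothetical rhs allocation; dtype/scope stay None under choice C0."""
    size: int
    dtype: Optional[str] = None
    scope: Optional[str] = None


def allocate_buffer(size, dtype=None, scope=None):
    return Allocation(size, dtype, scope)


def declare(annotation: Pointer, alloc: Allocation) -> Allocation:
    """Bind an allocation to an annotated pointer.

    C0: the rhs omits dtype/scope, so they are taken from the lhs.
    C1: the rhs repeats them, so they are checked against the lhs.
    """
    for name in ("dtype", "scope"):
        rhs_value = getattr(alloc, name)
        lhs_value = getattr(annotation, name)
        if rhs_value is not None and rhs_value != lhs_value:
            raise ValueError(f"{name} mismatch: {rhs_value!r} vs {lhs_value!r}")
    return alloc._replace(dtype=annotation.dtype, scope=annotation.scope)


ptr_type = Pointer("float32", "gpu")
c0 = declare(ptr_type, allocate_buffer(32))                    # choice C0
c1 = declare(ptr_type, allocate_buffer(32, "float32", "gpu"))  # choice C1
```

Both calls end up with the same resolved allocation; the only difference is that C1 has something to validate at construction time.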

tqchen · 2021-07-14T15:01:37Z

cc @jroesch @junrushao1994 to also share some thoughts here.

jroesch · 2021-07-14T21:22:31Z

@tqchen is there more context on what the design/end goal is?

tqchen · 2021-07-14T22:00:17Z

related context PR #8366

masahi · 2021-07-14T22:10:28Z

@jroesch I've updated the PR description with more details.

jroesch · 2021-07-14T23:20:48Z

@tqchen wrt C0 vs. C1, is there any world in which these would differ? Is the argument that you might not always have the type/lhs information in hand when you need to analyze the call site?

masahi · 2021-07-15T02:46:38Z

> On the other hand, one could argue that the type annotation should be made independent from the type annotation that also shares with the usage site, so we could let the allocation site contain a duplicated set of information and check consistency during construction

I think if duplicated pieces of information are supposed to be consistent, then they are already not independent. So I don't see an advantage in keeping track of two copies of essentially the same information.

To @jroesch's question, right now our code base uses two ways to create Buffer:

Via the decl_buffer function. Here, a buffer variable (with its PointerType) is constructed at the same time as the Buffer, so we can make sure that the two storage scopes agree (although currently we don't pass storage_scope to the Buffer constructor).
https://github.com/apache/tvm/blob/main/src/tir/ir/buffer.cc#L48-L53
Via the Buffer constructor directly, using a buffer variable that has already been constructed. https://github.com/apache/tvm/blob/main/src/tir/ir/buffer.cc#L386-L388 Here, it is possible for the storage scope recorded in the buffer variable to differ from the one passed to the Buffer constructor. In particular, if the former is empty while the latter is a non-trivial scope (shared, local, etc.), I would say this is a bug in creating the original buffer variable with an empty scope. If we want to allow such usage while keeping the two scopes consistent, we need to update the scope in the buffer variable (which is not simple). Right now we don't have such usage in the code base, so I chose to drop the scope argument from the Buffer constructor to prevent misuse.

One possible middle ground is to keep scope member of Buffer, but drop it from the constructor and instead initialize it with the scope in the buffer variable (data in the code). This might be better in terms of design, in that we also have dtype duplicated.
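
A rough sketch of that middle ground, again using toy Python stand-ins rather than TVM's real classes: the buffer still exposes a `scope` field, but the constructor does not accept it and fills it in from the buffer variable. The class names and the "global" default for an empty scope are assumptions for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class PointerType:
    """Toy pointer type carrying the storage scope."""
    element_dtype: str
    storage_scope: str = ""  # assumption: empty string means "global"


@dataclass(frozen=True)
class Var:
    """Toy buffer variable."""
    name: str
    type_annotation: PointerType


@dataclass
class Buffer:
    """Toy buffer that keeps a scope field but never takes it as an argument."""
    data: Var
    shape: tuple
    dtype: str                        # still passed in, i.e. duplicated
    scope: str = field(init=False)    # derived, not a constructor parameter

    def __post_init__(self):
        # Initialize the field from the scope recorded in the buffer variable.
        self.scope = self.data.type_annotation.storage_scope or "global"


buf = Buffer(Var("A", PointerType("float32", "shared")), (32,), "float32")
print(buf.scope)
```

Because `scope` is not a constructor parameter in this sketch, a caller cannot supply a value that conflicts with the buffer variable.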

tqchen · 2021-07-15T12:02:31Z

To clarify, we are moving toward a world where the type annotation in the lhs is always available.

The main thing we want to decide is whether to remove the additional info from the rhs.

Note that if it were the other way around (keep the info in the rhs and remove the lhs info), it would be less controversial. But in this case we want the info in the lhs so it is available at future reference points.

This asymmetry arises because we normally assume the information flows from rhs to lhs in the TIR. It can be a bit weird to infer the allocation type from the pointer type of the variable that holds the allocation; of course, they should be made consistent.

I just want us to think carefully and make such choices consistently.

csullivan · 2021-07-15T15:38:03Z

Another consideration is if the Type of a Buffer's Var is not a PointerType, e.g. PrimType, having the scope on the rhs could be necessary. Are there examples of this occurring / do we envision the need?

kparzysz-quic · 2021-07-15T15:56:19Z

We should not duplicate information in the IR.

The flow of information from rhs to lhs is not necessarily the right way to frame it. In an assignment "a = b", there is information present both in a and in b, and the assignment has its own meaning as a whole. Depending on what kind of analysis we want to do, the inference of information can flow either way.

masahi · 2021-07-15T21:13:42Z

@csullivan As of #8366, all buffer vars should be of pointer type. If that doesn't hold, I'd consider it a bug. This PR has a check

tvm/src/tir/ir/buffer.cc

Lines 316 to 317 in 2128bd4

    
const auto* ptr_type = (*this)->data->type_annotation.as<PointerTypeNode>();
ICHECK(ptr_type) << "Buffer variable is not of pointer type";

before retrieving the storage scope (and all tests passed).

masahi · 2021-07-15T21:26:16Z

I'd say if we agree that having storage scope information in the type is a good idea, then we should exploit this information, even if that ends up going from left to right.

If we want to strictly make the rhs (Buffer declaration) the source of truth, I think we might need to revisit the decision of putting storage information in the type of pointer.

tqchen · 2021-07-15T23:03:32Z

Thanks everyone for sharing the thoughts. First of all, I think we all agree that we should put storage scope in the type, so that the information can flow clearly to the use site.

On the other hand, there can be certain cases where duplicated information appears. Say, in the following two assignments, b's type annotation is redundant because it can be inferred from a, but it can nevertheless appear in the IR as long as we have clear consistency checks.

a : int = some_value()
b : int = a

That is why I brought up the C0 and C1 distinction. As @kparzysz-quic said, in this particular case the argument can also go the other way if we view the Buffer as the assignment (declaration) as a whole.

So if folks feel strongly that the scope information can be removed, I am not too attached to it. I think we should consider it more seriously, though, if it is about the dtype information, since it is more inherently part of the DLTensor spec and having a clear field that is checked consistently would help in most cases.

tqchen · 2021-07-20T13:11:06Z

Trying to move this convo forward and conclude:

I think we can remove the scope from buffer if we feel strongly about it
Ideally it would be great to keep the dtype field since it is part of the DLPack protocol

If we all agree, we can go ahead and merge this PR

tqchen · 2021-07-20T20:20:03Z

Thanks @masahi @jroesch @kparzysz-quic

Co-authored-by: masa <masa@pop-os.localdomain>

masahi and others added 6 commits July 14, 2021 05:03

Remove scope member from Buffer and add scope() method

1a67ef0

fixed all tests

fc7349e

add scope method to python Buffer class

c494769

Expose C++ function and use it from Python Buffer class

1c67a7d

cpplint fix

fe626f3

update vta code

2128bd4

masahi marked this pull request as ready for review July 14, 2021 10:55

vinx13 self-assigned this Jul 14, 2021

tqchen approved these changes Jul 20, 2021


tqchen merged commit 1a1be09 into apache:main Jul 20, 2021

ylc pushed a commit to ylc/tvm that referenced this pull request Sep 29, 2021

[Refactor] Remove scope attribute from Buffer class (apache#8463)

b6cfbed

Co-authored-by: masa <masa@pop-os.localdomain>

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

ylc pushed a commit to ylc/tvm that referenced this pull request Jan 13, 2022

[Refactor] Remove scope attribute from Buffer class (apache#8463)

35e9df6

Co-authored-by: masa <masa@pop-os.localdomain>
