
[IR] Creation-time constant fold for constant expressions #209

Merged
merged 5 commits into hidet-org:main from const-fold on May 5, 2023

Conversation

yaoyaoding
Member

Previously, when we performed arithmetic operations on constants, the resulting expression was kept symbolic in our IR:

from hidet.ir.expr import convert
a = convert(1)
b = convert(2)
c = a + b
print(c)  # would be Add(Constant(1), Constant(2))

With this PR, the computation is performed when the Add expression is created, and the result is replaced with a hidet.ir.Constant:

from hidet.ir.expr import convert
a = convert(1)
b = convert(2)
c = a + b
print(c)  # would be Constant(3)
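
As a rough illustration of creation-time folding, the following standalone sketch (with stand-in Constant and Add classes, not hidet's actual IR internals) folds two constants when the operator is applied and keeps the expression symbolic otherwise:

# Minimal sketch of creation-time constant folding; Constant and Add here
# are stand-in classes, not hidet's real IR nodes.
class Expr:
    def __add__(self, other):
        other = other if isinstance(other, Expr) else Constant(other)
        # Fold at creation time: adding two constants yields a new constant.
        if isinstance(self, Constant) and isinstance(other, Constant):
            return Constant(self.value + other.value)
        return Add(self, other)

class Constant(Expr):
    def __init__(self, value):
        self.value = value
    def __repr__(self):
        return f"Constant({self.value})"

class Add(Expr):
    def __init__(self, a, b):
        self.a, self.b = a, b
    def __repr__(self):
        return f"Add({self.a!r}, {self.b!r})"

print(Constant(1) + Constant(2))                     # Constant(3)
print(Constant(1) + Add(Constant(2), Constant(3)))   # stays symbolic: Add(...)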

This PR also cleans up hidet.ir.dialects.pattern.py. We no longer do much fancy pattern matching.

@yaoyaoding yaoyaoding merged commit daae22e into hidet-org:main May 5, 2023
@yaoyaoding yaoyaoding deleted the const-fold branch May 5, 2023 04:40
vadiklyutiy pushed a commit that referenced this pull request Jul 22, 2024
The `steal_weight` option works as expected after removing all references to
torch tensors. `HidetModule` has `torch_params` and `hidet_params`
attributes. In modules such as `HidetLinear` and
`HidetMultiheadAttention` we do not need to keep the original weight tensors
in `torch_params` and `hidet_params`; instead, their transposed copies
are stored in those modules. Removing the original weight tensors frees
additional GPU memory.
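
Roughly, the idea is sketched below; `HidetLinear` here is a simplified stand-in, and everything beyond the `torch_params`/`hidet_params` attributes named above is an assumption rather than the actual implementation:

# Sketch of the steal_weight idea: keep only the transposed copy of a weight
# and drop every reference to the original torch tensor so its GPU memory can
# be released. This HidetLinear is a simplified stand-in, not the real module.
import torch
import hidet

class HidetLinear:
    def __init__(self, torch_linear: torch.nn.Linear):
        # Store the transposed copy that the hidet kernel actually uses.
        self.transposed_weight = hidet.from_torch(
            torch_linear.weight.detach().t().contiguous()
        )
        # Drop the original parameter; once no references remain, torch can
        # free the original weight's GPU allocation.
        torch_linear.weight = None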

Now we are able to compile the Llama2-7B model, which has 12 GiB of weights, on
a 24 GiB RTX 3090 GPU. This is the state of GPU memory after compiling with
hidet:

- Allocated by hidet: 21179 MiB
- Allocated by torch: 314 MiB

For reference, torch.compile with `backend=inductor` and
`mode=max-autotune`:
- Allocated by hidet: 0 MiB
- Allocated by torch: 12925 MiB


This script was used to test the Llama2 model:
https://drive.google.com/file/d/1Baz5MrC9wWg9ceirmKZtkMPbv4u3CAuc/view?usp=sharing

---------

Co-authored-by: Zhumakhan <nazirzhumakhan@gmail.com>
vadiklyutiy pushed a commit that referenced this pull request Jul 23, 2024
vadiklyutiy pushed a commit that referenced this pull request Dec 26, 2024