[WIP][DRAFT] Is this the right track? Generating Dense Quantized op #4
Conversation
Overall, looks pretty good to me, just a few nitpicky things :)
python/tvm/relay/transform/quantization/quantized_operators/dense.py
```python
    self.ref_count = 0
    self.qparams = []

def get_qparams(self, name_hint: str, dtype: str = "int8") -> QParams:
```
Passing in a name to this function is good, it will help avoid confusion.
Right now, your implementation saves every QParam ever created and adds it to a list. I'm not sure this is the exact behavior we want, since the purpose of the class saving the QParams is to let the C++ rewriter see the QParams that were created in the rewrite of the last pattern, not all the QParams. (We can get all the scales and zero point variables by doing relay.analysis.free_vars, so we don't have to store them all in a list).
However, I'm not 100% sure of the implementation details of that because it will have to interact with the FFI. For now, maybe we should just not save the QParams we've previously created, and we can add the functionality when we need it.
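A minimal sketch of the suggested direction: the manager hands out uniquely named QParams but keeps only a counter, not a history (class and field names here are hypothetical; in the real pass the scales and zero points would be relay variables recoverable via `relay.analysis.free_vars`):

```python
from dataclasses import dataclass


@dataclass
class QParams:
    """Scale and zero-point placeholders for one tensor (hypothetical shape)."""
    scale_name: str
    zero_point_name: str
    dtype: str


class QParamManager:
    """Hands out uniquely named QParams without retaining them in a list.

    Per the review suggestion, previously created QParams are not stored;
    only a reference counter survives to keep generated names unique.
    """

    def __init__(self):
        self.ref_count = 0  # a counter, not a list of every QParam ever made

    def get_qparams(self, name_hint: str, dtype: str = "int8") -> QParams:
        self.ref_count += 1
        prefix = f"{name_hint}_{self.ref_count}"
        return QParams(f"{prefix}_scale", f"{prefix}_zero_point", dtype)
```

With this shape, the C++ rewriter would only ever need the QParams from the most recent pattern rewrite, which the caller still holds.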
```python
) -> Tuple[tvm.relay.Expr, QParams]:
    """TODO"""

# TODO: figure out whether we need this or we can always have the
```
Units is a dense attribute, but it is not always set, so sometimes the caller will be able to pass in the units and sometimes not.
I believe that the units in dense is in_units, so the caller won't have access to out_units unless they look at the shape.
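A hedged sketch of the fallback the comment describes: when the caller cannot pass output units directly, they can be read off the weight tensor's shape (helper name is hypothetical; assumes Relay's dense weight layout of `(out_units, in_units)`):

```python
def infer_out_units(weight_shape):
    """Recover out_units from a dense weight shape of (out_units, in_units).

    This is the "look at the shape" path for callers that only have the
    in_units attribute available on the dense op.
    """
    out_units, _in_units = weight_shape
    return out_units
```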
Could you change the file name to utils.py? I'm not a huge fan of common :) Also, not sure if
One question to investigate is whether the constant folder will get rid of all the extra terms created by the non-zero zero point in the quantized inference version of the graph. If not, we may need to check the actual value of the zero point while generating the quantized relay for inference.
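To make the "extra terms" concrete, here is a sketch of the zero-point expansion for a single dense dot product (function name hypothetical, plain Python rather than relay): the three correction terms are exactly what a constant folder would need to eliminate when both zero points are zero.

```python
def dense_int(q_a, q_b, z_a, z_b):
    """Integer dot product via the zero-point expansion:

    sum((q_a - z_a) * (q_b - z_b))
        = sum(q_a * q_b) - z_b * sum(q_a) - z_a * sum(q_b) + n * z_a * z_b

    When z_a == z_b == 0, the last three terms are identically zero; whether
    the constant folder removes them from the generated relay is the open
    question above.
    """
    n = len(q_a)
    t0 = sum(x * y for x, y in zip(q_a, q_b))
    return t0 - z_b * sum(q_a) - z_a * sum(q_b) + n * z_a * z_b
```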
…write-utils-alt-interface: alt interface branch goes to the quantization dev main branch
Squashed commit history (deduplicated across the two merge commits):

* WIP support per-channel quantization
* Fix issue with per-channel bias_add
* Fix fake quantize tests (#4): fixed fake quantize issues, formatting, cleaned up unused imports, fixed real int8 tests, added Relu
* One more little one (#5): fixed requantize shape bug
* Non-working per-channel Dense
* Fix legalization for non spatial operators (apache#6): fixed axis checks for end2end functionality, fixed axis normalization, fixed lint
* Per channel fq2i (apache#8): fixed bug in requantize dimension expansion, formatting
* Respond to review comments
* Start depth_to_space: wip depth_to_space, dtos ident

Co-authored-by: Matthew <mbrookhart@octoml.ai>
Co-authored-by: Josh Fromm <jwfromm@octoml.ai>
Adds the QParams manager class Lily alluded to here.
Generates the dense layer according to the math (TM), using existing relay nodes. I have not checked the implementation for correctness, but hopefully the idea is clear.
These relay generation functions return the new relay expression and the associated QParams for the output.
The hot take is that simulated-quantize and static-quantize graphs are always going to be very similar; the only difference is the actual internal casting between them. We try to capture this with:
- `internal_accumulation_dtype: str = "float32"`, which represents the type actually being accumulated, and
- `simulated_accumulation_dtype: str = "int32"`, which represents the accumulated type being simulated.
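A minimal sketch of how the two dtype parameters could steer one shared codepath (helper name and branching are hypothetical; only the float32/int32 combinations described above are modeled):

```python
def cast_for_accumulation(x, internal_dtype, simulated_dtype):
    """Share one codepath between simulated and static quantized graphs.

    internal_dtype is what the graph actually computes in;
    simulated_dtype is what that accumulation stands in for.
    In the simulated graph (float32 internal) no real cast happens and int32
    is only recorded; in the static graph the two dtypes agree and a real
    integer cast is performed.
    """
    if internal_dtype == "float32":
        return float(x)  # simulated: keep float, int32 is metadata only
    if internal_dtype == simulated_dtype == "int32":
        return int(x)    # static: real integer accumulation
    raise ValueError("unsupported dtype combination")
```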
Things to do / think about:
- per channel quantization
- what are the actual differences between simulated and static quantized graphs?
- organizing files in general
- a lot of other stuff...