
Feat (proxy): scale computation delegated to bias proxy #938

Merged
merged 5 commits into Xilinx:dev on Apr 19, 2024

Conversation

@Giuseppe5 (Collaborator) commented on Apr 19, 2024

As part of decoupling QuantLayers and QuantTensor, the computation of the output scale for bias quantization is now delegated to the underlying proxy, which takes the (possibly quantized) input and weights as input and internally derives the output scale.
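As a rough illustration of what "delegated to the proxy" means here, a minimal sketch could look like the following. The class and method are hypothetical simplifications, not Brevitas' actual API, and they assume the usual affine-quantization rule that the bias scale is the product of the input and weight scales:

```python
from typing import Optional

from torch import Tensor


class BiasQuantProxySketch:
    """Hypothetical bias proxy that derives the output scale internally."""

    def quant_output_scale_impl(
            self,
            input_scale: Optional[Tensor],
            weight_scale: Optional[Tensor]) -> Optional[Tensor]:
        # For y = x @ W + b, the accumulator operates at scale
        # input_scale * weight_scale, so the bias is quantized at that
        # scale whenever both factors are known.
        if input_scale is None or weight_scale is None:
            return None
        return input_scale * weight_scale
```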

Compared to our current setup, the main change happens in the case where:

  • the bias quantizer is shared across multiple layers, and
  • those layers are of different types (e.g., Linear and Conv).

Our current setup has no issue with this case, while after this PR the user would be required to use an internally defined bias scale rather than an externally defined one.
I believe this is an edge case that can be safely ignored.
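To make the edge case concrete, here is a hypothetical illustration: because the proxy infers the channel dimension from the module type (see the diff snippet below), a single externally defined per-channel bias scale layout can no longer serve a Linear and a Conv layer at the same time:

```python
import torch

layers = [torch.nn.Linear(8, 16), torch.nn.Conv2d(8, 16, kernel_size=3)]
for layer in layers:
    # Same rule as in the diff below: channels sit on the last dim for
    # Linear and on dim 1 for convolutions.
    channel_dim = -1 if isinstance(layer, torch.nn.Linear) else 1
    print(type(layer).__name__, "-> channel dim", channel_dim)
# Linear -> channel dim -1
# Conv2d -> channel dim 1
```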

The relevant addition in the diff:

```python
def quant_output_scale_impl(
        self, input: QuantTensor, weight: QuantTensor, module: torch.nn.Module) -> Tensor:
    channel_dim = -1 if isinstance(module, torch.nn.Linear) else 1
```
Collaborator commented:

How come it's -1 if Linear and 1 otherwise?

@Giuseppe5 (Author) replied:

https://pytorch.org/docs/stable/generated/torch.nn.Linear.html
For a linear layer, the channel dimension is always the last one; for Conv, ConvTranspose, etc., it is always at dim 1, at least for the input tensor.
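For reference, a quick shape check (illustrative only) confirms the convention: Linear tensors put channels on the last dimension, while Conv tensors put them at dim 1, for both inputs and outputs:

```python
import torch

linear_out = torch.nn.Linear(8, 16)(torch.randn(2, 8))
conv_out = torch.nn.Conv2d(3, 16, kernel_size=3)(torch.randn(2, 3, 32, 32))

print(linear_out.shape)  # torch.Size([2, 16])         -> channels at dim -1
print(conv_out.shape)    # torch.Size([2, 16, 30, 30]) -> channels at dim 1
```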

@Giuseppe5 merged commit 670420f into Xilinx:dev on Apr 19, 2024
304 of 347 checks passed