Refactor tensors #147

dacorvo · 2024-04-02T08:16:17Z

This makes the distinction between quantized activations (8-bit per-tensor only) and quantized weights (per-axis) clearer.

This better separates the responsibilities between Tensor subclasses and their quantizers. It also creates dedicated quantize_weight and quantize_activation helpers to better illustrate the differences between the two quantized Tensor types.
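To illustrate the per-tensor versus per-axis distinction described above, here is a minimal pure-Python sketch of symmetric 8-bit quantization. The function names (`per_tensor_quantize`, `per_axis_quantize`) and the choice of rows as the quantization axis are assumptions made for illustration only; this is not the code of the quantize_activation and quantize_weight helpers added in this PR.

```python
# Illustrative sketch only: these helpers are hypothetical and are NOT the
# library's quantize_activation / quantize_weight implementations.

def _symmetric_scale(values, qmax=127):
    """Return the scale that maps the largest magnitude onto qmax."""
    peak = max((abs(v) for v in values), default=0.0)
    return peak / qmax if peak > 0 else 1.0


def _quantize(values, scale, qmax=127):
    """Divide by the scale, round to the nearest integer, clamp to int8."""
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def per_tensor_quantize(matrix):
    """One scale shared by every element (activation-style)."""
    scale = _symmetric_scale([v for row in matrix for v in row])
    return [_quantize(row, scale) for row in matrix], scale


def per_axis_quantize(matrix):
    """One scale per row (weight-style; rows as the axis is an assumption)."""
    scales = [_symmetric_scale(row) for row in matrix]
    return [_quantize(row, s) for row, s in zip(matrix, scales)], scales


# The second row is 100x smaller than the first.
matrix = [[0.4, -1.0, 0.2], [0.004, -0.01, 0.002]]
q_tensor, scale = per_tensor_quantize(matrix)
q_axis, scales = per_axis_quantize(matrix)
```

With a single shared scale, the small second row collapses to `[1, -1, 0]`, while a per-row scale preserves it as `[51, -127, 25]`. That difference in resolution is one common reason to treat weights per-axis and activations per-tensor.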

dacorvo added 2 commits March 28, 2024 17:09

refactor(tensor): rename QBitsTensor file

019260c

refactor(tensor): introduce quantizers

6b03415

This better separates the responsibilities between Tensor subclasses and their quantizers. It also creates dedicated quantize_weight and quantize_activation helpers to better illustrate the differences between the two quantized Tensor types.

dacorvo force-pushed the refactor_tensors branch 4 times, most recently from c77b612 to cb66d7a Compare April 2, 2024 08:53

ci: add examples workflow

ff791c7

dacorvo force-pushed the refactor_tensors branch from cb66d7a to ff791c7 Compare April 2, 2024 09:12

dacorvo merged commit e905dc3 into main Apr 2, 2024
4 checks passed

dacorvo deleted the refactor_tensors branch April 2, 2024 09:53
