Skip to content

Release v0.10.0

Compare
Choose a tag to compare
@volcacius volcacius released this 08 Dec 16:36
· 173 commits to master since this release

Highlights

  • Support for PyTorch up to version 2.1 .
  • Support for GPTQ PTQ algorithm.
  • Support for GPFQ PTQ algorithm.
  • Support for SmoothQuant / activation equalization PTQ algorithm.
  • Support for MSE based scale and zero-point for weights and activations.
  • Support for row-wise scaling at the input of QuantLinear.
  • Support for quantization of a slice of a weight tensor.
  • End-to-end support for learned rounding in ImageNet PTQ.
  • End-to-end example training scripts for A2Q (low precision accumulation) over superresolution.
  • Experimental support for minifloats (eXmY quantization).
  • Experimental LLM PTQ flow with support for weight-only and weight+activation quantization, together with GPTQ, AWQ and SmoothQuant.
  • Experimental Stable Diffusion PTQ flow with support for weight-only quantization.
  • Deprecated FINN ONNX export flow.
  • Update custom value_trace FX tracer to latest FX.
  • New custom variant of make_fx tracer with support for custom torch.library ops through @Wrap annotation.

What's Changed

New Contributors

Full Changelog: v0.9.1...v0.10.0