Skip to content

[Tracker] WIP features for torchao 0.3 #252

Closed
@supriyar

Description

@supriyar

Focus - benchmarking, documentation, tutorials, prototype to beta

Due date: June 13 2024

Spillover from 0.2.0

Benchmarking

Documentation

Tutorials

  • Tutorial for affine quantization dtype and unified quant primitives - Found lots of subtle differences, especially w.r.t. preserving zeros and tinygemm (@jerryzh168)

Core

  • QAT workflow (@andrewor14)
  • dedup the implementations of quant primitives (@jerryzh168)
  • dedup the implementations of quant APIs (@jerryzh168)
  • Deduplicate int4 workflows
  • Factory function ahd implements decorator for affine quantization dtype
  • Bit packing interfaces @msaroufim
  • float6 kernels @gau-nernst
  • int 3/5 kernel @msaroufim

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions