Skip to content

Releases: neuralmagic/compressed-tensors

Compressed Tensors v0.8.0

12 Nov 14:53
ff121cc
Compare
Choose a tag to compare

What's Changed

  • [Observer Restructure]: Separate out scale/zp and observer init; separate out calibration from forward pass by @dsikka in #188
  • Fix device allocation for MSE observer by @anmarques in #190
  • drop 3.8 and add 3.12 to testing by @dhuangnm in #196
  • Fix test which required accelerate, apply style by @kylesayrs in #194
  • [Bugfix] Move observer and g_idx until after module in onloaded by @kylesayrs in #195
  • Add sparsity structure enum by @rahul-tuli in #197
  • Observer Restructure: Remove Observers, calibration, and applying frozen steps from lifecycle by @dsikka in #189
  • Clean up observer defaulting logic, better error message by @kylesayrs in #200
  • apply style and quality by @kylesayrs in #201
  • [BugFix] Fix Marlin24 Bug by @dsikka in #203
  • Bump version to v0.8.0 by @dsikka in #204

New Contributors

Full Changelog: 0.7.1...0.8.0

Compressed Tensors v0.7.1

17 Oct 18:13
506cd36
Compare
Choose a tag to compare

What's Changed

  • [Observer Restructure]: Remove MemoryLess Observer; use helper function for dynamic quantization by @dsikka in #187
  • bump up to 0.7.1 for patch release by @dhuangnm in #192

Full Changelog: 0.7.0...0.7.1

Compressed Tensors v0.7.0

09 Oct 11:29
b876a60
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.6.0...0.7.0

Compressed Tensors v0.6.0

23 Sep 18:44
83b2e7a
Compare
Choose a tag to compare

What's Changed

Read more

Compressed Tensors v0.5.0

14 Aug 20:36
2f22bef
Compare
Choose a tag to compare

What's Changed

New Contributors

Read more

Compressed Tensors v0.4.0

03 Jul 20:25
42575fd
Compare
Choose a tag to compare

New Features:

  • Scheme alias support in quant config (#40)
  • New compressors: packed int4 (#47), Marlin 2:4 (#77)

Changes:

  • None

Resolved Issues:

  • Group-size quantization implementation addressed to ensure correctness. (#60)

Known Issues:

  • None