Skip to content

Releases: HanGuo97/flute

v0.2.6

19 Nov 13:56
Compare
Choose a tag to compare

v0.2.4

18 Nov 04:34
Compare
Choose a tag to compare
bumped version number

v0.2.3

18 Nov 00:30
Compare
Choose a tag to compare
v0.2.3 Pre-release
Pre-release

Added support for vector dequantization (with a vector size of 2)

v0.1.0

05 Oct 18:56
28d6710
Compare
Choose a tag to compare

What's Changed

  • Add learnable scales functionality. by @radi-cho in #8
  • Better HuggingFace support.

New Contributors

Full Changelog: v0.0.7...v0.1.0

v0.0.7

26 Aug 21:21
Compare
Choose a tag to compare

Added bitsandbytes conversion support.

v0.0.6

03 Aug 00:31
Compare
Choose a tag to compare

Patch release.

Full Changelog: v0.0.5...v0.0.6

v0.0.5

02 Aug 13:42
Compare
Choose a tag to compare

Added support for RTX4090

v0.0.4

27 Jul 22:37
Compare
Choose a tag to compare
  1. Adding support for LLaMA-3.1 405B
  2. Lightly tuned BF16 performance, though still worse than FP16, especially in 3-bit settings.
  3. Uses newer vLLM version.

v0.0.3

20 Jul 22:58
Compare
Choose a tag to compare
included dependencies in setup

v0.0.2

20 Jul 00:01
Compare
Choose a tag to compare
v0.0.2 Pre-release
Pre-release

Renamed the distribution name to flute-kernel to avoid name conflicts.