v0.6.1.post1
github-actions
released this
13 Sep 08:09
·
59 commits
to main
since this release
What's Changed
- chore: register custom torch ops for flash-attn and flashinfer by @AlpinDale in #724
- feat: launch API server with uvloop by @AlpinDale in #725
- chore: fix return statement in
Detokenizer.decode_sequence_inplace
by @AlpinDale in #727 - Fix tensor parallelism, libcudart path for some versions of pytorch by @miku448 in #726
- ci: bump to 0.6.1.post1 by @AlpinDale in #728
Full Changelog: v0.6.1...v0.6.1.post1