v0.16.1
What's new in 0.16.1 (2024-10-25)
These are the changes in inference v0.16.1.
New features
- FEAT: Add support for Qwen/Qwen2.5-Coder-7B-Instruct gptq format by @frostyplanet in #2408
- FEAT: Support GOT-OCR2_0 by @codingl2k1 in #2458
- FEAT: [UI] Image model with the lora_config. by @yiboyasss in #2482
- FEAT: added MLX support for Flux.1 by @qinxuye in #2459
Enhancements
- ENH: Support ChatTTS 0.2 by @codingl2k1 in #2449
- ENH: Pending queue for concurrent requests by @codingl2k1 in #2473
Bug fixes
- BUG: Remove duplicated call of model_install by @frostyplanet in #2457
- BUG: fix embedding model gte-Qwen2 dimensions by @JinCheng666 in #2479
Documentation
New Contributors
- @JinCheng666 made their first contribution in #2479
Full Changelog: v0.16.0...v0.16.1