v0.1.1
What's new in 0.1.1 (2023-08-03)
These are the changes in inference v0.1.1.
New features
- FEAT: add opt-125m pytorch model and add ut by @pangyoki in #263
- FEAT: support falcon 40b pytorch model by @pangyoki in #278
- FEAT: pytorch model embeddings by @jiayini1119 in #282
- FEAT: support falcon-instruct 7b and 40b pytorch model by @jiayini1119 in #287
- FEAT: support chatglm/chatglm2/chatglm2-32k pytorch model by @pangyoki in #283
- FEAT: support qwen 7b by @UranusSeven in #294
Enhancements
- ENH: Support Enviroment Variable by @RayJi01 in #285
- REF: split supervisor and worker by @UranusSeven in #279
Bug fixes
- BUG: fix import torch error even if user don't want to launch torch model by @pangyoki in #274
- BUG: empty legacy model dir by @UranusSeven in #276
Tests
Documentation
- DOC: Update README_ja_JP.md by @eltociear in #269
- DOC: add docstring to client methods by @RayJi01 in #247
Full Changelog: v0.1.0...v0.1.1