Skip to content

Releases: xorbitsai/inference

v0.9.3

15 Mar 06:36
60f098c
Compare
Choose a tag to compare

What's new in 0.9.3 (2024-03-15)

These are the changes in inference v0.9.3.

New features

Enhancements

Bug fixes

  • BUG: fix the useless fstring. by @mikeshi80 in #1130
  • BUG: Fixing the issue of model list loading failure caused by a large number of invalid requests on the model list page. by @wertycn in #1111
  • BUG: Fix cache status for embedding, rerank and image models on the web UI by @ChengjieLi28 in #1135
  • BUG: Fix missing information for xinference registrations and xinference list command by @ChengjieLi28 in #1140
  • BUG: Fix cannot continue to chat after canceling the streaming chat via ctrl+c by @ChengjieLi28 in #1144

Tests

Documentation

New Contributors

Full Changelog: v0.9.2...v0.9.3

v0.9.2

08 Mar 06:09
29f4c10
Compare
Choose a tag to compare

What's new in 0.9.2 (2024-03-08)

These are the changes in inference v0.9.2.

New features

Enhancements

  • ENH: Supports n_gpu_layers parameter for llama-cpp-python by @ChengjieLi28 in #1070
  • ENH: Add a dropdown to the web UI to support adjusting GPU offload layers for llama.cpp loader by @notsyncing in #1073
  • ENH: [UI] Show replica on running model page by @ChengjieLi28 in #1093
  • ENH: Add "[DONE]" to the end of stream generation for better openai SDK compatibility by @ZhangTianrong in #1062
  • ENH: [UI] Support setting CPU when selecting n_gpu by @ChengjieLi28 in #1096

Documentation

Others

  • Update llm_family.json to correct the context length of glaive coder by @mikeshi80 in #1083

New Contributors

Full Changelog: v0.9.1...v0.9.2

v0.9.1

01 Mar 07:04
7b20f76
Compare
Choose a tag to compare

What's new in 0.9.1 (2024-03-01)

These are the changes in inference v0.9.1.

New features

Enhancements

Bug fixes

  • BUG: Fix some models that cannot download from modelscope by @ChengjieLi28 in #1066
  • BUG: Fix early truncation due to max_token being default to 16 instead of 1024 by @ZhangTianrong in #1061

Documentation

New Contributors

Full Changelog: v0.9.0...v0.9.1

v0.9.0

22 Feb 08:03
c653c97
Compare
Choose a tag to compare

What's new in 0.9.0 (2024-02-22)

These are the changes in inference v0.9.0.

New features

  • FEAT: Refactor device related code and add initial Intel GPU support by @notsyncing in #968
  • FEAT: Support gemma series model by @aresnow1 in #1024

Enhancements

Bug fixes

Documentation

  • DOC: Polish model operation command doc by @onesuper in #1000
  • DOC: Fix note on secret_key generation and algorithm selection for OAuth2 by @ChengjieLi28 in #1012

New Contributors

Full Changelog: v0.8.5...v0.9.0

v0.8.5

06 Feb 05:37
e903e05
Compare
Choose a tag to compare

What's new in 0.8.5 (2024-02-06)

These are the changes in inference v0.8.5.

New features

Enhancements

Bug fixes

Others

New Contributors

Full Changelog: v0.8.4...v0.8.5

v0.8.4

04 Feb 09:17
1b9b8c8
Compare
Choose a tag to compare

What's new in 0.8.4 (2024-02-04)

These are the changes in inference v0.8.4.

Enhancements

Bug fixes

Documentation

Full Changelog: v0.8.3.1...v0.8.4

v0.8.3.1

02 Feb 08:06
cfbe5ba
Compare
Choose a tag to compare

What's new in 0.8.3.1 (2024-02-02)

These are the changes in inference v0.8.3.1.

Bug fixes

Full Changelog: v0.8.3...v0.8.3.1

v0.8.3

02 Feb 07:14
749ef3f
Compare
Choose a tag to compare

What's new in 0.8.3 (2024-02-02)

These are the changes in inference v0.8.3.

New features

Enhancements

Bug fixes

Documentation

New Contributors

Full Changelog: v0.8.2...v0.8.3

v0.8.2

26 Jan 08:32
6fa3ee0
Compare
Choose a tag to compare

What's new in 0.8.2 (2024-01-26)

These are the changes in inference v0.8.2.

New features

Enhancements

Documentation

Full Changelog: v0.8.1...v0.8.2

v0.8.1

19 Jan 09:17
fb3985e
Compare
Choose a tag to compare

What's new in 0.8.1 (2024-01-19)

These are the changes in inference v0.8.1.

New features

Enhancements

Bug fixes

Documentation

  • DOC: Update the documentation about use of docker by @aresnow1 in #901
  • DOC:ADD FAQ IN troubleshooting.rst by @sisuad in #911

New Contributors

Full Changelog: v0.8.0...v0.8.1