Skip to content

Releases: microsoft/Foundry-Local

Foundry Local Release 0.8.117

23 Dec 18:58
495b266

Choose a tag to compare

Pre-release

Foundry Local 0.8.117 Release Notes ๐Ÿš€

This release is an incremental build, targeting tool calling scenarios.

๐Ÿ› Bug fixes

#346 Tool calling doesn't return tool_calls results in streaming mode
#341 Exception when network is disconnected

๐Ÿ“ Known issues

#363 Tool calling fails on NVIDIA GPUs.

Foundry Local Release 0.8.115

12 Dec 22:53
fdcce52

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.8.115 ๐Ÿš€

This release is an incremental build targeting tool calling scenarios.

๐Ÿ› Bug fixes

#335 Guidance error when tool_choice=required
#336 Foundry Local enforcing "required" field of function parameters

๐Ÿ“ Known issues

#346 Tool calling doesn't return tool_calls results in streaming mode

Foundry Local Release 0.8.113

26 Nov 20:00
111b6cd

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.8.113 ๐Ÿš€

โœจ New Features

Add support for tool calling. Models that support tool calling have the supportsToolCalling tag, which is also exposed via the SDKs.

๐Ÿ› Bug fixes

Fix crash on context length exhaustion. CLI now exits when context length is exhausted and the REST API returns an error if the request requires more tokens than max_length configuration allows.

๐Ÿ“ Known issues

This release only allows one tool call per request.

Foundry Local Release 0.8.103

12 Nov 18:32
5366b90

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.8.103 ๐Ÿš€

๐Ÿ”จ Filter out automatic speech recognition models from foundry model list

These models can be listed using the /foundry/list endpoint and run using the standalone SDK

โญ Sign Up for Foundry Local SDK vNext Private Preview โ€“ Fill in form โญ

Foundry Local Release 0.8.101

07 Nov 01:18
87d50b7

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.8.101 ๐Ÿš€

โœจ New Features

Improve performance for multi-turn conversations on macOS, especially time to first token, with the addition of the continuous decoding feature. Only new tokens are sent to the model instead of the entire conversation. The previous inputs and responses are saved by the model in the KV-cache.

๐Ÿ“ Known issues

When the context length is exhausted (set by the max_length value), instead of showing a warning / error message, an exception is thrown

Foundry Local Release 0.8.94

22 Oct 22:11
11a3fb1

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.8.94 ๐Ÿš€

โœจ New Features

Improve performance for multi-turn conversations, especially time to first token, with the addition of the continuous decoding feature. Only new tokens are sent to the model instead of the entire conversation. The previous inputs and responses are saved by the model in the KV-cache.

Website showing full model list with hardware variants: https://foundrylocal.ai/models

๐Ÿ› Bug fixes

  • Foundry Local now defaults to --default-log-level instead of Information if --log-level is not provided. Foundry Local also elevates the level with which some errors were being written with from Information to Error.
  • #265
  • #263
  • #71

๐Ÿ“ Known issues

  • This version is not supported on macos. Please use the previous release for macos. Support coming soon!
  • If model is not found in the catalog, instead of showing a warning / suggestion message and gracefully exiting, an exception
    is thrown.
  • When the context length is exhausted (set by the max_length value), instead of showing a warning / error message, an exception is thrown

Foundry Local Release 0.7.120

01 Oct 16:51
992c4ee

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.7.120 ๐Ÿš€

โœจ New Features

Improvements to NPU accelerator (execution provider) download and registration user experience

๐Ÿ› Bug fixes

#257
#259
#263
#264

Foundry Local Release 0.7.117

23 Sep 18:20
992c4ee

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.7.117 ๐Ÿš€

โœจ New Features

  • Support for AMD and Intel NPUs, and more Qualcomm NPU models coming very soon
  • Pluggable execution providers, downloaded at runtime on Windows for AMD NPUs, Intel NPUs, NVIDIA GPUs, and Qualcomm NPUs
  • Filter models by device and provider

๐Ÿ› Bug fixes

Foundry Local 0.6.87 Release

05 Aug 17:00
73b82a3

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.6 ๐Ÿš€

โœจ New Features

Support new OAI-OSS-20B model!

limitation: only NVidia GPU with 16GB VRAM currently supported. Other GPU and macOS support coming soon.

9/12/2025: updating python SDK to fix compatibility issue.

Foundry Local 0.5.117 Release

30 Jul 01:03
37e6ead

Choose a tag to compare

Pre-release

Foundry Local Release Notes: v0.5 ๐Ÿš€

This release introduces several new features, bug fixes, and improvements to enhance your experience with Foundry Local.

โœจ New Features

๐Ÿ› Bug Fixes

#144 Streaming chunks have the same id
#162 Foundry Local supports BYOM (Bring your own model) correctly again.

  • Fixed an empty prompt bug
  • Fixed a buffer overrun bug

โš™๏ธ Improvements

Improvements to and expansion of logging and telemetry

Notes:

  • The previous SDK packages are still the latest ones. Install them using: pip install foundry-local sdk or npm install foundry-local-sdk