Skip to content

Releases: pwilkin/llama.cpp

b6710

08 Oct 06:46
74b8fc1

Choose a tag to compare

ggml webgpu: profiling, CI updates, reworking of command submission (…

b6688

04 Oct 12:54
898acba

Choose a tag to compare

rpc : add support for multiple devices (#16276)

* rpc : add support for multiple devices

Allow rpc-server to expose multiple devices from a single endpoint.
Change RPC protocol to include device identifier where needed.

closes: #15210

* fixes

* use ggml_backend_reg_t

* address review comments

* fix llama-bench backend report

* address review comments, change device naming

* fix cmd order

b6586

25 Sep 20:15
835b2b9

Choose a tag to compare

model : add GroveMoE support (#15510)

* add GroveMoE support

* remove constexpr that fails on certain compilers

* revert crude scalar div implementation, use cast

* build_attn_inp_kv_unified -> build_attn_inp_kv

* fix build_attn

* re-apply ffn_exps regex changes

b6585

25 Sep 17:32
b05a9d6

Choose a tag to compare

vendors: update miniaudio version (#16212)

* vendor: update miniaudio.h

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

* vendor: update miniaudio.h

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

---------

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

b6497

17 Sep 11:04
cd08fc3

Choose a tag to compare

common : Fix corrupted memory error on json grammar initialization (#…

b6381

04 Sep 13:13
c1c354e

Choose a tag to compare

CANN: Refactor ND to NZ workspace to be per-device (#15763)

* CANN:Refactor ND to NZ workspace to be per-device in Ascend backend

- Replaced the previous single global ND→NZ workspace with a per-device
  cache using unordered_map keyed by device ID.
- Functions `release_nz_workspace`, `relloc_nz_workspace`, and
  `get_nz_workspace` now manage workspace independently for each device,
  preventing memory conflicts in multi-device / pipeline parallel scenarios.
- This change fixes potential precision issues caused by workspace
  overwrites when multiple devices perform ND→NZ conversions concurrently.

Co-authored-by: hipudding <huafengchun@gmail.com>

* refactor

Signed-off-by: noemotiovon <757486878@qq.com>

* rename

Signed-off-by: noemotiovon <757486878@qq.com>

* fix review comments

Signed-off-by: noemotiovon <757486878@qq.com>

---------

Signed-off-by: noemotiovon <757486878@qq.com>
Co-authored-by: hipudding <huafengchun@gmail.com>

b6360

02 Sep 20:57
3de0082

Choose a tag to compare

fix: resolve unsigned int initialization warning for n_dims/size in g…

b6319

29 Aug 09:21
25f8378

Choose a tag to compare

Merge branch 'ggml-org:master' into master

b6247

22 Aug 10:29
0bc9df3

Choose a tag to compare

Merge branch 'ggml-org:master' into master

b5949

21 Jul 10:42
c82d48e

Choose a tag to compare

llama : fix `--reverse-prompt` crashing issue (#14794)

Signed-off-by: Molly Sophia <mollysophia379@gmail.com>