Releases: arthw/llama.cpp

b3151

15 Jun 02:55
f8ec887
ci : fix macos x86 build (#7940)

To keep the behavior of the old `macos-latest` runner, we should use `macos-12`.

May fix: https://github.com/ggerganov/llama.cpp/issues/6975

b3145

14 Jun 05:27
172c825
rpc : fix ggml_backend_rpc_supports_buft() (#7918)

b3014

28 May 03:16
852aafb
update HIP_UMA #7399 (#7414)

* update HIP_UMA #7399

Add use of hipMemAdviseSetCoarseGrain when LLAMA_HIP_UMA is enabled.
- Gives about 2x on prompt eval and 1.5x on token generation with ROCm 6.0 on a Ryzen 7940HX iGPU (780M/gfx1103)

* simplify code, more consistent style

---------

Co-authored-by: slaren <slarengh@gmail.com>

b2986

24 May 02:25
74f33ad
readme : remove trailing space (#7469)

b2953

21 May 09:15
917dc8c
Tokenizer SPM fixes for phi-3 and llama-spm (#7375)

* Update brute force test: special tokens
* Fix added tokens
  - Try to read 'added_tokens.json'.
  - Try to read 'tokenizer_config.json'.
  - Try to read 'tokenizer.json'.
* Fix special tokens rtrim

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* server : fix test regexes
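
The "Fix added tokens" item above lists three files that are tried in turn. A minimal sketch of that kind of lookup is shown here, assuming the usual Hugging Face layouts for these files; the function name `load_added_tokens` is hypothetical and this is not the code from the commit itself.

```python
import json
from pathlib import Path


def load_added_tokens(model_dir):
    """Collect added tokens as {content: id} from whichever files exist.

    Hypothetical helper for illustration. Later files override earlier
    ones when the same token content appears more than once.
    """
    model_dir = Path(model_dir)
    tokens = {}

    def read_json(name):
        # Return the parsed JSON file, or None if the file is missing.
        path = model_dir / name
        if not path.is_file():
            return None
        with open(path, encoding="utf-8") as f:
            return json.load(f)

    # Assumed layout of added_tokens.json: {"<token>": id, ...}
    data = read_json("added_tokens.json")
    if data:
        for content, token_id in data.items():
            tokens[content] = int(token_id)

    # Assumed layout of tokenizer_config.json:
    # {"added_tokens_decoder": {"<id>": {"content": "<token>", ...}}}
    data = read_json("tokenizer_config.json")
    if data:
        for token_id, entry in data.get("added_tokens_decoder", {}).items():
            tokens[entry["content"]] = int(token_id)

    # Assumed layout of tokenizer.json:
    # {"added_tokens": [{"id": id, "content": "<token>", ...}, ...]}
    data = read_json("tokenizer.json")
    if data:
        for entry in data.get("added_tokens", []):
            tokens[entry["content"]] = int(entry["id"])

    return tokens
```

Each file is optional in this sketch, so a model directory that ships only one of the three still yields its added tokens, and a directory with none of them yields an empty mapping.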

update_oneapi_2024.1-b2861-69a0609

12 May 05:24
update CI with oneapi 2024.1

add_oneapi_runtime-b2866-6cf75b2

12 May 12:18

add_oneapi_runtime-b2865-d2ca97b

12 May 08:41