-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
tests : add test-tokenizers-remote
testing
Everything test related
#13846
opened May 28, 2025 by
CISC
Loading…
convert : fix rwkv bos/eos token
python
python script changes
#13844
opened May 28, 2025 by
CISC
Loading…
ggml: aarch64: Implement SVE F32 kernels for vector functions
ggml
changes relating to the ggml tensor library for machine learning
#13843
opened May 28, 2025 by
vineelabhinav
Loading…
musa: enable fp16 mma (all) and cublas on qy2
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13842
opened May 28, 2025 by
yeahdongcn
Loading…
3 tasks done
gguf/utility: return full content on size < 0
python
python script changes
#13841
opened May 28, 2025 by
Beinsezii
Loading…
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat
ggml
changes relating to the ggml tensor library for machine learning
#13840
opened May 28, 2025 by
rmatif
Loading…
convert: small addition to support LlamaModel
python
python script changes
#13838
opened May 28, 2025 by
huydt84
Loading…
kv-cache : avoid modifying recurrent cells when setting inputs
#13834
opened May 27, 2025 by
compilade
Loading…
convert: add support for Japanese Bert model
python
python script changes
#13830
opened May 27, 2025 by
huydt84
Loading…
examples : support MiniCPM-V-2
examples
python
python script changes
#13828
opened May 27, 2025 by
guoQiNing
Loading…
sycl: quantize and reorder the input to q8_1 when reorder is enabled
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13826
opened May 27, 2025 by
AD2605
Loading…
ggml: improve ggml_backend_cuda_cpy_tensor_async
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13818
opened May 27, 2025 by
koush
Loading…
Add OPT model support - Add OPT architecture support in C++ code - Im…
python
python script changes
#13799
opened May 26, 2025 by
NoAmateur
Loading…
Add support for VK_EXT_debug_utils to add labels to Vulkan objects.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13792
opened May 26, 2025 by
mtavenrath
Loading…
server: args for draft model cache types (#11200)
examples
server
#13782
opened May 25, 2025 by
aa956
Loading…
ggml : add ggml_fill()
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#13772
opened May 25, 2025 by
ngxson
Loading…
Add comprehensive test for llama_batch/sbatch/ubatch concepts
testing
Everything test related
#13764
opened May 24, 2025 by
Zijie-Tian
•
Draft
convert : fix nomic-bert-moe mask token
python
python script changes
#13757
opened May 24, 2025 by
CISC
Loading…
SYCL: Add mrope kernel
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13755
opened May 24, 2025 by
qnixsynapse
Loading…
cmake : set Compilation issues
RPATH
to $ORIGIN
on Linux (#13740)
build
#13741
opened May 24, 2025 by
sunhaitao
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.