Pull requests: ggml-org/llama.cpp
Pull requests list
convert : improve model arch handling
python
python script changes
#13122
opened Apr 26, 2025 by
ngxson
sycl : Implemented reorder Q4_K mmvq
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13109
opened Apr 25, 2025 by
sgeor255
1 task
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs
ggml
changes relating to the ggml tensor library for machine learning
#13107
opened Apr 25, 2025 by
SongXiaoXi
ggml-backend : add load_tensor() to backend API
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
ggml
changes relating to the ggml tensor library for machine learning
#13104
opened Apr 25, 2025 by
bachelor-dou
fix wrong template in GLM4-0414
python
python script changes
#13099
opened Apr 24, 2025 by
matteoserva
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
ggml
changes relating to the ggml tensor library for machine learning
#13079
opened Apr 23, 2025 by
SongXiaoXi
SYCL: Add all missing unary kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13074
opened Apr 23, 2025 by
qnixsynapse
Reduce enum sizes some are used in structs, which allowed them to be optimized.
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13071
opened Apr 22, 2025 by
GermanAizek
fix(rpc): Improve input validation and error handling
ggml
changes relating to the ggml tensor library for machine learning
#13069
opened Apr 22, 2025 by
thevilledev
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Update README.md for tts example to use afplay on MacOS
examples
#13056
opened Apr 22, 2025 by
maxxam1221
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
ggml
changes relating to the ggml tensor library for machine learning
#13053
opened Apr 21, 2025 by
eddnjjn
[CANN]Support OP MUL_MAT_ID
ggml
changes relating to the ggml tensor library for machine learning
#13042
opened Apr 21, 2025 by
noemotiovon
gguf-py : avoid requiring PySide6 for packaged scripts
bugfix
fixes an issue or bug
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
#13036
opened Apr 20, 2025 by
compilade
quantize: improve pattern matching for allowed tensors
examples
#13033
opened Apr 20, 2025 by
EAddario
Bitnet: directly use scale instead of inverting it twice
python
python script changes
#13026
opened Apr 19, 2025 by
viraatdas
Nix portability improvements
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#13005
opened Apr 18, 2025 by
hacker1024