Pull requests: ggml-org/llama.cpp

Pull requests list

tests : add test-tokenizers-remote [testing]
#13846 opened May 28, 2025 by CISC
llama : auto-batch [examples, server]
#13845 opened May 28, 2025 by ggerganov Draft
1 task
convert : fix rwkv bos/eos token [python]
#13844 opened May 28, 2025 by CISC
ggml: aarch64: Implement SVE F32 kernels for vector functions [ggml]
#13843 opened May 28, 2025 by vineelabhinav
musa: enable fp16 mma (all) and cublas on qy2 [ggml, Nvidia GPU]
#13842 opened May 28, 2025 by yeahdongcn
3 tasks done
gguf/utility: return full content on size < 0 [python]
#13841 opened May 28, 2025 by Beinsezii
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat [ggml]
#13840 opened May 28, 2025 by rmatif
convert: small addition to support LlamaModel [python]
#13838 opened May 28, 2025 by huydt84
convert: add support for Japanese Bert model [python]
#13830 opened May 27, 2025 by huydt84
examples : support MiniCPM-V-2 [examples, python]
#13828 opened May 27, 2025 by guoQiNing
sycl: quantize and reorder the input to q8_1 when reorder is enabled [ggml, SYCL]
#13826 opened May 27, 2025 by AD2605
Tokenize logging [examples]
#13821 opened May 27, 2025 by koichog
ggml: improve ggml_backend_cuda_cpy_tensor_async [ggml, Nvidia GPU]
#13818 opened May 27, 2025 by koush
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D [ggml, testing, Vulkan]
#13813 opened May 26, 2025 by etasnadi
Add OPT model support - Add OPT architecture support in C++ code - Im… [python]
#13799 opened May 26, 2025 by NoAmateur
Add support for VK_EXT_debug_utils to add labels to Vulkan objects. [ggml, Vulkan]
#13792 opened May 26, 2025 by mtavenrath
ggml : add ggml_fill() [ggml, testing]
#13772 opened May 25, 2025 by ngxson
Add comprehensive test for llama_batch/sbatch/ubatch concepts [testing]
#13764 opened May 24, 2025 by Zijie-Tian Draft
convert : fix nomic-bert-moe mask token [python]
#13757 opened May 24, 2025 by CISC
SYCL: Add mrope kernel [ggml, SYCL]
#13755 opened May 24, 2025 by qnixsynapse
kv-cache : simplify [examples, server]
#13746 opened May 24, 2025 by ggerganov Draft
4 of 5 tasks
cmake : set RPATH to $ORIGIN on Linux (#13740) [build]
#13741 opened May 24, 2025 by sunhaitao