Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

sync : vendor examples python python script changes script Script related server testing Everything test related
#13901 opened May 29, 2025 by ggerganov Loading…
ci(intel): venv for python & pip installation for intel docker devops improvements to build systems and github actions
#13898 opened May 29, 2025 by Thammachart Loading…
CUDA: add a prop in ggml_cuda_device_infor for distinguish iGPU or dGPU in cuda #13856 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13895 opened May 29, 2025 by Yangxiaoz Loading…
Need to undefine "hz" on AIX examples
#13894 opened May 29, 2025 by mehendarkarprajwal Loading…
ggml-cpu : split arch-specific implementations ggml changes relating to the ggml tensor library for machine learning
#13892 opened May 29, 2025 by xctan Draft
cmake: Guard GGML_CPU_ALL_VARIANTS by architecture ggml changes relating to the ggml tensor library for machine learning
#13890 opened May 29, 2025 by ckastner Loading…
[WIP] model: add new model minimax-text-01 python python script changes
#13889 opened May 29, 2025 by qscqesze Draft
Try to Optimize ppc Fp32 tiny blas kernels ggml changes relating to the ggml tensor library for machine learning
#13888 opened May 29, 2025 by shalinib-ibm Loading…
musa: extract ggml_cuda_mul_mat_batched_cublas_gemm_batched_ex ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13887 opened May 29, 2025 by yeahdongcn Loading…
3 tasks done
sycl: Add reorder to Q6_K mmvq implementation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13885 opened May 29, 2025 by s-Nick Loading…
finetune.cpp command-line arg examples ggml changes relating to the ggml tensor library for machine learning
#13873 opened May 28, 2025 by graehl Loading…
docs : add "Quick start" section for new users documentation Improvements or additions to documentation
#13862 opened May 28, 2025 by ngxson Loading…
convert : allow partial update to the chkhsh pre-tokenizer list python python script changes testing Everything test related
#13847 opened May 28, 2025 by ngxson Loading…
tests : add test-tokenizers-remote testing Everything test related
#13846 opened May 28, 2025 by CISC Loading…
llama : auto-batch examples server
#13845 opened May 28, 2025 by ggerganov Draft
1 task
convert : fix rwkv bos/eos token python python script changes
#13844 opened May 28, 2025 by CISC Loading…
musa: enable fp16 mma (all) and cublas on qy2 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13842 opened May 28, 2025 by yeahdongcn Loading…
3 tasks done
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat ggml changes relating to the ggml tensor library for machine learning
#13840 opened May 28, 2025 by rmatif Loading…
convert: add support for Japanese Bert model python python script changes
#13830 opened May 27, 2025 by huydt84 Loading…
examples : support MiniCPM-V-2 examples python python script changes
#13828 opened May 27, 2025 by guoQiNing Loading…
sycl: quantize and reorder the input to q8_1 when reorder is enabled ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13826 opened May 27, 2025 by AD2605 Loading…
Tokenize logging examples
#13821 opened May 27, 2025 by koichog Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.