-
Notifications
You must be signed in to change notification settings - Fork 12k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
llama : add RobertaForSequenceClassification reranker support
python
python script changes
#13875
opened May 28, 2025 by
CISC
Loading…
finetune.cpp command-line arg
examples
ggml
changes relating to the ggml tensor library for machine learning
#13873
opened May 28, 2025 by
graehl
Loading…
docs : add "Quick start" section for new users
documentation
Improvements or additions to documentation
#13862
opened May 28, 2025 by
ngxson
Loading…
CUDA: add a flag "GGML_CUDA_JETSON_DEVICE" for optimization
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13861
opened May 28, 2025 by
Yangxiaoz
Loading…
[WIP] model: add new model minimax-text-01
python
python script changes
#13857
opened May 28, 2025 by
qscqesze
Loading…
convert : allow partial update to the chkhsh pre-tokenizer list
python
python script changes
testing
Everything test related
#13847
opened May 28, 2025 by
ngxson
Loading…
tests : add test-tokenizers-remote
testing
Everything test related
#13846
opened May 28, 2025 by
CISC
Loading…
convert : fix rwkv bos/eos token
python
python script changes
#13844
opened May 28, 2025 by
CISC
Loading…
ggml: aarch64: Implement SVE F32 kernels for vector functions
ggml
changes relating to the ggml tensor library for machine learning
#13843
opened May 28, 2025 by
vineelabhinav
Loading…
musa: enable fp16 mma (all) and cublas on qy2
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13842
opened May 28, 2025 by
yeahdongcn
Loading…
3 tasks done
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat
ggml
changes relating to the ggml tensor library for machine learning
#13840
opened May 28, 2025 by
rmatif
Loading…
kv-cache : avoid modifying recurrent cells when setting inputs
#13834
opened May 27, 2025 by
compilade
Loading…
convert: add support for Japanese Bert model
python
python script changes
#13830
opened May 27, 2025 by
huydt84
Loading…
examples : support MiniCPM-V-2
examples
python
python script changes
#13828
opened May 27, 2025 by
guoQiNing
Loading…
sycl: quantize and reorder the input to q8_1 when reorder is enabled
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13826
opened May 27, 2025 by
AD2605
Loading…
ggml: improve ggml_backend_cuda_cpy_tensor_async
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13818
opened May 27, 2025 by
koush
Loading…
Add OPT model support - Add OPT architecture support in C++ code - Im…
python
python script changes
#13799
opened May 26, 2025 by
NoAmateur
Loading…
Add support for VK_EXT_debug_utils to add labels to Vulkan objects.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13792
opened May 26, 2025 by
mtavenrath
Loading…
server: args for draft model cache types (#11200)
examples
server
#13782
opened May 25, 2025 by
aa956
Loading…
ggml : add ggml_fill()
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#13772
opened May 25, 2025 by
ngxson
Loading…
Add comprehensive test for llama_batch/sbatch/ubatch concepts
testing
Everything test related
#13764
opened May 24, 2025 by
Zijie-Tian
•
Draft
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.