-
Notifications
You must be signed in to change notification settings - Fork 1
Pull requests: auroralabs-loci/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
UPSTREAM PR #18755: Kimi-Linear support (backend agnostic + MLA KV cache)
#1087
opened Jan 31, 2026 by
loci-dev
Loading…
UPSTREAM PR #19141: [WIP]ggml-hexagon: flash-attn opt - part2
#1086
opened Jan 31, 2026 by
loci-dev
Loading…
UPSTREAM PR #19167: model : support LongCat-Flash-Lite (ngram embeddings)
#1085
opened Jan 31, 2026 by
loci-dev
Loading…
UPSTREAM PR #19209: ggml-cpu: split across kv for faster TG
#1084
opened Jan 31, 2026 by
loci-dev
Loading…
UPSTREAM PR #19218: nix: fix nix develop .#python-scripts
#1082
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19209: ggml-cpu: split across kv for faster TG
#1081
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19205: Add missing unordered_map include to jinja/value.h
#1080
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19206: metal: convert shutdown assertion to warning log
#1079
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19196: ggml-cpu: optimize q4_0_q8_0 scales using Zvfhmin
#1077
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19194: Correctly fetch q8_1 quantize pipeline in test as needed by 8a3519b
#1075
opened Jan 30, 2026 by
loci-dev
Loading…
UPSTREAM PR #19183: [RFC] implementing Expected Attention in llama.cpp?
#1074
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19186: add tensor type checking as part of cuda graph properties
#1073
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19182: model: support Longcat-Flash (help wanted)
#1072
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19178: Finish “TODO: avoid using atexit() here by making
console a singleton”
#1071
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19176: jinja : do not pass empty tools and add some none filters
#1070
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19132: ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k Kernel
#1069
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19168: build(cuda): Add warning for CUDA 13.0 Blackwell compiler bug
#1067
opened Jan 29, 2026 by
loci-dev
Loading…
UPSTREAM PR #19167: model : support LongCat-Flash-Lite (ngram embeddings)
#1066
opened Jan 28, 2026 by
loci-dev
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:overlay.