-
Notifications
You must be signed in to change notification settings - Fork 16.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
CUDA: also store
node->src->data ptrs for equality check
#21635
opened Apr 8, 2026 by
am17an
Loading…
common : skip non-primary GGUF split files when selecting model
#21633
opened Apr 8, 2026 by
angt
Loading…
ci: fix labeler.yml v6 syntax — drop
all: composition
#21627
opened Apr 8, 2026 by
Marxist-Leninist
Loading…
SYCL: fix reorder crash when device memory is full
#21618
opened Apr 8, 2026 by
PMZFX
Loading…
5 tasks done
gemma4: derive attn_soft_cap from GGUF instead of hardcoding
#21613
opened Apr 8, 2026 by
stephencox-ict
•
Draft
vulkan: unify type macros to use Vx instead of _VECx
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#21605
opened Apr 8, 2026 by
0cc4m
Loading…
quant: force Q6_K minimum for Gemma4 tied embeddings
#21599
opened Apr 8, 2026 by
stephencox-ict
•
Draft
6 tasks done
SYCL: fix multi-GPU system RAM exhaustion by using Level Zero allocations
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#21597
opened Apr 8, 2026 by
PMZFX
Loading…
7 tasks done
opencl: add q5_K gemm and gemv kernels for Adreno
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#21595
opened Apr 8, 2026 by
shaofeiqi
Loading…
common/sampling: reset reasoning budget sampler state between generations
#21594
opened Apr 8, 2026 by
cnsiva
Loading…
opencl: add basic support for q5_k
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#21593
opened Apr 7, 2026 by
shaofeiqi
Loading…
vocab : fix Gemma 4 BPE tokenizer SIGSEGV on long prompts without newlines
examples
server
testing
Everything test related
#21587
opened Apr 7, 2026 by
prue-starfield
Loading…
fix: support non-ASCII (Unicode) file paths on Windows
ggml
changes relating to the ggml tensor library for machine learning
#21583
opened Apr 7, 2026 by
lekot
Loading…
3 tasks done
SYCL: add BF16 to DMMV kernel path (~4x tg speedup on Intel Arc)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#21580
opened Apr 7, 2026 by
PMZFX
Loading…
webui: add "Send message on Enter" setting
examples
server
#21577
opened Apr 7, 2026 by
mourix
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.