-
Notifications
You must be signed in to change notification settings - Fork 13.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
opencl: transposed gemm/gemv moe kernel with mxfp4,f32
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#16602
opened Oct 15, 2025 by
shawngu-quic
Loading…
fix: added a normalization step for MathJax-style \[\] and \(\) delimiters
examples
server
#16599
opened Oct 15, 2025 by
ServeurpersoCom
Loading…
fix(ggml): use safe character conversion in ggml_fopen on Windows
ggml
changes relating to the ggml tensor library for machine learning
#16589
opened Oct 15, 2025 by
sirus20x6
Loading…
llama-model: fix insonsistent ctxs <-> bufs order
#16581
opened Oct 14, 2025 by
JohannesGaessler
Loading…
mtmd: Add JinaCLIP v2 vision projector + GGUF support for jina-bert-v3 (merged-LoRA or adapter)
examples
python
python script changes
#16574
opened Oct 14, 2025 by
pockers21
Loading…
ggml-cpu: build fails with changes relating to the ggml tensor library for machine learning
-Werror=discarded-qualifiers
ggml
#16573
opened Oct 14, 2025 by
otegami
Loading…
extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic
examples
server
#16563
opened Oct 13, 2025 by
hanishkvc
Loading…
webui: introduce OpenAI-compatible model selector in JSON payload
examples
server
#16562
opened Oct 13, 2025 by
ServeurpersoCom
Loading…
embeddings: Fix --log-disable should not suppress embedding outputs
examples
#16561
opened Oct 13, 2025 by
cduk
Loading…
Implement and use cuda graph plans
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16548
opened Oct 13, 2025 by
wishstudio
Loading…
Add experimental ggml-hexagon backend for the Hexagon NPU
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
#16547
opened Oct 13, 2025 by
max-krasnyansky
•
Draft
tests: increase NMSE threshold for q5_1 MUL_MAT tests
testing
Everything test related
#16544
opened Oct 12, 2025 by
Erics38
Loading…
Add https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
CONV_TRANSPOSE_2D
for Metal
Apple Metal
#16542
opened Oct 12, 2025 by
iliailmer
Loading…
1 task done
embedding: add raw option for --embd-output-format
examples
#16541
opened Oct 12, 2025 by
SamMalayek
Loading…
chat: add defensive IBM Granite Jinja compatibility (<tool_call> and <|tool_call|> support)
#16537
opened Oct 12, 2025 by
ServeurpersoCom
•
Draft
Update close-issue.yml
devops
improvements to build systems and github actions
#16535
opened Oct 12, 2025 by
barneysspeedshop
•
Draft
server: add /slots/status endpoint for secure monitoring
examples
python
python script changes
server
#16534
opened Oct 12, 2025 by
Roshankumarb31
Loading…
metal: add support for LOG op (f32, f16)
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16530
opened Oct 12, 2025 by
RD-zhang1234
Loading…
Leverage the existing GGML_F32_VEC helpers to vectorize ggml_vec_set_f32 for faster fills
ggml
changes relating to the ggml tensor library for machine learning
#16522
opened Oct 11, 2025 by
sirus20x6
Loading…
Switch to using Ubuntu 25.10 vulkan/mesa
devops
improvements to build systems and github actions
#16497
opened Oct 10, 2025 by
ericcurtin
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.