-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
convert : register UMT5Model architecture for T5 conversion
python
python script changes
#17160
opened Nov 11, 2025 by
levkropp
Loading…
vulkan: change graph_compute to be async and enable get_tensor_async
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17158
opened Nov 10, 2025 by
jeffbolznv
Loading…
HIP: WMMA-MMQ kernels for RDNA 4
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17156
opened Nov 10, 2025 by
jiachengjason
•
Draft
llama.android : Rewrite Android binding
android
Issues specific to Android
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
#17152
opened Nov 10, 2025 by
hanyin-arm
Loading…
Install rpc-server when GGML_RPC is ON.
devops
improvements to build systems and github actions
examples
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#17149
opened Nov 10, 2025 by
nbp
Loading…
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17147
opened Nov 10, 2025 by
SavicStefan
Loading…
metal : make the FA extra sizes consistent
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17143
opened Nov 10, 2025 by
ggerganov
Loading…
Add complete Megrez-MoE support: GGUF conversion + inference.
model
Model specific
python
python script changes
#17141
opened Nov 10, 2025 by
tamarPal
Loading…
hexagon: various Op fixes
ggml
changes relating to the ggml tensor library for machine learning
#17135
opened Nov 10, 2025 by
max-krasnyansky
Loading…
vulkan: disable rms_norm + mul + rope for old gpus
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17134
opened Nov 10, 2025 by
netrunnereve
Loading…
SYCL: add full support for ABS unary op
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17126
opened Nov 9, 2025 by
shani-f
Loading…
llama: introduce support for model-embedded sampling parameters
python
python script changes
#17120
opened Nov 9, 2025 by
taronaeo
Loading…
rpc : fix alloc size logic
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17116
opened Nov 9, 2025 by
ggerganov
Loading…
2 tasks
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack
ggml
changes relating to the ggml tensor library for machine learning
#17113
opened Nov 8, 2025 by
NoahOksuz
Loading…
webui : add keyboard shortcut to toggle sidebar
examples
server
#17099
opened Nov 8, 2025 by
danbev
Loading…
Add Metal-4 Tensor API test harness for iOS
examples
#17098
opened Nov 8, 2025 by
ArjunDivecha
Loading…
CUDA: support F32 kernel type for changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
CONV_TRANSPOSE_2D
ggml
#17094
opened Nov 8, 2025 by
AgainstEntropy
Loading…
add version to all shared object files
examples
ggml
changes relating to the ggml tensor library for machine learning
#17091
opened Nov 7, 2025 by
furrysalamander
Loading…
HIP: RDNA4 tensor core support for MMF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17077
opened Nov 7, 2025 by
zhang-hui-yulo
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.