-
Notifications
You must be signed in to change notification settings - Fork 11.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Improve Chat Input with Auto-Sizing Textarea
examples
server
#12785
opened Apr 6, 2025 by
characharm
Loading…
vulkan: Use fp16 for the flash attention P*V multiplication
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12783
opened Apr 6, 2025 by
jeffbolznv
Loading…
ci: fix issue in android build(https://github.com/ggml-org/llama.cpp/issues/12638)
devops
improvements to build systems and github actions
#12775
opened Apr 6, 2025 by
zhouwg
Loading…
ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register
ggml
changes relating to the ggml tensor library for machine learning
#12773
opened Apr 5, 2025 by
SongXiaoXi
Loading…
sycl: remove unused min_compute_capability
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12768
opened Apr 5, 2025 by
jounjj
Loading…
cmake : enable curl by default
android
Issues specific to Android
build
Compilation issues
devops
improvements to build systems and github actions
examples
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#12761
opened Apr 4, 2025 by
ngxson
Loading…
opencl: better identify Adreno GPU
ggml
changes relating to the ggml tensor library for machine learning
#12760
opened Apr 4, 2025 by
lhez
Loading…
Added all CPU to Docker GPU images for 'token_embd.weight' compatibility
devops
improvements to build systems and github actions
#12749
opened Apr 4, 2025 by
rudiservo
Loading…
sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12734
opened Apr 3, 2025 by
zhouwg
Loading…
CANN: fix typo in ggml-cann
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12733
opened Apr 3, 2025 by
zhouwg
Loading…
sync : ggml
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
#12732
opened Apr 3, 2025 by
ggerganov
Loading…
CANN: Refactor to reduce duplicate code
ggml
changes relating to the ggml tensor library for machine learning
Update llama-quant.cpp llama_tensor_get_type with DeepSeek friendly modifications
ggml
changes relating to the ggml tensor library for machine learning
#12727
opened Apr 3, 2025 by
bartowski1182
Loading…
imatrix: add option to display importance score statistics for a given imatrix file
examples
#12718
opened Apr 2, 2025 by
EAddario
Loading…
Fix: Abnormal exit on Android devices
ggml
changes relating to the ggml tensor library for machine learning
#12712
opened Apr 2, 2025 by
biyou
Loading…
[RFC][WIP] Common: Add an Initial Chat Memory Interface/Implementation
examples
server
#12698
opened Apr 1, 2025 by
markhpc
Loading…
WIP: Add support for CogAgent
examples
python
python script changes
server
#12679
opened Mar 31, 2025 by
Tianyue-Zhao
•
Draft
update changes relating to the ggml tensor library for machine learning
rope_multi
:
ggml
#12665
opened Mar 31, 2025 by
foldl
Loading…
tts : implement sesame CSM + Mimi decoder
examples
python
python script changes
#12648
opened Mar 29, 2025 by
ngxson
Loading…
llama-server : implement universal assisted decoding
examples
server
#12635
opened Mar 28, 2025 by
g2mt
Loading…
opencl: remove a self-referential macro
ggml
changes relating to the ggml tensor library for machine learning
#12626
opened Mar 28, 2025 by
linehill
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.