Pull requests: ggerganov/llama.cpp

Pull requests list

rpc-server : add support for the SYCL backend
Labels: examples
#10934 opened Dec 21, 2024 by rgerganov
llama : the WPM vocabs use the CLS token as BOS
#10930 opened Dec 21, 2024 by ggerganov
Allow user to compile with any cuda version using github actions
Labels: devops (improvements to build systems and github actions)
#10928 opened Dec 21, 2024 by jianlins
vulkan: build fixes for 32b
Labels: ggml (changes relating to the ggml tensor library for machine learning); Vulkan (Issues specific to the Vulkan backend)
#10927 opened Dec 21, 2024 by jeffbolznv
server : add system_fingerprint to chat/completion
Labels: examples; python (python script changes); server
#10917 opened Dec 20, 2024 by ngxson
llamafile_sgemm API - INT8 implementation
Labels: ggml (changes relating to the ggml tensor library for machine learning); testing (Everything test related)
#10912 opened Dec 20, 2024 by amritahs-ibm
llama : refactor src/llama.cpp
#10902 opened Dec 19, 2024 by ggerganov Draft
llama : add support for Cohere2ForCausalLM
Labels: python (python script changes)
#10900 opened Dec 19, 2024 by dranger003
ASCII/Romanization for OuteTTS Multilingual Processing
Labels: demo (Demonstrate some concept or idea, not intended to be merged); examples
#10894 opened Dec 19, 2024 by edwko
Support InfiniAI Megrez 3b
Labels: python (python script changes); testing (Everything test related)
#10893 opened Dec 19, 2024 by dixyes
Add Falcon3 support and Fix issue #10875
Labels: python (python script changes)
#10883 opened Dec 18, 2024 by mokeddembillel
llama: Ensure KV cache is fully defragmented.
#10873 opened Dec 17, 2024 by jessegross
SYCL: Fixes for building SYCL backend for AMD GPUs
Labels: documentation (Improvements or additions to documentation); ggml (changes relating to the ggml tensor library for machine learning); SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language)
#10851 opened Dec 16, 2024 by lhl
vulkan: multi-row k quants
Labels: ggml (changes relating to the ggml tensor library for machine learning); Vulkan (Issues specific to the Vulkan backend)
#10846 opened Dec 16, 2024 by netrunnereve
Fix compilation on Pop!_OS 22.04 LTS CUDA
Labels: ggml (changes relating to the ggml tensor library for machine learning); Nvidia GPU (Issues specific to Nvidia GPUs)
#10835 opened Dec 15, 2024 by mika314
added docker-multi-stage builds
Labels: devops (improvements to build systems and github actions)
#10832 opened Dec 14, 2024 by rudiservo
add ggml_backend_sched_dump_dot
Labels: ggml (changes relating to the ggml tensor library for machine learning)
#10825 opened Dec 14, 2024 by foldl
Bamba architecture
Labels: Apple Metal (https://en.wikipedia.org/wiki/Metal_(API)); ggml (changes relating to the ggml tensor library for machine learning); python (python script changes); testing (Everything test related)
#10810 opened Dec 12, 2024 by gabe-l-hart Draft
3 tasks
musa: fix aarch64 build
Labels: build (Compilation issues); documentation (Improvements or additions to documentation); ggml (changes relating to the ggml tensor library for machine learning); Nvidia GPU (Issues specific to Nvidia GPUs)
#10781 opened Dec 11, 2024 by BodhiHu
server: bench: minor fixes
Labels: examples; performance (Speed related topics); python (python script changes); server
#10765 opened Dec 10, 2024 by phymbert Draft
Cuda build doc
Labels: documentation (Improvements or additions to documentation)
#10743 opened Dec 10, 2024 by YannFollet