Skip to content

Releases: ngxson/llama.cpp

b6670

02 Oct 14:05
91a2a56

Choose a tag to compare

musa: update compile flags (#16265)

Signed-off-by: Xiaodong Ye <[email protected]>

b6666

02 Oct 08:01
2be72c2

Choose a tag to compare

SYCL: Update to oneAPI 2025.2 (#16371)

* update oneapi to 2025.2, use deep-learning-essentials to replace base-tool

* update to 2025.2 use deeplearn essi to replace base toolkit

* add missed dll

* add deep learning essentials

* add sycl-ls

---------

Co-authored-by: Zhang Jianyu <[email protected]>

b6663

01 Oct 22:01
e95fec6

Choose a tag to compare

HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.…

b6661

01 Oct 18:46
1fe4e38

Choose a tag to compare

ci: Properly install rocwmma for hip builds (#16305)

* CI: Properly install rocwmma for hip builds

on windows we now windows install rocwmma from ubuntu pacakges

* CI: update linux rocm docker build to use rocm 7.0

b6660

01 Oct 17:46
4201dea

Choose a tag to compare

common: introduce http.h for httplib-based client (#16373)

* common: introduce http.h for httplib-based client

This change moves cpp-httplib based URL parsing and client setup into
a new header `common/http.h`, and integrates it in `arg.cpp` and `run.cpp`.

It is an iteration towards removing libcurl, while intentionally
minimizing changes to existing code to guarantee the same behavior when
`LLAMA_CURL` is used.

Signed-off-by: Adrien Gallouët <[email protected]>

* tools : add missing WIN32_LEAN_AND_MEAN

Signed-off-by: Adrien Gallouët <[email protected]>

---------

Signed-off-by: Adrien Gallouët <[email protected]>
Signed-off-by: Adrien Gallouët <[email protected]>

b6653

30 Sep 20:52
e74c92e

Choose a tag to compare

model : support GLM 4.6 (make a few NextN/MTP tensors not required) (…

b6651

30 Sep 18:35
bf6f3b3

Choose a tag to compare

common : disable progress bar without a tty (#16352)

* common : disable progress bar without a tty

Signed-off-by: Adrien Gallouët <[email protected]>

* Add missing headers

Signed-off-by: Adrien Gallouët <[email protected]>

---------

Signed-off-by: Adrien Gallouët <[email protected]>

b6650

30 Sep 18:42
7c156df

Choose a tag to compare

opencl: support pad_ext (#15888)

b6648

30 Sep 18:07
8d78cd2

Choose a tag to compare

ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187)

* Work on rope

* Simplify inplace operation generation and combine mul/add generation

* Work on rope variants

* implement neox rope

* rope complete

* Add sub,div,glu operators

* implement scale op

* Update cpy shader to handle cont/more types

* formatting

* Update test vars printing for rope,rms_norm

* Avoid ROPE hardcoded constants

* Add TODO to change ROPE constants to enum

Co-authored-by: Georgi Gerganov <[email protected]>

* fix TODO comment

---------

Co-authored-by: Georgi Gerganov <[email protected]>

b6647

30 Sep 17:42
d1c84a6

Choose a tag to compare

opencl: support ne3 in get_rows (#15866)