Skip to content

b6770

Choose a tag to compare

@github-actions github-actions released this 15 Oct 14:36
f4ce81c
metal: optimise `GGML_OP_SUM` (#16559)

* optimise GGML_OP_SUM

* add non-contiguous tests by permuting the input

* change tests to require full contiguity of OP_SUM

* cuda : add check GGML_OP_SUM

---------

Co-authored-by: Georgi Gerganov <[email protected]>