
Conversation


@TianHao324 TianHao324 commented Nov 20, 2025

This PR adds support for the out_prod operator in the CANN backend for the F32 and F16 floating-point types.
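
For reference, out_prod is an outer-product operation; a minimal CPU sketch of the basic semantics follows (illustrative only, not the CANN implementation, and ggml's exact shape and broadcasting conventions are not reproduced here):

    #include <cstddef>
    #include <vector>

    // Outer product accumulated into dst: dst[i*N + j] += u[i] * v[j], with N = v.size().
    static void out_prod_ref(const std::vector<float> & u,
                             const std::vector<float> & v,
                             std::vector<float> & dst) {  // dst holds u.size() * v.size() floats
        for (std::size_t i = 0; i < u.size(); ++i) {
            for (std::size_t j = 0; j < v.size(); ++j) {
                dst[i * v.size() + j] += u[i] * v[j];
            }
        }
    }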

@noemotiovon noemotiovon added the Ascend NPU label (issues specific to Ascend NPUs) Nov 20, 2025
@TianHao324 TianHao324 changed the title from "cann supports out_prod operator for F32 and F16" to "CANN: supports out_prod operator for F32 and F16" Nov 20, 2025
@TianHao324 TianHao324 (Author) commented

Test result: [screenshot of test output attached]

@noemotiovon noemotiovon (Collaborator) left a comment

LGTM, just a minor issue.


const int64_t i12 = i2;
const int64_t i13 = i3;
aclTensor *accumulator = ggml_cann_create_tensor(

Collaborator:

The result of ggml_cann_create_tensor should be acl_tensor_ptr, not aclTensor*.
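
For illustration, the requested change would look roughly like this (a sketch only; it assumes acl_tensor_ptr is the smart-pointer wrapper returned by ggml_cann_create_tensor, and the call's arguments are elided rather than reproduced):

    // Sketch: acl_tensor_ptr owns the aclTensor, so no manual destruction is needed.
    acl_tensor_ptr accumulator = ggml_cann_create_tensor(/* ... same arguments as before ... */);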

Author replied (comment edited).

*/
void ggml_cann_out_prod(ggml_backend_cann_context & ctx, ggml_tensor * dst);

void ggml_cann_out_prod_fp(ggml_backend_cann_context & ctx, ggml_tensor * dst);

Collaborator:

No need.

Author replied (comment edited).

#include <aclnnop/aclnn_index_select.h>
#include <aclnnop/aclnn_clamp.h>
#include <aclnnop/aclnn_threshold.h>
#include <aclnnop/aclnn_ger.h>

Collaborator:

You should use
find ggml/src/ggml-cann -iname "*.cpp" -o -iname "*.h" | xargs clang-format -i
to format the code.

Author replied (comment edited).

#include <aclnnop/aclnn_index_select.h>
#include <aclnnop/aclnn_clamp.h>
#include <aclnnop/aclnn_threshold.h>
#include <aclnnop/aclnn_ger.h>

Collaborator:

You should use
find ggml/src/ggml-cann -iname "*.cpp" -o -iname "*.h" | xargs clang-format -i
to format the code.

Author replied (comment edited).

dst->nb,
2);

GGML_CANN_CALL_ACLNN_OP(ctx, InplaceZero, accumulator);

Collaborator:

Currently, InplaceZero is being called on each iteration of the for loop. I believe we can just call it once on dst before the loop.
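
A rough sketch of that restructuring (illustrative only; the single-argument ggml_cann_create_tensor overload, the .get() accessor, and the loop bounds are assumptions, not code from this PR):

    // Zero the destination once, before the loop, instead of zeroing the
    // accumulator on every iteration.
    acl_tensor_ptr acl_dst = ggml_cann_create_tensor(dst);
    GGML_CANN_CALL_ACLNN_OP(ctx, InplaceZero, acl_dst.get());

    for (int64_t i3 = 0; i3 < dst->ne[3]; i3++) {
        for (int64_t i2 = 0; i2 < dst->ne[2]; i2++) {
            // ... create per-slice views and accumulate the outer products into dst ...
        }
    }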

Author replied (comment edited).

@github-actions github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) Nov 20, 2025
@TianHao324 TianHao324 force-pushed the out_prod branch 3 times, most recently from 815e770 to 5d9578a on November 20, 2025 at 11:45
@noemotiovon noemotiovon (Collaborator) commented

Thank you for your contribution! :)


Labels

Ascend NPU (issues specific to Ascend NPUs), ggml (changes relating to the ggml tensor library for machine learning)

2 participants