issue/843: Add the per_channel_quant_int8 operator on QY and NVIDIA by xgqdut2016 · Pull Request #855 · InfiniTensor/InfiniCore

xgqdut2016 · 2025-12-26T05:13:36Z

To test w8a8, run: xmake clean && xmake f --nv-gpu=true --cuda_arch=sm_90a --cutlass=true -cv && xmake build && xmake install && python test/infiniop/w8a8int8.py --nvidia

src/infiniop/ops/quant/per_channel_quant_int8/per_channel_quant_int8.h

whjthu · 2026-01-28T01:20:57Z

src/infiniop/ops/quant/per_channel_quant_int8/per_channel_quant_int8.h

@@ -0,0 +1,40 @@
+#ifndef __QUANT_H__


This guard name is too simple and does not follow the usual macro-definition conventions for header files.
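As an illustration of the point above, a guard derived from the file name is less likely to collide with another header than a generic `__QUANT_H__`. The sketch below is only an assumption about what a more specific guard could look like: the macro name `PER_CHANNEL_QUANT_INT8_H` and the placeholder function are hypothetical and are not taken from the PR or from the project's style guide. Note also that identifiers containing a double underscore are reserved for the implementation in C++, which is one more reason to pick a different spelling.

```cpp
// Hypothetical guard name derived from the file name per_channel_quant_int8.h.
// The project's actual naming convention may differ.
#ifndef PER_CHANNEL_QUANT_INT8_H
#define PER_CHANNEL_QUANT_INT8_H

// Placeholder declaration so the guard has something to protect.
// This function is illustrative only and does not exist in the PR.
inline int perChannelQuantInt8GuardDemo() { return 1; }

#endif // PER_CHANNEL_QUANT_INT8_H
```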

whjthu · 2026-01-28T01:23:48Z

src/infiniop/ops/quant/per_channel_quant_int8/operator.cc

+            x_zero_desc,                                                                      \
+            x_desc);
+    switch (handle->device) {
+#ifdef ENABLE_NVIDIA_API


This looks a bit odd: why is an ifdef still needed when there is already a switch?
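For context on the question, one common reason to combine the two is that the `switch` selects a backend at run time, while the `#ifdef` removes the `case` for any backend that was not compiled in, so the file still builds without that backend's headers and symbols. The sketch below shows that pattern under stated assumptions: the `Device` enum, the `dispatch` function, and the `ENABLE_QY_API` macro are hypothetical names for illustration (only `ENABLE_NVIDIA_API` appears in the diff), and it is not the code from `operator.cc`.

```cpp
#include <string>

// Hypothetical device enum, for illustration only.
enum class Device { CPU, NVIDIA, QY };

// Pretend the NVIDIA backend was enabled at build time.
// In a real build this would come from the build system, not from the source file.
#define ENABLE_NVIDIA_API

// Run-time dispatch over the device, with compile-time pruning of disabled backends.
inline std::string dispatch(Device device) {
    switch (device) {
#ifdef ENABLE_NVIDIA_API
    case Device::NVIDIA:
        return "nvidia"; // only compiled when the NVIDIA backend is enabled
#endif
#ifdef ENABLE_QY_API // hypothetical macro name; not defined here, so this case is dropped
    case Device::QY:
        return "qy";
#endif
    default:
        return "unsupported"; // disabled or unknown backends fall through to here
    }
}
```

With this layout, a device whose backend was not compiled in falls through to the `default` branch instead of causing a link error.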

whjthu · 2026-01-28T01:25:21Z

src/infiniop/ops/quant/per_channel_quant_int8/nvidia/per_channel_quant_int8_nvidia.cu

+            blockPerChannelQuantI8<Tdata, BLOCK_SIZE>
+                <<<M, BLOCK_SIZE, 0, stream>>>(x_packed, x_scale, x_zero, x, M, K);
+        }
+
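The launch configuration above uses `M` blocks of `BLOCK_SIZE` threads, which suggests one block per row of an `M` x `K` input, and the `x_scale` and `x_zero` outputs suggest an asymmetric scheme with one scale and one zero point per row. The thread does not show the kernel body, so the following CPU reference is only a sketch of a common asymmetric int8 scheme, not the formula used in this PR. The struct, the function name, and the rounding and clamping choices are all assumptions.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CPU reference; NOT the kernel from this PR.
struct QuantResult {
    std::vector<int8_t> x_packed; // M x K quantized values
    std::vector<float> x_scale;   // one scale per row
    std::vector<float> x_zero;    // one zero point per row
};

// Quantizes each row of a row-major M x K matrix to int8 with its own scale and zero point.
inline QuantResult perRowQuantInt8Reference(const std::vector<float> &x, size_t M, size_t K) {
    assert(K > 0 && x.size() == M * K);
    QuantResult out{std::vector<int8_t>(M * K), std::vector<float>(M), std::vector<float>(M)};
    for (size_t row = 0; row < M; ++row) {
        const float *src = x.data() + row * K;
        const auto mm = std::minmax_element(src, src + K);
        const float lo = *mm.first;
        const float hi = *mm.second;
        // Map the row range [lo, hi] onto the 256 int8 levels (assumed formula).
        float scale = (hi - lo) / 255.0f;
        if (scale == 0.0f) {
            scale = 1.0f; // constant row: avoid division by zero
        }
        // Zero point chosen so that lo maps close to -128 (assumed formula).
        const float zero = std::round(-128.0f - lo / scale);
        for (size_t k = 0; k < K; ++k) {
            long q = std::lround(src[k] / scale + zero);
            q = std::max(-128L, std::min(127L, q)); // clamp to the int8 range
            out.x_packed[row * K + k] = static_cast<int8_t>(q);
        }
        out.x_scale[row] = scale;
        out.x_zero[row] = zero;
    }
    return out;
}
```

Under these assumptions a value is recovered as `(q - zero) * scale`, and a reference like this can be used to check a GPU result to within about one quantization step per element.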


xgqdut2016 requested a review from a team December 26, 2025 05:13

xgqdut2016 force-pushed the issue/843 branch from 73968c3 to 4543fa1 Compare December 29, 2025 08:17

xgqdut2016 assigned whjthu Jan 4, 2026

pengcheng888 reviewed Jan 4, 2026


src/infiniop/ops/quant/per_channel_quant_int8/per_channel_quant_int8.h (outdated)

xgqdut2016 changed the base branch from main to dev January 5, 2026 01:54

xgqdut2016 changed the base branch from dev to main January 7, 2026 02:22

xgqdut2016 requested a review from whjthu January 7, 2026 07:28

issue/843: Add the per_channel_quant_int8 operator

05aafb2

xgqdut2016 force-pushed the issue/843 branch from 15f829a to 05aafb2 Compare January 9, 2026 02:14

xgqdut2016 added 2 commits January 12, 2026 15:36

issue/843: modified w8a8int8.py parameters

b70e968

issue/843: modified x_scale datatype

544665b

whjthu requested changes Jan 28, 2026


xgqdut2016 wants to merge 3 commits into main from issue/843

3 participants
