auto-round-kernel installation method #1221

chensuyue · 2026-01-05T08:10:19Z

This pull request introduces significant improvements to the AutoRound Kernel integration, including enabling and documenting the kernel backend, updating requirements and installation methods, and enhancing test coverage. It also refactors the ARK QLinear implementation for better device and dtype handling. The most important changes are summarized below:

AutoRound Kernel Integration and Documentation:

Added a comprehensive README.md for AutoRound Kernel, detailing supported hardware, quantization configurations, versioning, and installation instructions.
Introduced a new installation script install_kernel.py to automatically detect the PyTorch version and install the appropriate kernel version, and registered a new CLI command auto-round-kernel-install. [1] [2]

Backend Configuration and Requirements:

Enabled and registered the auto_round_kernel, auto_round_kernel_zp, and auto_round_kernel_awq backends for CPU (previously commented out), and updated their PyTorch version requirements to torch>=2.8.0 for broader compatibility. [1] [2] [3]
Removed the "kernel" extra from setup.py to simplify dependency management.

ARK QLinear Refactoring:

Refactored ark/qlinear.py to instantiate ARK via auto_round_kernel.ARK(), improved dtype/device handling, unified bias and input dtype logic, and switched to using woqgemm for computation. [1] [2] [3]

Testing Improvements:

Expanded test coverage in test_model.py to include CPU devices for all relevant test cases, and re-enabled tests for additional quantization configurations.

Signed-off-by: chensuyue <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: chensuyue <[email protected]>

auto_round_extension/ark/install_kernel.py

Signed-off-by: chensuyue <[email protected]>

auto_round/inference/backend.py

auto_round_extension/ark/README.md

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Signed-off-by: chensuyue <[email protected]>

This reverts commit bf49e79.

chensuyue · 2026-01-15T06:51:11Z

Test on CPU must run with OMP_NUM_THREADS and numactl, e.g. OMP_NUM_THREADS=32 numactl -C "0-31" pytest -v test_ark/test_model.py.
Test verified locally on GNR and BMG.
Keep ark CI closed, because test failed on CI machine (EMR), debugging WIP.

chensuyue added 5 commits January 5, 2026 15:46

add ark install scripts and README.md

32f8707

Signed-off-by: chensuyue <[email protected]>

fix format

cfa02f4

Signed-off-by: chensuyue <[email protected]>

minor update

862001f

Signed-off-by: chensuyue <[email protected]>

add kernel-install in auto-round setup

a42cb67

Signed-off-by: chensuyue <[email protected]>

minor update

a43ae35

Signed-off-by: chensuyue <[email protected]>

chensuyue force-pushed the suyue/ark_install branch from e3ed93b to a43ae35 Compare January 5, 2026 08:12

pre-commit-ci bot and others added 2 commits January 5, 2026 08:14

[pre-commit.ci] auto fixes from pre-commit.com hooks

ae0cd60

for more information, see https://pre-commit.ci

format update

4039add

Signed-off-by: chensuyue <[email protected]>

chensuyue force-pushed the suyue/ark_install branch from 2970188 to 4039add Compare January 5, 2026 08:18

chensuyue added 2 commits January 5, 2026 16:28

fix issue

52be92a

Signed-off-by: chensuyue <[email protected]>

update ark install cmd

c816e9b

Signed-off-by: chensuyue <[email protected]>

chensuyue marked this pull request as ready for review January 5, 2026 14:27

chensuyue requested review from luoyu-intel and wenhuach21 January 5, 2026 14:27

wenhuach21 reviewed Jan 6, 2026

View reviewed changes

auto_round_extension/ark/install_kernel.py Show resolved Hide resolved

wenhuach21 approved these changes Jan 6, 2026

View reviewed changes

chensuyue and others added 4 commits January 6, 2026 11:36

update readme

0d4db1b

Signed-off-by: chensuyue <[email protected]>

minor update

6f7b49e

Signed-off-by: chensuyue <[email protected]>

minor update

786ef4a

Signed-off-by: chensuyue <[email protected]>

Update the usage of new ARK functions (#1224)

f47974e

wenhuach21 reviewed Jan 6, 2026

View reviewed changes

auto_round/inference/backend.py Outdated Show resolved Hide resolved

luoyu-intel added 2 commits January 7, 2026 07:28

update MD

053584f

revert ipex; add windows for ark

a799336

hshen14 reviewed Jan 7, 2026

View reviewed changes

auto_round_extension/ark/README.md Outdated Show resolved Hide resolved

hshen14 reviewed Jan 7, 2026

View reviewed changes

auto_round_extension/ark/README.md Show resolved Hide resolved

hshen14 reviewed Jan 7, 2026

View reviewed changes

auto_round_extension/ark/README.md Outdated Show resolved Hide resolved

hshen14 reviewed Jan 7, 2026

View reviewed changes

auto_round_extension/ark/README.md Outdated Show resolved Hide resolved

use Algorithm

9b1a4e9

chensuyue added this to the 0.9.5 milestone Jan 8, 2026

Copilot AI review requested due to automatic review settings January 12, 2026 08:56

Copilot AI reviewed Jan 12, 2026

View reviewed changes

luoyu-intel and others added 8 commits January 12, 2026 09:11

change description of torch version

f5790b7

use 2.8 as minimum torch version

0cecd9a

update README.md

92cb6f4

Signed-off-by: chensuyue <[email protected]>

update ut scripts

ce53d2b

Signed-off-by: chensuyue <[email protected]>

Merge branch 'main' into suyue/ark_install

15d311a

test with torch gpu

bf49e79

Signed-off-by: chensuyue <[email protected]>

Revert "test with torch gpu"

914f62d

This reverts commit bf49e79.

Merge branch 'main' into suyue/ark_install

2501ab4

jiqing-feng mentioned this pull request Jan 15, 2026

transformers UT test_convert_from_awq_cpu failed after removing ipex #1154

Open

chensuyue merged commit ba48dd7 into main Jan 15, 2026
28 checks passed

chensuyue deleted the suyue/ark_install branch January 15, 2026 06:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

auto-round-kernel installation method #1221

auto-round-kernel installation method #1221

Uh oh!

chensuyue commented Jan 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

chensuyue commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

auto-round-kernel installation method #1221

auto-round-kernel installation method #1221

Uh oh!

Conversation

chensuyue commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

chensuyue commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

chensuyue commented Jan 5, 2026 •

edited

Loading