[QNN-EP] Support alternate Layernorm fusion pattern in QNN preprocess #26060

qti-mattsinc · 2025-09-16T19:48:31Z

Description

Small change to allow QNN Preprocess to allow a Mul node (with A=B) instead of a Pow node (with Y=2) for layernorm fusion.

HectorSVC · 2025-09-16T21:35:30Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-09-16T21:36:02Z

Azure Pipelines successfully started running 4 pipeline(s).

Copilot

Pull request overview

This PR extends the QNN preprocessor's LayerNorm fusion capability to recognize an alternate pattern where a Mul node (with both inputs being the same tensor) is used instead of a Pow node (with exponent 2.0) for computing the squared values. This is mathematically equivalent since x² = x * x, and some model exporters may generate this pattern.

Key Changes:

Added documentation for the Mul-based LayerNorm fusion pattern
Extended pattern matching to recognize Mul nodes in place of Pow nodes
Added validation logic to ensure Mul nodes have matching inputs

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

onnxruntime/python/tools/quantization/fusions/fusion_layernorm.py

yuslepukhin

yuslepukhin · 2026-01-12T19:52:19Z

Please, rebase off main

qti-mattsinc · 2026-01-12T23:30:59Z

Rebased on top of tip

yuslepukhin · 2026-01-13T20:13:17Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows OpenVINO CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2026-01-13T20:13:37Z

Azure Pipelines successfully started running 4 pipeline(s).

qti-mattsinc · 2026-01-15T00:41:38Z

The CI error does not seem related to the change based on the logs. Not sure if this is known flaky behavior. @yuslepukhin, maybe worth retriggering it?

yuslepukhin · 2026-01-15T22:19:42Z

Can you rebase/merge from main? #Resolved

yuslepukhin · 2026-01-15T22:48:36Z

Please, refrain from force pushing, this makes things longer.

qti-mattsinc · 2026-01-15T22:48:57Z

Can you rebase/merge from main?

Rebased on top of current tip of main (1c02b79)

qti-mattsinc · 2026-01-15T22:52:24Z

Please, refrain from force pushing, this makes things longer.

My bad. For future reference, would it have been correct just to merge main into this branch, then? I'm much more familiar with stacked diff workflows (Gerrit), which would have would have preserved CI checks after a force-push to rebase.

yuslepukhin · 2026-01-15T22:58:22Z

Please, refrain from force pushing, this makes things longer.

My bad. For future reference, would it have been correct just to merge main into this branch, then? I'm much more familiar with stacked diff workflows (Gerrit), which would have would have preserved CI checks after a force-push to rebase.

Yes, simply merging from main is the easiest and preferred way.

yuslepukhin · 2026-01-15T22:58:37Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows OpenVINO CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2026-01-15T22:58:54Z

Azure Pipelines successfully started running 4 pipeline(s).

yuslepukhin · 2026-01-15T23:09:18Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-01-15T23:09:35Z

Azure Pipelines successfully started running 4 pipeline(s).

yuslepukhin · 2026-01-15T23:29:53Z

/azp run Windows ARM64 QNN CI Pipeline

azure-pipelines · 2026-01-15T23:30:02Z

Azure Pipelines successfully started running 1 pipeline(s).

…microsoft#26060) ### Description Small change to allow QNN Preprocess to allow a Mul node (with A=B) instead of a Pow node (with Y=2) for layernorm fusion.

…#26060) ### Description Small change to allow QNN Preprocess to allow a Mul node (with A=B) instead of a Pow node (with Y=2) for layernorm fusion. (cherry picked from commit e7dfd69)

HectorSVC added the ep:QNN issues related to QNN exeution provider label Sep 16, 2025

yuslepukhin requested a review from Copilot December 17, 2025 20:05

Copilot started reviewing on behalf of yuslepukhin December 17, 2025 20:06 View session

Copilot AI reviewed Dec 17, 2025

View reviewed changes

onnxruntime/python/tools/quantization/fusions/fusion_layernorm.py Show resolved Hide resolved

yuslepukhin approved these changes Jan 7, 2026

View reviewed changes

qti-mattsinc force-pushed the dev/mattsinc/AISW-148002 branch 2 times, most recently from 1161fcd to ebaa0cd Compare January 12, 2026 23:29

qti-mattsinc added 2 commits January 15, 2026 14:46

[QNN-EP] Support alternate Layernorm fusion pattern in QNN preprocess

0bae7c7

Add to unit tests

591cbe1

qti-mattsinc force-pushed the dev/mattsinc/AISW-148002 branch from ebaa0cd to 591cbe1 Compare January 15, 2026 22:47

yuslepukhin merged commit e7dfd69 into microsoft:main Jan 16, 2026
90 of 91 checks passed

edgchen1 added the release:1.24.0 label Jan 16, 2026

This was referenced Jan 21, 2026

1.24.0 release cherry-pick round 1 #27103

Closed

1.24.0 release cherry-pick round 1 #27104

Open

[QNN-EP] Support alternate Layernorm fusion pattern in QNN preprocess #26060

[QNN-EP] Support alternate Layernorm fusion pattern in QNN preprocess #26060

Uh oh!

Conversation

qti-mattsinc commented Sep 16, 2025

Description

Uh oh!

HectorSVC commented Sep 16, 2025

Uh oh!

azure-pipelines bot commented Sep 16, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

yuslepukhin left a comment

Choose a reason for hiding this comment

Uh oh!

yuslepukhin commented Jan 12, 2026

Uh oh!

qti-mattsinc commented Jan 12, 2026

Uh oh!

yuslepukhin commented Jan 13, 2026

Uh oh!

azure-pipelines bot commented Jan 13, 2026

Uh oh!

qti-mattsinc commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yuslepukhin commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yuslepukhin commented Jan 15, 2026

Uh oh!

qti-mattsinc commented Jan 15, 2026

Uh oh!

qti-mattsinc commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yuslepukhin commented Jan 15, 2026

Uh oh!

yuslepukhin commented Jan 15, 2026

Uh oh!

azure-pipelines bot commented Jan 15, 2026

Uh oh!

yuslepukhin commented Jan 15, 2026

Uh oh!

azure-pipelines bot commented Jan 15, 2026

Uh oh!

yuslepukhin commented Jan 15, 2026

Uh oh!

azure-pipelines bot commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

qti-mattsinc commented Jan 15, 2026 •

edited

Loading

yuslepukhin commented Jan 15, 2026 •

edited

Loading

qti-mattsinc commented Jan 15, 2026 •

edited

Loading