
Conversation

@StrycekSimon
Collaborator

@StrycekSimon commented Dec 15, 2025

Summary

Enables optimization of Conv+BatchNorm during QAT. This involves:

  • Enabling TorchAO's native Conv+BatchNorm fusion (by disabling our FuseBatchNormWithConvPass when in QAT mode)
  • Removing output quantization of the convolution when it is followed by BatchNorm, so the native Conv+BN fusion can properly match the pattern to be replaced (see the sketch below)
  • Adding a BatchNorm output quantization pattern implementation
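
A minimal sketch of the second point, assuming FX graph nodes with a .users mapping; _is_batch_norm is the helper this PR uses, while _conv_output_should_stay_unquantized is a hypothetical name used only for illustration:

    def _conv_output_should_stay_unquantized(conv_node, is_qat: bool) -> bool:
        # Hypothetical helper illustrating the rule; not the PR's literal code.
        if not is_qat:
            return False
        users = list(conv_node.users)
        # Skip output quantization only when BatchNorm is the sole consumer of the
        # convolution, so the native Conv+BN fusion can match the unquantized pattern.
        return len(users) == 1 and _is_batch_norm(users[0])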

Test plan

Test cases for Conv+BatchNorm fusion and quantization were added.
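
For illustration, a hedged sketch of a Conv + BatchNorm test module in the spirit of the ConvBNModule mentioned in the review below; the actual model in backends/nxp/tests/models.py covers Conv1d/2d/Transpose variants and may differ in shapes and structure:

    import torch

    class ConvBNSketch(torch.nn.Module):
        """Small Conv2d followed by BatchNorm2d, parameterized over bias/affine."""

        def __init__(self, bias: bool = True, affine: bool = True):
            super().__init__()
            self.conv = torch.nn.Conv2d(3, 8, kernel_size=3, bias=bias)
            self.bn = torch.nn.BatchNorm2d(8, affine=affine)

        def forward(self, x):
            return self.bn(self.conv(x))

    # Example input matching the Conv2d above.
    example_inputs = (torch.randn(1, 3, 32, 32),)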

cc @robert-kalmar @JakeStevens @digantdesai

Copilot AI review requested due to automatic review settings December 15, 2025 11:44
@pytorch-bot

pytorch-bot bot commented Dec 15, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16246

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 3 Unrelated Failures

As of commit 8503477 with merge base b081123:

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla bot added the CLA Signed label Dec 15, 2025
Contributor

Copilot AI left a comment

Pull request overview

This PR enables optimization of Conv+BatchNorm patterns during Quantization-Aware Training (QAT) in the NXP backend. The implementation leverages TorchAO's native Conv+BN fusion by conditionally skipping output quantization on Conv operations that are followed by BatchNorm in QAT mode. The key mechanism is disabling ExecuTorch's FuseBatchNormWithConvPass in QAT mode and introducing a new BatchNormPattern to preserve quantization for subsequent layers.

Key Changes:

  • Added --use_qat command-line argument to enable QAT mode during model compilation
  • Implemented conditional logic to skip Conv output quantization when followed by BatchNorm in QAT mode
  • Added BatchNormPattern to quantize BatchNorm outputs while leaving inputs unquantized for Conv+BN fusion (see the sketch below)
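
A very rough sketch of the shape such a pattern might take, reusing the PartitionAnchors fields visible in the review excerpt further below; the base class, method signature, matched op, and node lookup are assumptions, not this PR's literal code:

    class BatchNormPattern(QuantizationPattern):  # base class name assumed
        def partition_types(self):
            # The exact op targeted by the PR may differ.
            return [torch.ops.aten.batch_norm.default]

        def get_anchors(self, gm, fused_partition):
            bn_node = fused_partition[0].nodes[-1]  # node lookup assumed
            return PartitionAnchors(
                inputs=[],            # leave the input (the Conv output) unquantized
                weights=[],
                biases=[],
                output=[(bn_node,)],  # quantize only the BatchNorm output
            )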

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 6 comments.

Summary per file:
  • examples/nxp/aot_neutron_compile.py: Adds the --use_qat CLI argument and passes it to NeutronQuantizer and calibrate_and_quantize
  • backends/nxp/quantizer/neutron_quantizer.py: Registers BatchNormPattern and filters out FuseBatchNormWithConvPass when in QAT mode
  • backends/nxp/quantizer/patterns.py: Adds the BatchNormPattern class and conditional output quantization logic in the Conv patterns for QAT Conv+BN fusion
  • backends/nxp/tests/models.py: Adds the ConvBNModule test model supporting Conv1d/2d/Transpose + BatchNorm combinations
  • backends/nxp/tests/test_quantizer.py: Adds a parameterized test for Conv+BN fusion in QAT mode across different conv types, bias, and affine configurations
Comments suppressed due to low confidence (3)

backends/nxp/quantizer/patterns.py:444

  • The output_specs variable is set conditionally in the QAT block (lines 432-438) but is never used in the return statement on line 444, which hardcodes [(conv_node,)] instead. This means the Conv+BatchNorm fusion logic for QAT mode has no effect for Conv1dPattern and ConvTranspose1dPattern, which inherit from this class. The variable should be initialized before the conditional block as output_specs = [(conv_node,)], and the return statement should use output=output_specs, matching the pattern used in Conv2dPattern.
        if self.is_qat:
            conv_users = conv_node.users
            possibly_bn = list(conv_users.keys())[0] if len(conv_users) == 1 else None
            if possibly_bn and _is_batch_norm(possibly_bn):
                output_specs = []
            else:
                output_specs = [(conv_node,)]

        return PartitionAnchors(
            inputs=[(conv_node, NodeArgsIdx(0))],
            weights=[(conv_node, NodeArgsIdx(1), weight_quantization_spec)],
            biases=bias,
            output=[(conv_node,)],
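
A sketch of the suggested fix, using the names from the excerpt above: initialize output_specs before the branch and return it, mirroring Conv2dPattern (the surrounding method is omitted):

        output_specs = [(conv_node,)]
        if self.is_qat:
            conv_users = conv_node.users
            possibly_bn = list(conv_users.keys())[0] if len(conv_users) == 1 else None
            if possibly_bn and _is_batch_norm(possibly_bn):
                # Leave the Conv output unquantized so the Conv+BN fusion can match.
                output_specs = []

        return PartitionAnchors(
            inputs=[(conv_node, NodeArgsIdx(0))],
            weights=[(conv_node, NodeArgsIdx(1), weight_quantization_spec)],
            biases=bias,
            output=output_specs,
        )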

backends/nxp/quantizer/patterns.py:436

  • Variable output_specs is not used.
                output_specs = []

backends/nxp/quantizer/patterns.py:438

  • Variable output_specs is not used.
                output_specs = [(conv_node,)]


@StrycekSimon marked this pull request as draft December 15, 2025 12:35
@StrycekSimon force-pushed the EIEX-650-fix-native-conv-batchnorm-fusing-leaving-artefacts branch from 6bb4c2d to 33cb6e4 on December 15, 2025 13:31
@StrycekSimon requested a review from Copilot December 15, 2025 13:43
@StrycekSimon force-pushed the EIEX-650-fix-native-conv-batchnorm-fusing-leaving-artefacts branch from 33cb6e4 to 0675a79 on December 15, 2025 13:48
Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.



@StrycekSimon force-pushed the EIEX-650-fix-native-conv-batchnorm-fusing-leaving-artefacts branch from 0675a79 to 2e873bc on December 15, 2025 15:35
@StrycekSimon force-pushed the EIEX-650-fix-native-conv-batchnorm-fusing-leaving-artefacts branch from 2e873bc to 8503477 on December 15, 2025 16:17
@robert-kalmar added the module: nxp and release notes: nxp labels Dec 16, 2025
@robert-kalmar marked this pull request as ready for review December 17, 2025 07:52
action="store_true",
required=False,
default=False,
help="Use QAT mode for quantization (does not include QAT training)",
Collaborator

If the quantization aware training is not possible using this module, why include it? Just to show how it can be triggered? If so, perhaps a separate example module, or even just a README might be better in my opinion.

@StrycekSimon
Collaborator Author

The failing unittest jobs are related to the XNNPACK backend. Although I have not seen this error in other PRs, the implementation in this PR should not interfere with it.
