NXP backend: Per-channel quantization of convolution layer #14061
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14061
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 1 Cancelled Job. As of commit 1a3d9e5 with merge base 7e228ee:
NEW FAILURE - The following job has failed:
CANCELLED JOB - The following job was cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Based on a conversation from this PR, I added a feature that uses the implementation. The old PR can be declined.
@pytorchbot label "release notes: nxp"
)
assert nodes[10].name == "aten_convolution_default"

@classmethod
Minor: I would move this method up to be 1st, to align with other tests.
This is pretty chaotic across our unittest tests. I will make it consistent.
Done ✅
"QDQDequantizeConverter",
"QDQPerTensorDequantizeConverter",
"QDQPerChannelDequantizeConverter",
"QDQQuantizeConverter",
only QDQDequantizer needs to be updated, not QDQQuantizeConverter too?
Correct, as there are no changes to QDQQuantizeConverter. The per-channel quantization scheme is used only for weights and biases, which are inputs (dequantize nodes).
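For illustration (this is not the PR's code, just a minimal numpy sketch), the difference is that per-channel quantization keeps one scale per output channel of the conv weight, whereas per-tensor uses a single scale for the whole tensor; the dequantize node then reconstructs the float weight from the integer data and those scales:

```python
import numpy as np

# Hedged sketch of symmetric per-channel weight quantization; helper names
# are assumptions for illustration, not the Neutron Quantizer implementation.
def quantize_per_channel(w, axis=0, qmax=127):
    # One scale per channel along `axis`, derived from that channel's max |value|.
    reduce_axes = tuple(i for i in range(w.ndim) if i != axis)
    scales = np.abs(w).max(axis=reduce_axes) / qmax
    shape = [1] * w.ndim
    shape[axis] = -1
    q = np.clip(np.round(w / scales.reshape(shape)), -qmax - 1, qmax).astype(np.int8)
    return q, scales

def dequantize_per_channel(q, scales, axis=0):
    # What a per-channel dequantize node computes: int data * per-channel scale.
    shape = [1] * q.ndim
    shape[axis] = -1
    return q.astype(np.float32) * scales.reshape(shape)

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 3, 3, 3)).astype(np.float32)  # [out_ch, in_ch, kH, kW]
q, scales = quantize_per_channel(w)        # axis 0 = one scale per output channel
w_hat = dequantize_per_channel(q, scales)  # reconstruction seen by the conv
```

Because each filter gets its own scale, a channel with a small dynamic range is not forced onto the grid of the largest channel, which is why this scheme is attractive for conv weights.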
list[tuple[fx.Node, NodeArgsIdx]]
| list[tuple[fx.Node, NodeArgsIdx, DerivedQuantizationSpec]]
),
spec: QuantizationSpec | None,
just curious, why switch from Optional to | None?
It's part of the move to Python 3.10 type hints, which lets us drop the imports from typing.
# pyre-ignore[6]: incompatible parameter type
annotate_inputs(anchors.inputs, input_act_qspec)
annotate_weights_or_biases(anchors.weights, weight_qspec)
is this function no longer used at all now and can be removed entirely?
Yes, it is replaced by annotate_inputs().
The error in Samsung's tests seems to be caused by a problem with downloading resources and is most likely unrelated. Update: I double-checked, and this error is also present in other PRs...
Summary
Adds per-channel quantization for the convolution layer and introduces a NodeArgsIdx class to the Neutron Quantizer for better handling of indices into a quantized node's args list.
NodeArgsIdx allows selection of nested objects, e.g. an object inside a list in a node's args list. It also simplifies the NeutronAtenQuantizer annotation process by using annotate_inputs() for inputs, weights, and biases.
Test plan
The implementation should be covered by either existing or newly added unit tests.
cc @digantdesai @JakeStevens @robert-kalmar @skywall @roman-janik-nxp