NXP backend: Resolve limitations of uncertain tensor formats #13942

MartinPavella · 2025-09-04T09:35:36Z

Summary

This PR resolves format related issues by inferring the format (NCHW/NHWC) for all nodes before partitioning. These formats are then used by the NeutronPartitioner to accurately determine which nodes are supported on Neutron.

Test plan

Unit tests provided, and correct function is tested by nearly every test in the nxp backend.

cc @robert-kalmar @roman-janik-nxp @StrycekSimon @jirioc

MartinPavella · 2025-09-04T09:35:45Z

@pytorchbot label "module: nxp" "release notes: nxp"

pytorch-bot · 2025-09-04T09:35:53Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13942

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 1 Cancelled Job, 3 Unrelated Failures

As of commit 8255593 with merge base 03333c5 ():

NEW FAILURES - The following jobs have failed:

pull / test-moshi-linux / linux-job (gh)
RuntimeError: Command docker exec -t a4533010a3954df58034f7af6bef6a5d848759b1c342eefb89ac0717215bfbf0 /exec failed with exit code 1
pull / unittest-editable / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 5

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-samsung-models-linux / linux-job (gh)

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-binary-size-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-openvino-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-setup-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

… node format. The pass `RemoveGetItemPass` replaces a `max_pool2d_with_indices` node with a `max_pool2d` node, that doesn't require a GetItem afterward. The new operator must, however, preserve the original node format. Therefore, a copy of the pass was created in `backends/nxp/_passes`, where it was modified. The new directory was created, because the pass doesn't follow the `NeutronEdgePass` interface.

Before, the format inference was done during conversion to NeutronIR (after partitioning), so the partitioner didn't yet know the formats. Now, the partitioner has the format data, which can be used to accurately select nodes for delegation.

…ode formats.

robert-kalmar · 2025-09-11T09:15:31Z

backends/nxp/backend/node_format_inference.py

+                node.meta = {}
+            if NXP_NODE_FORMAT not in node.meta:
+                logging.warning(f"Node `{node}` does not have inferred format.")
+                node.meta[NXP_NODE_FORMAT] = NodeFormat.NONE


As we now perform the node format inference during partition (that is on the whole Edge Program), it is likely that some nodes wont have the format determined, as the NodeFormat inference algorithm does not know them. Right?

We should make sure we stop the channel_first tag propagation on unknown operator, as we cannot determine if it propagates the channel_first or stops it. As example, the Reshape stops the propagation of channel first tag. But if we would not know the Reshape, op we were incorrectly propagate the channel_first tag behind it in the compute path.
So we must defensively stop the propagation at every unknown node. Is my thought process correct?

Yes, you are correct, with a few caveats.
The fact that we currently propagate the format through unknown nodes, should never cause crashes, as our format handling system is quite robust. It can, however, result in unnecessary transpositions. As the "unknown" operators will inevitably not be delegated, they will split the graph, resulting in multiple delegated partitions. It is possible (and likely) that one of these partitions requires NHWC, which is propagated to the second partition (via the "unknown" node), but the second partition doesn't require NHWC. If we keep our format inference as is, unnecessary transpositions would have to be done at the inputs and outputs of the second partition.

I will update the code to not propagate the format through "unknown" operators.

robert-kalmar · 2025-09-11T09:27:53Z

backends/nxp/_passes/remove_getitem_pass.py

+            if node.op == "call_function":
+                if (
+                    node.target.__name__ == "aten.max_pool2d_with_indices.default"
+                    or node.target.__name__ == "aten.max.dim"


Why do we handle the aten.max.dim too? Is it a loftover from original pass?

Yes, it was in the original file. I wanted to make as few changes as possible, as it is not the main focus of this PR.

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 4, 2025

pytorch-bot bot added module: nxp Issues related to NXP Neutron NPU delegation and code under backends/nxp/ release notes: nxp Changes to the NXP Neutron backend delegate labels Sep 4, 2025

mergennachin requested review from robert-kalmar, jirioc, roman-janik-nxp and digantdesai September 5, 2025 13:36

MartinPavella added 5 commits September 10, 2025 16:28

NXP backend: Store inferred node format in the node.meta.

d299bae

NXP backend: Improve cat delegation by using inferred node formats.

904d36b

NXP backend: Improve constant_pad_nd delegation by using inferred n…

8255593

…ode formats.

robert-kalmar force-pushed the upstream/main-nxp/EIEX-392-resolve-limitations-of-uncertain-tensor-formats branch from 1d726a4 to 8255593 Compare September 10, 2025 14:28

roman-janik-nxp approved these changes Sep 10, 2025

View reviewed changes

robert-kalmar reviewed Sep 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NXP backend: Resolve limitations of uncertain tensor formats #13942

NXP backend: Resolve limitations of uncertain tensor formats #13942

Uh oh!

MartinPavella commented Sep 4, 2025 •

edited

Loading

Uh oh!

MartinPavella commented Sep 4, 2025

Uh oh!

pytorch-bot bot commented Sep 4, 2025 •

edited

Loading

Uh oh!

robert-kalmar Sep 11, 2025

Uh oh!

MartinPavella Sep 11, 2025 •

edited

Loading

Uh oh!

robert-kalmar Sep 11, 2025

Uh oh!

MartinPavella Sep 11, 2025

Uh oh!

Uh oh!

NXP backend: Resolve limitations of uncertain tensor formats #13942

Are you sure you want to change the base?

NXP backend: Resolve limitations of uncertain tensor formats #13942

Uh oh!

Conversation

MartinPavella commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

MartinPavella commented Sep 4, 2025

Uh oh!

pytorch-bot bot commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13942

❌ 2 New Failures, 1 Cancelled Job, 3 Unrelated Failures

Uh oh!

robert-kalmar Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

MartinPavella Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

robert-kalmar Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

MartinPavella Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MartinPavella commented Sep 4, 2025 •

edited

Loading

pytorch-bot bot commented Sep 4, 2025 •

edited

Loading

MartinPavella Sep 11, 2025 •

edited

Loading