[ET-VK] Re-implement (de)quantize_per_tensor.default #15721

SS-JIA · 2025-11-10T21:32:43Z

Stack from ghstack (oldest at bottom):

Re-implement the quantized_decomposed.(de)quantize_per_tensor.default ops with

add_quantize_and_pack_4w4c_node

As a consequence, the et_vk.quantize_q8ta_for_conv2d.default and et_vk.dequantize_q8to_from_conv2d.default ops are not needed anymore.

The overall goal is to streamline the quantize/dequantize interface in ET-VK.

Differential Revision: D86702457

Re-implement the `quantized_decomposed.(de)quantize_per_tensor.default` ops with `add_quantize_and_pack_4w4c_node` As a consequence, the `et_vk.quantize_q8ta_for_conv2d.default` and `et_vk.dequantize_q8to_from_conv2d.default` ops are not needed anymore. The overall goal is to streamline the quantize/dequantize interface in ET-VK. Differential Revision: [D86702457](https://our.internmc.facebook.com/intern/diff/D86702457/) [ghstack-poisoned]

pytorch-bot · 2025-11-10T21:32:47Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15721

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 62fdd2e with merge base aba44fd ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Re-implement the `quantized_decomposed.(de)quantize_per_tensor.default` ops with `add_quantize_and_pack_4w4c_node` As a consequence, the `et_vk.quantize_q8ta_for_conv2d.default` and `et_vk.dequantize_q8to_from_conv2d.default` ops are not needed anymore. The overall goal is to streamline the quantize/dequantize interface in ET-VK. Differential Revision: [D86702457](https://our.internmc.facebook.com/intern/diff/D86702457/) ghstack-source-id: 322214458 Pull Request resolved: #15721

github-actions · 2025-11-10T21:33:34Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Re-implement the `quantized_decomposed.(de)quantize_per_tensor.default` ops with `add_quantize_and_pack_4w4c_node` As a consequence, the `et_vk.quantize_q8ta_for_conv2d.default` and `et_vk.dequantize_q8to_from_conv2d.default` ops are not needed anymore. The overall goal is to streamline the quantize/dequantize interface in ET-VK. Differential Revision: [D86702457](https://our.internmc.facebook.com/intern/diff/D86702457/) ghstack-source-id: 322214458 Pull Request resolved: #15721

This was referenced Nov 10, 2025

[ET-VK][ez] Apply quantize op replacement to all argument nodes #15702

Merged

[ET-VK] Allow buffer input/output for quantize/dequantize for conv2d ops #15703

Merged

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 10, 2025

meta-codesync bot added fb-exported meta-exported labels Nov 10, 2025

manuelcandales approved these changes Nov 11, 2025

View reviewed changes

meta-codesync bot merged commit 493236a into gh/SS-JIA/367/base Nov 11, 2025
14 of 23 checks passed

meta-codesync bot deleted the gh/SS-JIA/367/head branch November 11, 2025 18:29

pytorchbot mentioned this pull request Nov 11, 2025

[ET-VK] Re-implement (de)quantize_per_tensor.default #15753

Merged

meta-codesync bot temporarily deployed to cherry-pick-bot November 11, 2025 18:29 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK] Re-implement (de)quantize_per_tensor.default #15721

[ET-VK] Re-implement (de)quantize_per_tensor.default #15721

Uh oh!

SS-JIA commented Nov 10, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 10, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Nov 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ET-VK] Re-implement (de)quantize_per_tensor.default #15721

[ET-VK] Re-implement (de)quantize_per_tensor.default #15721

Uh oh!

Conversation

SS-JIA commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15721

✅ No Failures

Uh oh!

github-actions bot commented Nov 10, 2025

This PR needs a release notes: label

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SS-JIA commented Nov 10, 2025 •

edited

Loading

pytorch-bot bot commented Nov 10, 2025 •

edited

Loading

This PR needs a `release notes:` label