Integrate Automated QDQ placement tool - Part 4 #704

willg-nv · 2025-12-17T07:03:58Z

What does this PR do?

Type of change: new feature

Overview: This PR integrates the automated QDQ placement tool into ModelOpt. It is part 4 of 4 and contains the following changes:

Adds reference and design documents
Adds a simple ResNet example
Updates the changelog
Prepends a timestamp to the logger format text

Part 1: #701
Part 2: #702
Part 3: #703
Part 4: #704

Usage

See the example's README.md for details on the example
See the docs for the design and usage of this tool

Testing

This PR does not contain tests.

Before your PR is "Ready for review"

Make sure you read and follow Contributor guidelines and your commits are signed.
Is this change backward compatible?: Yes
Did you write any new necessary tests?: No
Did you add or update any necessary documentation?: Yes
Did you update Changelog?: Yes

Additional Information

copy-pr-bot · 2025-12-17T07:04:02Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

willg-nv · 2025-12-22T03:25:31Z

@ChenhanYu @cjluo-nv could you help me review this PR? thanks!

gcunhase · 2026-01-09T15:32:38Z

docs/source/guides/9_qdq_placement.rst

+
+**Q: Can I optimize for accuracy instead of latency?**
+
+A: Currently, the autotuner optimizes for latency. For accuracy-aware optimization, you would need to implement a custom benchmarking function that evaluates accuracy on a validation dataset.


Can we provide an example on how users could do that? You may re-use or modify the evaluate.py script in modelopt/examples/onnx_ptq as a starting point.

Sorry, this line was generated by Cursor. This tool currently focuses only on performance; accuracy-aware optimization is not supported.

Is there any way for users to implement something that directs the Q/DQ node placement according to an accuracy metric as well, or would that not be straightforward to do?

Signed-off-by: Will Guo <[email protected]>

gcunhase · 2026-01-13T00:50:56Z

docs/source/guides/9_qdq_placement.rst

+
+**Q: Can I optimize for accuracy instead of latency?**
+
+A: Currently, the autotuner optimizes for latency.


nit: A: Currently, the autotuner optimizes for latency only.

willg-nv requested review from a team as code owners December 17, 2025 07:03

willg-nv requested review from ChenhanYu and cjluo-nv December 17, 2025 07:03

willg-nv changed the title ~~Dev willg integrate auto qdq placement part4~~ Integrate Automated QDQ placement tool - Part 4 Dec 17, 2025

This was referenced Dec 17, 2025

Integrate Automated QDQ placement tool - Part 3 #703

Open

Integrate Automated QDQ placement tool - Part 2 #702

Open

Integrate Automated QDQ placement tool - Part 1 #701

Open

willg-nv force-pushed the dev-willg-integrate-auto-qdq-placement-part4 branch from 1698082 to 6d55fcb Compare December 31, 2025 01:58

gcunhase reviewed Jan 9, 2026


Integrate Automated QDQ placement tool - part 4

14ab5b6

Signed-off-by: Will Guo <[email protected]>

willg-nv force-pushed the dev-willg-integrate-auto-qdq-placement-part4 branch from 6d55fcb to 14ab5b6 Compare January 12, 2026 03:17

gcunhase reviewed Jan 13, 2026



