Missing optimization of DequantizeLinear ∘ Flatten ∘ QuantizeLinear? #21167

mcollinswisc · 2024-06-25T17:28:16Z

It looks like ONNXRuntime optimizes DequantizeLinear ∘ Reshape ∘ QuantizeLinear down to just the Reshape, eliminating the quantization/de-quantization, when the scales and zero points are the same.

However, an equivalent Flatten is not optimized. Is this likely to be just a missing optimization, or is there some reason the Q/DQ pair would be preserved in this case?

Tested out in:
https://gist.github.com/mcollinswisc/d1cd9d13b4e5fbad01c75dca5c9ca576
with ONNXRuntime 1.18.0
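
To illustrate why dropping the pair looks safe, here is a minimal pure-Python sketch. It is not ONNXRuntime code: `dequantize`, `quantize`, `flatten_axis1` and `elementwise` are hypothetical helpers that mimic uint8 DequantizeLinear, Flatten (axis=1) and QuantizeLinear on nested lists, and the scale and zero point are made-up values shared by both sides.

```python
def dequantize(q, scale, zero_point):
    """Per-element DequantizeLinear: (q - zero_point) * scale."""
    return (q - zero_point) * scale


def quantize(x, scale, zero_point, qmin=0, qmax=255):
    """Per-element QuantizeLinear: round half to even, add zero point, saturate."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def elementwise(fn, tensor):
    """Apply fn to every scalar of an arbitrarily nested list."""
    if isinstance(tensor, list):
        return [elementwise(fn, item) for item in tensor]
    return fn(tensor)


def flatten_axis1(tensor):
    """Stand-in for Flatten(axis=1) on a 3-D nested list: (N, A, B) -> (N, A*B)."""
    return [[value for inner in outer for value in inner] for outer in tensor]


# Assumed quantization parameters, identical on the DQ and Q side.
scale, zero_point = 0.05, 128

# A uint8 tensor of shape (2, 2, 3).
q_in = [
    [[0, 17, 128], [200, 255, 1]],
    [[3, 64, 99], [127, 129, 254]],
]

# Path 1: DequantizeLinear -> Flatten -> QuantizeLinear.
dequantized = elementwise(lambda q: dequantize(q, scale, zero_point), q_in)
with_qdq = elementwise(
    lambda x: quantize(x, scale, zero_point), flatten_axis1(dequantized)
)

# Path 2: Flatten applied directly to the quantized tensor.
without_qdq = flatten_axis1(q_in)

print(with_qdq == without_qdq)
```

Since Flatten only moves elements and the quantize step inverts the dequantize step for in-range values, both paths give the same integers here. This only checks the arithmetic; whether ONNXRuntime's selector can accept Flatten is a separate question.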

mcollinswisc · 2024-06-25T18:33:11Z

Perhaps a matter of including Flatten here:

onnxruntime/onnxruntime/core/optimizer/qdq_transformer/selectors_actions/qdq_selector_action_transformer.cc

Line 62 in 4743803

    std::unique_ptr<NodeSelector> selector = std::make_unique<QDQ::DropQDQNodesSelector>(true);

mcollinswisc Jun 25, 2024
Author

Other operators that may make sense to include in that list:

- Expand
- Tile
- Slice
- ~~Split~~ [Uncertain if this selector can handle multiple outputs]
- GatherElements
- ~~ScatterElements~~ [Multiple inputs]
- DepthToSpace
- SpaceToDepth
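
What the operators in this list have in common is that they only copy or rearrange elements. A rough way to check that property is sketched below in pure Python. Everything here is an assumption for illustration: `tile`, `slice_op` and `gather_elements` are simplified 1-D stand-ins rather than the full ONNX operator semantics, and `qdq_is_redundant` is a hypothetical helper, not an ONNXRuntime function.

```python
def dequantize(q, scale, zero_point):
    """Per-element DequantizeLinear: (q - zero_point) * scale."""
    return (q - zero_point) * scale


def quantize(x, scale, zero_point, qmin=0, qmax=255):
    """Per-element QuantizeLinear: round half to even, add zero point, saturate."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


# Simplified 1-D stand-ins for a few of the listed operators.
def tile(values):
    return values * 2                       # repeat the whole vector twice


def slice_op(values):
    return values[1:4]                      # keep elements 1, 2 and 3


def gather_elements(values):
    return [values[i] for i in (4, 0, 0, 2)]  # pick elements by index


def qdq_is_redundant(op, q_values, scale, zero_point):
    """True if DQ -> op -> Q gives the same integers as op applied directly."""
    dequantized = [dequantize(q, scale, zero_point) for q in q_values]
    with_qdq = [quantize(x, scale, zero_point) for x in op(dequantized)]
    return with_qdq == op(q_values)


q_values = [0, 3, 77, 255, 128]  # uint8 inputs
ops = {"Tile": tile, "Slice": slice_op, "GatherElements": gather_elements}

# Assumed scale and zero point, shared by the DQ and Q nodes.
results = {name: qdq_is_redundant(op, q_values, 0.02, 3) for name, op in ops.items()}
print(results)
```

An operator that computes new values from its inputs would not necessarily pass this check, which is why the list is limited to data-movement operators.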

The selector that excludes 16-bit types also optimizes around MaxPool:

onnxruntime/onnxruntime/core/optimizer/qdq_transformer/selectors_actions/qdq_selector_action_transformer.cc

Line 57 in 4743803

{{"MaxPool", {12}},

This perhaps assumes that the quantization scale is strictly positive? (Or maybe this is explicitly checked somewhere...) By the same assumption, I'd guess the following can also be optimized:

mcollinswisc Jun 26, 2024
Author

Addressing the question of whether the quantization scale is positive with #21182
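
To show why the sign of the scale matters for MaxPool, here is a small pure-Python sketch. It assumes a 1-D, non-overlapping pooling window and uint8 values; `max_pool_1d` and `pool_with_qdq` are hypothetical helpers, and the negative scale is used purely as a thought experiment, not as something ONNXRuntime is known to accept.

```python
def dequantize(q, scale, zero_point):
    """Per-element DequantizeLinear: (q - zero_point) * scale."""
    return (q - zero_point) * scale


def quantize(x, scale, zero_point, qmin=0, qmax=255):
    """Per-element QuantizeLinear: round half to even, add zero point, saturate."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def max_pool_1d(values, window):
    """Max over consecutive, non-overlapping windows."""
    return [
        max(values[i:i + window])
        for i in range(0, len(values) - window + 1, window)
    ]


def pool_with_qdq(q_values, scale, zero_point, window):
    """DequantizeLinear -> MaxPool -> QuantizeLinear with shared parameters."""
    dequantized = [dequantize(q, scale, zero_point) for q in q_values]
    pooled = max_pool_1d(dequantized, window)
    return [quantize(x, scale, zero_point) for x in pooled]


q_values = [10, 200, 35, 90, 128, 127]
integer_pool = max_pool_1d(q_values, 2)  # MaxPool applied directly to the integers

# Positive scale: dequantization preserves ordering, so the Q/DQ pair is redundant.
positive = pool_with_qdq(q_values, 0.1, 128, 2)

# Negative scale: dequantization reverses ordering, so the float max picks
# the element with the smallest integer value in each window.
negative = pool_with_qdq(q_values, -0.1, 128, 2)

print(positive == integer_pool, negative == integer_pool)
```

With a positive scale the two paths agree. With a negative scale the Q/DQ path behaves like a min-pool on the integers, so dropping the pair would change the result.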

mcollinswisc Jul 8, 2024
Author

DepthToSpace and SpaceToDepth can't be included because there are no integer implementations:
#21287
