Will onxxruntime backend support INT8 on cpu ? #240

bharadwajymg · 2024-02-20T15:25:57Z

Hi,
we are trying to quantise our onnx models to int8 to run on cpu using : https://onnxruntime.ai/docs/performance/model-optimizations/quantization.html#quantization-on-gpu

we are using dynamic quantisation , and banking on AVX2 and AVX512 extensions , when we tested our models using onnx runtime we see an improvement so cross checking if this backend supports them by directly defining backend in config.pbtxt ?

Jackiexiao · 2024-03-08T07:41:54Z

yes, onxxruntime backend support INT8 on cpu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Will onxxruntime backend support INT8 on cpu ? #240

Will onxxruntime backend support INT8 on cpu ? #240

bharadwajymg commented Feb 20, 2024

Jackiexiao commented Mar 8, 2024

Will onxxruntime backend support INT8 on cpu ? #240

Will onxxruntime backend support INT8 on cpu ? #240

Comments

bharadwajymg commented Feb 20, 2024

Jackiexiao commented Mar 8, 2024