Add ModelOpt support #513

sam-india-007 · 2024-05-24T05:23:47Z

Changes the builder utility to add an option to use NVIDIA's TensorRT Model Optimizer (https://github.com/NVIDIA/TensorRT-Model-Optimizer).

sam-india-007 · 2024-05-24T05:35:11Z

@microsoft-github-policy-service agree company="NVIDIA"

src/python/py/models/builder.py

kunal-vaishnavi · 2024-05-29T01:06:18Z

Thank you for your contribution!

How long does the int4_awq_clip quantization take with modelopt? The goal of the model builder is to quickly produce and save the final ONNX model to disk within a few minutes max. If int4_awq_clip takes a while, a note will need to be added somewhere so users are aware in advance that the model builder will take time to run with int4_awq_clip quantization enabled.


sam-india-007 · 2024-05-30T05:44:14Z

ModelOpt quantization with int4_awq_clip can take a significant amount of time, but it results in smaller and better quantization. I added a disclaimer to the README to this effect.
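Beyond a README disclaimer, the builder could also surface the notice at run time. The following is a minimal sketch of that idea, not the code in this PR: `SLOW_METHODS`, `run_quantization`, and the `quantize_fn` callback are hypothetical names introduced for illustration, and the sketch does not call ModelOpt itself.

```python
import time
import warnings

# Hypothetical set of quantization methods treated as slow; only
# "int4_awq_clip" is named in this thread.
SLOW_METHODS = {"int4_awq_clip"}


def run_quantization(method, quantize_fn):
    """Warn up front if `method` is slow, then run and time `quantize_fn`."""
    if method in SLOW_METHODS:
        warnings.warn(
            f"{method} quantization may take a significant amount of time.",
            UserWarning,
            stacklevel=2,
        )
    start = time.perf_counter()
    result = quantize_fn()  # placeholder for the real quantization call
    elapsed = time.perf_counter() - start
    print(f"{method} finished in {elapsed:.1f}s")
    return result, elapsed
```

Passing the quantization step as a callback keeps the warning and timing logic independent of whichever quantizer the builder actually invokes.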

src/python/py/models/README.md

src/python/py/models/builder.py

Samriddha Sinha added 2 commits May 17, 2024 16:39

modified: src/python/py/models/builder.py

62fb167

Add ModelOpt support

2b968e3

kunal-vaishnavi reviewed May 29, 2024

src/python/py/models/builder.py (outdated, resolved)

kunal-vaishnavi reviewed May 29, 2024

src/python/py/models/builder.py (resolved)

kunal-vaishnavi reviewed May 29, 2024

src/python/py/models/builder.py (outdated, resolved)

kunal-vaishnavi reviewed May 29, 2024

src/python/py/models/builder.py (outdated, resolved)

sam-india-007 added 3 commits May 30, 2024 11:05

Adding suggested changes

4526a43

Merge branch 'main' of https://github.com/sam-india-007/onnxruntime-g…

fe208c6

…enai

Update builder.py

cfb37d4

kunal-vaishnavi reviewed May 31, 2024

src/python/py/models/README.md (resolved)

jambayk reviewed Jun 3, 2024

src/python/py/models/builder.py (resolved)

riyadshairi979 reviewed Jun 17, 2024

src/python/py/models/builder.py (outdated, resolved)

Update builder.py, remove unverified info

55dce33

kunal-vaishnavi mentioned this pull request Oct 15, 2024

Integration nvidia modelopt quantization in GenAI builder #984

Closed

kunal-vaishnavi closed this Oct 15, 2024
