Some ONNX model takes over 50mins to load #23644
Unanswered
lorenzo-park
asked this question in
Performance Q&A
What would be the cause?
I exported a model to ONNX format, but it is extremely slow to load: it takes over 40 minutes. The model file is about 1.16GB. I also tried another model (that one is larger than 2GB, so its weights are stored in separate files alongside the model), and it loads in less than 1 minute.
As you can see in the log, loading takes almost 50 minutes between the moment `Use DeviceBasedPartition as default` is logged and the moment `Saving initialized tensors.` is logged. I am trying to dig into what is happening here, but I have no clues so far. Here is the model file on Google Drive: https://drive.google.com/file/d/1C3lF5IkL3DDvAh7-Ktg6i8R32pzS3DiL/view?usp=sharing
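One way to narrow this down is to time session creation twice, once with default settings and once with graph optimizations disabled. This is only a minimal sketch: it assumes (without confirmation) that the time might be spent in graph optimization passes, the helper names `time_call`, `load_session`, and `compare_load_times` are made up for illustration, and it uses `CPUExecutionProvider` just to keep the comparison simple.

```python
import time


def time_call(label, fn):
    """Run fn() once, print the elapsed wall-clock time, and return (result, seconds)."""
    start = time.perf_counter()
    result = fn()
    elapsed = time.perf_counter() - start
    print(f"{label}: {elapsed:.1f}s")
    return result, elapsed


def load_session(model_path, disable_optimizations=False):
    """Create an InferenceSession with verbose logging (hypothetical helper)."""
    # Imported lazily so time_call can be used without onnxruntime installed.
    import onnxruntime as ort

    options = ort.SessionOptions()
    options.log_severity_level = 0  # 0 = verbose, so each load stage is logged
    if disable_optimizations:
        # Skip all graph optimization passes to test the guess that they are slow.
        options.graph_optimization_level = ort.GraphOptimizationLevel.ORT_DISABLE_ALL
    return ort.InferenceSession(
        model_path,
        sess_options=options,
        providers=["CPUExecutionProvider"],
    )


def compare_load_times(model_path):
    """Time session creation with and without graph optimizations."""
    _, default_s = time_call(
        "default optimizations", lambda: load_session(model_path)
    )
    _, disabled_s = time_call(
        "optimizations disabled", lambda: load_session(model_path, True)
    )
    return default_s, disabled_s
```

Calling `compare_load_times("model.onnx")` (the path is a placeholder) prints both timings. If the second load is much faster, the slowdown is likely in an optimization pass; if both are slow, the cause is probably elsewhere.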
Versions
onnx==1.17.0
onnxruntime==1.20.1
onnxruntime-gpu==1.20.1