model concurrency between different providers #11527

victorehailo · 2022-05-15T07:58:35Z

victorehailo
May 15, 2022

hello,

I am trying to understand the way the ONNX executes a model graph that is composed of multiple provider nodes.

from a performance standpoint, I would expect the runtime to break the supplied batch and execute nodes that belong to different providers in parallel so the backend accelerators can be fully utilized.
trying to understand this I saw only the option to execute the session itself multiple times. in this way, the runtime can't guarantee the parallel use of the backend accelerators. moreover, the runtime can create congestion on a single EP in case the session executions are synced

I want to understand if this is the case?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model concurrency between different providers #11527

{{title}}

Replies: 0 comments

Select a reply

model concurrency between different providers #11527

victorehailo May 15, 2022

Replies: 0 comments

victorehailo
May 15, 2022