How to deploy multiple model using nvidia dynamo graph of one architecture #3703

Nikhil-sarvam · 2025-10-17T17:44:02Z

Nikhil-sarvam
Oct 17, 2025

For example, I’m using a vLLM backend with the agg_router.yaml deployment, which follows a specific architecture. As far as I know, the router is global in this setup. Now, I need to deploy another model with the same architecture on the same cluster. How can I create a separate deployment and routing mechanism without causing requests to be misrouted to the other model?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How to deploy multiple model using nvidia dynamo graph of one architecture #3703

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

How to deploy multiple model using nvidia dynamo graph of one architecture #3703

Uh oh!

Nikhil-sarvam Oct 17, 2025

Replies: 0 comments

Nikhil-sarvam
Oct 17, 2025