-
First of all, thank you to the creators and maintainers of Hera. This has been such a godsend. I was wondering if it's possible to use Argo for model serving as opposed to training/batch jobs. It is possible to deploy a k8s app that hosts a dockerised fastai endpoint (one that can autoscale according to requests). I'm hoping that, given the underlying k8s architecture, there is a way to make a persistent (say, FastAPI) endpoint with Hera. If so, how would I do that? TIA.
Replies: 1 comment 1 reply
-
For reference, this was x-posted to the #argo-workflows Slack, where I responded and said that while Argo can quite easily fit a batch serving model, for API-driven real-time serving, KServe/Seldon/etc. are a better fit; I have used them respectively for batch vs real-time inference.

You can also use Workflows to create `Deployments` or `InferenceServices` (i.e. your MLOps pipelines), but CD may suffice for that too.

In short, there are purpose-built tool stacks for each of these things, although you can certainly mix some parts together.
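To illustrate the "use Workflows to create Deployments" option: Argo Workflows has a `resource` template type that can submit arbitrary Kubernetes manifests, so a Workflow step can create a long-lived serving Deployment and then exit. This is only a sketch; the names, image, and port below are placeholders, not anything from this thread:

```yaml
# Hypothetical Argo Workflow whose single step creates a Deployment.
# The Workflow completes once the Deployment is created; the Deployment
# itself persists and serves traffic independently of the Workflow.
apiVersion: argoproj.io/v1alpha1
kind: Workflow
metadata:
  generateName: deploy-model-server-
spec:
  entrypoint: create-deployment
  templates:
    - name: create-deployment
      resource:
        action: create        # "apply" would make re-runs idempotent
        manifest: |
          apiVersion: apps/v1
          kind: Deployment
          metadata:
            name: model-server              # placeholder name
          spec:
            replicas: 1
            selector:
              matchLabels:
                app: model-server
            template:
              metadata:
                labels:
                  app: model-server
              spec:
                containers:
                  - name: server
                    image: example.io/fastapi-model:latest  # placeholder image
                    ports:
                      - containerPort: 8000
```

Note that the persistent endpoint is still an ordinary Deployment (you'd add a Service, and an HPA for request-driven autoscaling), which is why KServe/Seldon or plain CD remain the natural home for the long-lived piece; the Workflow only automates its creation.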
Also, this sounds like it should've been a Discussion rather than an issue.

EDIT: This has now been converted into a Discussion.