csi

Feat: InferenceService reconcile function (kubeflow#541 )

Nov 11, 2024

a061879 · Nov 11, 2024

This branch is 6 commits ahead of, 45 commits behind kubeflow/model-registry:main.

Name	Name	Last commit message	Last commit date
parent directory ..
config/samples	config/samples	Add first draft of model registry kserve custom storage initializer (k…	Mar 8, 2024
pkg	pkg	feat(csi): support multiple model registries (kubeflow#508 )	Oct 28, 2024
scripts	scripts	feat(csi): support multiple model registries (kubeflow#508 )	Oct 28, 2024
test	test	feat(csi): support multiple model registries (kubeflow#508 )	Oct 28, 2024
.gitignore	.gitignore	Update CSI (kubeflow#154 )	Jul 25, 2024
Dockerfile	Dockerfile	Update CSI (kubeflow#154 )	Jul 25, 2024
Dockerfile.dev	Dockerfile.dev	Update CSI (kubeflow#154 )	Jul 25, 2024
GET_STARTED.md	GET_STARTED.md	feat(csi): add CSI manifests (kubeflow#491 )	Oct 22, 2024
Makefile	Makefile	csi: drop go from bin (kubeflow#292 )	Aug 22, 2024
README.md	README.md	docs: typo fix (kubeflow#509 )	Oct 28, 2024
go.mod	go.mod	Feat: InferenceService reconcile function (kubeflow#541 )	Nov 11, 2024
go.sum	go.sum	Feat: InferenceService reconcile function (kubeflow#541 )	Nov 11, 2024
main.go	main.go	feat(csi): support multiple model registries (kubeflow#508 )	Oct 28, 2024

README.md

Model Registry Custom Storage Initializer

This is a Model Registry specific implementation of a KServe Custom Storage Initializer (CSI). More details on what Custom Storage Initializer is can be found in the KServe doc.

Implementation

The Model Registry CSI is a simple Go executable that basically takes two positional arguments:

Source URI: identifies the storageUri set in the InferenceService, this must be a model-registry custom URI, i.e., model-registry://...
Destination Path: the location where the model should be stored, e.g., /mnt/models

The core logic of this CSI is pretty simple and it consists of three main steps:

Parse the custom URI in order to extract registered model name and model version
Query the model registry in order to retrieve the original model location (e.g., http, s3, gcs and so on)
Use github.com/kserve/kserve/pkg/agent/storage pkg to actually download the model from well-known protocols.

Workflow

The below sequence diagram should highlight the workflow when this CSI is injected into the KServe pod deployment.

sequenceDiagram
    actor U as User
    participant MR as Model Registry
    participant KC as KServe Controller
    participant MD as Model Deployment (Pod)
    participant MRSI as Model Registry Storage Initializer
    U->>+MR: Register ML Model
    MR-->>-U: Indexed Model
    U->>U: Create InferenceService CR
    Note right of U: The InferenceService should<br/>point to the model registry<br/>indexed model, e.g.,:<br/> model-registry://<model-registry-url>/<model>/<version>
    KC->>KC: React to InferenceService creation
    KC->>+MD: Create Model Deployment
    MD->>+MRSI: Initialization (Download Model)
    MRSI->>MRSI: Parse URI
    MRSI->>+MR: Fetch Model Metadata
    MR-->>-MRSI: Model Metadata
    Note over MR,MRSI: The main information that is fetched is the artifact URI which specifies the real model location, e.g.,: https://.. or s3://...
    MRSI->>MRSI: Download Model
    Note right of MRSI: The storage initializer will use<br/> the KServe default providers<br/> to download the model<br/> based on the artifact URI
    MRSI-->>-MD: Downloaded Model
    MD->>-MD: Deploy Model

Get Started

Please look at Get Started guide for a very simple quickstart that showcases how this custom storage initializer can be used for ML models serving operations.

Development

Build the executable

You can just run:

make build

Note

The project is currently using a fixed tag of the root Model Registry. You can use the local one by simply adding replace github.com/kubeflow/model-registry v0.2.1-alpha => ../ in the go.mod file

Which wil create the executable under bin/mr-storage-initializer.

Run the executable

You can run main.go (without building the executable) by running:

./bin/mr-storage-initializer "model-registry://model-registry-url/model/version" "./"

or directly running the main.go skipping the previous step:

make SOURCE_URI=model-registry://model-registry-url/model/version DEST_PATH=./ run

Note

model-registry-url is optional, if not provided the value of MODEL_REGISTRY_BASE_URL env variable will be used.

Note

A Model Registry service should be up and running at localhost:8080.

Build container image

Using a fixed version of the model-registry library:

make docker-build

Or, using the local model-registry module:

make docker-build-dev

By default the container image name is quay.io/${USER}/model-registry-storage-initializer:latest but it can be overridden providing the IMG env variable, e.g., make IMG=abc/ORG/NAME:TAG docker-build.

Push container image

Issue the following command:

make [IMG=..] docker-push

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

csi

csi

README.md

Model Registry Custom Storage Initializer

Implementation

Workflow

Get Started

Development

Build the executable

Run the executable

Build container image

Push container image

Files

csi

Directory actions

More options

Directory actions

More options

Latest commit

History

csi

Folders and files

parent directory

README.md

Model Registry Custom Storage Initializer

Implementation

Workflow

Get Started

Development

Build the executable

Run the executable

Build container image

Push container image