Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add basic example of NIM with Run.ai inference #81

Open
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

mlsorensen
Copy link

No description provided.

@mlsorensen
Copy link
Author

Sorry about the tweaks, this should be ready for review.

run.ai/README.md Outdated Show resolved Hide resolved
run.ai/README.md Outdated Show resolved Hide resolved
run.ai/README.md Outdated Show resolved Hide resolved
run.ai/README.md Outdated Show resolved Hide resolved
run.ai/README.md Outdated Show resolved Hide resolved
run.ai/README.md Outdated Show resolved Hide resolved
Copy link

@resker resker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Apologies... probably should have trickled these in...

@resker resker mentioned this pull request Aug 30, 2024
Copy link

@resker resker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 non-binding

README.md Outdated
@@ -14,7 +14,7 @@ This repo showcases different ways NVIDIA NIMs can be deployed. This repo contai
| | **Open Source Platforms** | |
| | | [KServe](https://github.com/NVIDIA/nim-deploy/tree/main/kserve) | |
| | **Independent Software Vendors** | |
| | | Run.ai (coming soon) | |
| | | [Run.ai](./run.ai/README.md) | |
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI, I've made a lot of changes to this README in a prior draft PR. I'm abandoning that for favor of yours in general... will wait to amend this top level README until yours merges.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just FYI, I had been hard linking these URLs to better support the current copy/paste workflow for posting updates to NGC. This really only applies to the KServe README.

4. General NIM requirements: [NIM Prerequisites](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#prerequisites)
5. An NVIDIA AI Enterprise (NVAIE) License: [Sign up for NVAIE license](https://build.nvidia.com/meta/llama-3-8b-instruct?snippet_tab=Docker&signin=true&integrate_nim=true&self_hosted_api=true) or [Request a Free 90-Day NVAIE License](https://enterpriseproductregistration.nvidia.com/?LicType=EVAL&ProductFamily=NVAIEnterprise) through the NVIDIA Developer Program.
6. An NVIDIA NGC API Key: please follow the guidance in the [NVIDIA NIM Getting Started](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#option-2-from-ngc) documentation to generate a properly scoped API key if you haven't already.

Copy link

@resker resker Aug 30, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Prerequisites

Required: ✔️
Provided: ✅

Run.ai SaaS Air-gapped NVIDIA DGX Cloud
A conformant Kubernetes cluster (Run.ai K8s version requirements) ✔️ ✔️
Run.ai Control Plane and cluster(s) installed and operational ✔️ ✔️
Knative Serving installed and configured for the Run.ai scheduler ✔️ ✔️
NVIDIA GPU Operator installed ✔️ ✔️
General NIM requirements: [NIM Prerequisites](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#prerequisites ✔️ ✔️ ✔️
An NVIDIA AI Enterprise (NVAIE) License: Sign up for NVAIE license or Request a Free 90-Day NVAIE License through the NVIDIA ✔️ ✔️ ✔️
An NVIDIA NGC API Key: please follow the guidance in the NVIDIA NIM Getting Started documentation to generate a properly scoped API key if you haven't already. ✔️ ✔️ ✔️

Required: ✔️
Provided: ✅

Copy link

@resker resker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Adding knative serving to prerequisites

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants