-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add basic example of NIM with Run.ai inference #81
base: main
Are you sure you want to change the base?
Conversation
Sorry about the tweaks, this should be ready for review. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apologies... probably should have trickled these in...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 non-binding
README.md
Outdated
@@ -14,7 +14,7 @@ This repo showcases different ways NVIDIA NIMs can be deployed. This repo contai | |||
| | **Open Source Platforms** | | | |||
| | | [KServe](https://github.com/NVIDIA/nim-deploy/tree/main/kserve) | | | |||
| | **Independent Software Vendors** | | | |||
| | | Run.ai (coming soon) | | | |||
| | | [Run.ai](./run.ai/README.md) | | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FYI, I've made a lot of changes to this README in a prior draft PR. I'm abandoning that for favor of yours in general... will wait to amend this top level README until yours merges.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just FYI, I had been hard linking these URLs to better support the current copy/paste workflow for posting updates to NGC. This really only applies to the KServe README.
4. General NIM requirements: [NIM Prerequisites](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#prerequisites) | ||
5. An NVIDIA AI Enterprise (NVAIE) License: [Sign up for NVAIE license](https://build.nvidia.com/meta/llama-3-8b-instruct?snippet_tab=Docker&signin=true&integrate_nim=true&self_hosted_api=true) or [Request a Free 90-Day NVAIE License](https://enterpriseproductregistration.nvidia.com/?LicType=EVAL&ProductFamily=NVAIEnterprise) through the NVIDIA Developer Program. | ||
6. An NVIDIA NGC API Key: please follow the guidance in the [NVIDIA NIM Getting Started](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#option-2-from-ngc) documentation to generate a properly scoped API key if you haven't already. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Prerequisites
Required: ✔️
Provided: ✅
Run.ai SaaS | Air-gapped | NVIDIA DGX Cloud | |
---|---|---|---|
A conformant Kubernetes cluster (Run.ai K8s version requirements) | ✔️ | ✔️ | ✅ |
Run.ai Control Plane and cluster(s) installed and operational | ✔️ | ✔️ | ✅ |
Knative Serving installed and configured for the Run.ai scheduler | ✔️ | ✔️ | ✅ |
NVIDIA GPU Operator installed | ✔️ | ✔️ | ✅ |
General NIM requirements: [NIM Prerequisites](https://docs.nvidia.com/nim/large-language-models/latest/getting-started.html#prerequisites | ✔️ | ✔️ | ✔️ |
An NVIDIA AI Enterprise (NVAIE) License: Sign up for NVAIE license or Request a Free 90-Day NVAIE License through the NVIDIA | ✔️ | ✔️ | ✔️ |
An NVIDIA NGC API Key: please follow the guidance in the NVIDIA NIM Getting Started documentation to generate a properly scoped API key if you haven't already. | ✔️ | ✔️ | ✔️ |
Required: ✔️
Provided: ✅
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Adding knative serving to prerequisites
No description provided.