amazon/bottlerocket-aws-k8s-1.25-nvidia-x86_64 ami in eks cluster #4155
-
Environment: AWS Region: eu-west-3 This is my EKS managed node group:
This is my worker configuration;
The container inside my pod, which needs to run a workload on the GPU, uses a custom image built on nvidia/cuda:12.3.1-runtime-ubuntu20.04. I have confirmed that my pod is scheduled on the correct node group. However, my pod is unable to access the GPU. I am wondering if the AMI I’m using includes the nvidia-container-toolkit package. If not, how is the GPU exposed to the container inside the pod? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Using a Bottlerocker vended Do you mind sharing your Pod Configuration file which you use to deploy your |
Beta Was this translation helpful? Give feedback.
-
Thanks for your reply. I managed to find the issue; it was related to using pip3 to install modules on my base image. Apparently, that caused the symbolic links to break. I added a script to reestablish them, and everything works perfectly now. |
Beta Was this translation helpful? Give feedback.
Thanks for your reply. I managed to find the issue; it was related to using pip3 to install modules on my base image. Apparently, that caused the symbolic links to break. I added a script to reestablish them, and everything works perfectly now.