Problem with nvidia extension - Modules not found #9886
Hi, I'm following the guide for Talos with an NVIDIA GPU. I created an ISO from factory.talos.dev with the NVIDIA OSS extensions nvidia-open-gpu-kernel-modules and nvidia-container-toolkit. I installed onto a clean disk, then removed the ISO and rebooted several times. After enabling the kernel modules, Talos cannot find any of the NVIDIA modules. The error in the logs is:
Where does Talos put the kernel modules (nvidia.ko, nvidia-modeset.ko, nvidia-uvm.ko, nvidia-drm.ko, and nvidia-peermem.ko), so I can check whether they are actually present?
Replies: 2 comments 4 replies
When installing from the ISO, make sure you use a proper installer image: https://www.talos.dev/v1.8/talos-guides/install/boot-assets/#example-bare-metal-with-image-factory Alternatively, follow the wizard at https://factory.talos.dev/.
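As a rough illustration of the advice above, the sketch below builds an Image Factory installer image reference and wraps it in a machine config patch for `machine.install.image`. The schematic ID placeholder and the `v1.8.0` version are assumptions for illustration only; the real values come from the factory wizard output, and the helper names are hypothetical.

```python
import json

# Assumed host of the public Image Factory.
FACTORY_HOST = "factory.talos.dev"


def installer_image(schematic_id: str, version: str, host: str = FACTORY_HOST) -> str:
    """Return the installer image reference for a schematic ID and Talos version."""
    if not version.startswith("v"):
        version = "v" + version
    return f"{host}/installer/{schematic_id}:{version}"


def install_patch(image: str) -> dict:
    """Build a machine config patch that sets machine.install.image."""
    return {"machine": {"install": {"image": image}}}


if __name__ == "__main__":
    # "<schematic-id>" is a placeholder, not a value from this thread.
    image = installer_image("<schematic-id>", "v1.8.0")
    print(json.dumps(install_patch(image), indent=2))
```

The point of the patch is that the installed system is built from the installer image named in the machine config, so the extensions have to be part of that image and not only of the ISO used to boot.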
Thank you for your response. I did use the factory wizard to create the ISO, selecting both NVIDIA extensions together with siderolabs/vmtoolsd-guest-agent. The resulting ISO image is around 306 MB, compared to the default ISO image at around 108 MB. However, I still get the same error. I deleted and recreated the virtual disk to be 100% sure there was no old installation on it. Should I be able to list a folder on the node containing the modules to verify this?
In an air-gapped environment, you can mirror that image to your internal registry and use it, like any other external image the cluster might need.
The ISO doesn't contain the installer image.
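For the air-gapped case mentioned above, a minimal sketch of how the mirrored image reference could be derived: it only swaps the registry host in the reference and does not copy the image itself. The registry name `registry.internal.example` and the helper name are hypothetical.

```python
def mirrored_image(image: str, internal_registry: str) -> str:
    """Swap the registry host of an image reference for an internal registry host."""
    host, sep, rest = image.partition("/")
    if not sep:
        raise ValueError(f"image reference has no registry host: {image!r}")
    # Keep the repository path and tag; replace only the host part.
    return f"{internal_registry}/{rest}"


if __name__ == "__main__":
    # Both values are placeholders for illustration.
    source = "factory.talos.dev/installer/<schematic-id>:v1.8.0"
    print(mirrored_image(source, "registry.internal.example"))
```

The resulting reference would then be used wherever the machine config names the installer image, once the image has actually been pushed to the internal registry.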