You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the report @biarne-a. This may happen if some GPUs in your kubernetes cluster are in a bad state (i.e., they show up under Capacity, but not Allocatable, see NVIDIA/k8s-device-plugin#75). Restarting the Nvidia device plugin on the bad node may fix.
Though I do agree, we should have better logging here instead of failing with this assertion error.
Up to know, the command
sky show-gpus
was working fine. But this morning I got the following error:Version & Commit info:
sky -v
: 1.0.0.dev20250110sky -c
: fd1ac0eThe text was updated successfully, but these errors were encountered: