Worker Node Failure

  • Take me to the Lecture

  • Check the status of the nodes in the cluster: are they Ready or NotReady?

    kubectl get nodes
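
On a larger cluster it can help to filter the listing down to the nodes that need attention. The following is a minimal sketch; the helper name `not_ready_nodes` is hypothetical, and it assumes the default column layout of `kubectl get nodes --no-headers`, where the second column is STATUS.

```shell
# Hypothetical helper: reads `kubectl get nodes --no-headers` output on stdin
# and prints the names of nodes whose STATUS does not start with "Ready".
# (A cordoned but healthy node shows "Ready,SchedulingDisabled" and is skipped.)
not_ready_nodes() {
  awk '$2 !~ /^Ready/ { print $1 }'
}

# Usage, on a machine with cluster access:
#   kubectl get nodes --no-headers | not_ready_nodes
```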
    
  • If a node is NotReady, check the LastHeartbeatTime in its conditions to estimate when the node might have crashed

    kubectl describe node worker-1
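
To read only the heartbeat instead of scanning the full describe output, a JSONPath query can pull it from the node's Ready condition. This is a sketch under assumptions: the function name `node_heartbeat` is hypothetical, and `worker-1` is the example node name used above.

```shell
# Hypothetical helper: prints lastHeartbeatTime of the Ready condition
# for the node given as the first argument.
node_heartbeat() {
  kubectl get node "$1" \
    -o jsonpath='{.status.conditions[?(@.type=="Ready")].lastHeartbeatTime}'
}

# Usage, on a machine with cluster access:
#   node_heartbeat worker-1
```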
    


  • Check CPU and memory usage on the node with top, and disk space with df -h
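
The resource checks can be collected into one snapshot taken on the worker node itself. This is only a sketch: the function name `node_resources` is hypothetical, and `free -m` is an addition not mentioned in the original step (it assumes a Linux node with procps installed).

```shell
# Hypothetical helper: one-shot snapshot of CPU, memory, and disk usage.
node_resources() {
  top -b -n 1 | head -n 15   # single batch-mode iteration: load and busiest processes
  free -m                    # memory and swap usage in MiB (assumed available)
  df -h                      # disk usage per filesystem, human-readable
}

# Usage, after logging in to the worker node:
#   node_resources
```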


  • Check the status and the logs of the kubelet for the possible issues.

    service kubelet status
    
    sudo journalctl -u kubelet
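
On systemd-based nodes the two checks can be combined so that logs are printed only when the kubelet is not running. This is a sketch rather than a definitive procedure: the function name `kubelet_health` is hypothetical, the 50-line limit is an arbitrary choice, and it assumes the unit is named `kubelet` and that you run it as root (or with sudo).

```shell
# Hypothetical helper: reports whether the kubelet unit is active and,
# if it is not, prints the most recent log lines for it.
kubelet_health() {
  if systemctl is-active --quiet kubelet; then
    echo "kubelet is active"
  else
    echo "kubelet is not active; recent log lines:"
    journalctl -u kubelet --no-pager -n 50
  fi
}

# Usage, on the worker node as root:
#   kubelet_health
```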
    


  • Check the kubelet certificates: make sure they have not expired, belong to the right group, and were issued by the right CA.

    openssl x509 -in /var/lib/kubelet/worker-1.crt -text
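
The full `-text` dump is long, so it can be quicker to print only the fields this step cares about. The sketch below uses a hypothetical function name, `check_cert`. For a kubelet client certificate, the subject is normally expected to carry `O = system:nodes` (the group) and `CN = system:node:<node name>`, and the issuer should match the cluster CA.

```shell
# Hypothetical helper: prints subject, issuer, and expiry date of a
# certificate, then reports whether it has already expired.
check_cert() {
  cert="$1"
  openssl x509 -in "$cert" -noout -subject -issuer -enddate
  # -checkend 0 exits non-zero if the certificate is already expired
  if openssl x509 -in "$cert" -noout -checkend 0 >/dev/null; then
    echo "certificate is still valid"
  else
    echo "certificate has expired"
    return 1
  fi
}

# Usage, on the worker node:
#   check_cert /var/lib/kubelet/worker-1.crt
```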
    
