[问题] Device Plugin allocate 选出的 pod 是否会跟 Kubelet 绑定的不一致 #16

orainxiong · 2019-09-24T10:40:53Z

看 Kubelet 调用 allocate 的实现

		resp, err := eI.e.allocate(devs)
                 ....
		m.podDevices.insert(podUID, contName, resource, allocDevices, resp.ContainerResponses[0])

deviceplug.allocate 会

会列出该节点中所有状态为 Pending 并且ALIYUN_COM_GPU_MEM_ASSIGNED为false的 GPU Share Pod
选择出其中 Pod Annotation 的ALIYUN_COM_GPU_MEM_POD的数量与 Allocate 申请数量一致的 Pod。如果有多个符合这种条件的 Pod，就会选择其中ALIYUN_COM_GPU_MEM_ASSUME_TIME最早的 Pod。
将该 Pod 的 annotation ALIYUN_COM_GPU_MEM_ASSIGNED设置为true，并且将 Pod annotation 中的 GPU 信息转化为环境变量返回给 Kubelet 用以真正的创建 Pod。

但是在 Kubelet 调用deviceplug.allocate时已经确定了podUID. 两者是否会不同?

The text was updated successfully, but these errors were encountered:

cheyang · 2019-10-08T05:55:21Z

这里依赖是Pod在调度器是按顺序bind的，而且在bind过程中已经加了锁。是能够保证顺序性的。

payall4u · 2021-07-27T09:30:02Z

这里依赖是Pod在调度器是按顺序bind的，而且在bind过程中已经加了锁。是能够保证顺序性的。

@cheyang
bind是有顺序的，但kubelet不一定会按照bind的顺序创建pod。

xiaoxubeii · 2021-08-05T02:59:30Z

看 Kubelet 调用 allocate 的实现
		resp, err := eI.e.allocate(devs)
                 ....
		m.podDevices.insert(podUID, contName, resource, allocDevices, resp.ContainerResponses[0])
deviceplug.allocate 会

会列出该节点中所有状态为 Pending 并且ALIYUN_COM_GPU_MEM_ASSIGNED为false的 GPU Share Pod

选择出其中 Pod Annotation 的ALIYUN_COM_GPU_MEM_POD的数量与 Allocate 申请数量一致的 Pod。如果有多个符合这种条件的 Pod，就会选择其中ALIYUN_COM_GPU_MEM_ASSUME_TIME最早的 Pod。

将该 Pod 的 annotation ALIYUN_COM_GPU_MEM_ASSIGNED设置为true，并且将 Pod annotation 中的 GPU 信息转化为环境变量返回给 Kubelet 用以真正的创建 Pod。

但是在 Kubelet 调用deviceplug.allocate时已经确定了podUID. 两者是否会不同?

是的，这里实现可能会造成不一致。kubelet device plugin allocate 是按照 container 调用，但在 gpushare-device-plugin 是按照自有逻辑找到 candidate pod，不一定是 kubelet 调用的那个 pod。并且如果单 pod 下有多个 container 申请了 gpu 资源，这里肯定匹配不到。

gpushare-device-plugin/pkg/gpu/nvidia/allocate.go

Lines 54 to 88 in 5b68fe2

    
           for _, req := range reqs.ContainerRequests { 
        
           	podReqGPU += uint(len(req.DevicesIDs)) 
        
           } 
        
           log.Infof("RequestPodGPUs: %d", podReqGPU) 
        
           m.Lock() 
        
           defer m.Unlock() 
        
           log.Infoln("checking...") 
        
           pods, err := getCandidatePods(m.queryKubelet, m.kubeletClient) 
        
           if err != nil { 
        
           	log.Infof("invalid allocation requst: Failed to find candidate pods due to %v", err) 
        
           	return buildErrResponse(reqs, podReqGPU), nil 
        
           } 
        
           if log.V(4) { 
        
           	for _, pod := range pods { 
        
           		log.Infof("Pod %s in ns %s request GPU Memory %d with timestamp %v", 
        
           			pod.Name, 
        
           			pod.Namespace, 
        
           			getGPUMemoryFromPodResource(pod), 
        
           			getAssumeTimeFromPodAnnotation(pod)) 
        
           	} 
        
           } 
        
           for _, pod := range pods { 
        
           	if getGPUMemoryFromPodResource(pod) == podReqGPU { 
        
           		log.Infof("Found Assumed GPU shared Pod %s in ns %s with GPU Memory %d", 
        
           			pod.Name, 
        
           			pod.Namespace, 
        
           			podReqGPU) 
        
           		assumePod = pod 
        
           		found = true 
        
           		break 
        
           	} 
        
           }

gaoyangcaiji · 2022-07-14T12:02:31Z

请问大佬有啥好的解决办法吗？这个不一致的问题

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[问题] Device Plugin allocate 选出的 pod 是否会跟 Kubelet 绑定的不一致 #16

[问题] Device Plugin allocate 选出的 pod 是否会跟 Kubelet 绑定的不一致 #16

orainxiong commented Sep 24, 2019

cheyang commented Oct 8, 2019

payall4u commented Jul 27, 2021

xiaoxubeii commented Aug 5, 2021

gaoyangcaiji commented Jul 14, 2022

[问题] Device Plugin allocate 选出的 pod 是否会跟 Kubelet 绑定的不一致 #16

[问题] Device Plugin allocate 选出的 pod 是否会跟 Kubelet 绑定的不一致 #16

Comments

orainxiong commented Sep 24, 2019

cheyang commented Oct 8, 2019

payall4u commented Jul 27, 2021

xiaoxubeii commented Aug 5, 2021

gaoyangcaiji commented Jul 14, 2022