Reduce the delete delay with an exponential wait #4512

Open · wants to merge 1 commit into main
Conversation

@abel-von commented Nov 5, 2024

Currently in `killContainer`, after sending SIGKILL, we have to wait at least 100 ms and only then check whether the process has exited. That is a little long, especially for pods with many containers in them: when you `crictl rmp -f` the pod, the delay may even exceed 1 s.

Here we change it to an exponential wait, where the maximum single wait is 2 s (2^11 ms), so the total wait time is approximately 4 s, which I think is on the same order as the current maximum of 10 s (100 ms × 100).

Signed-off-by: Abel Feng <[email protected]>
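
To make the proposal concrete, here is a minimal sketch of such an exponential wait (a hypothetical illustration, not the PR's actual diff; `waitKilled` and the `unix.Kill(pid, 0)` liveness probe are assumptions made for the example):

```go
package main

import (
	"time"

	"golang.org/x/sys/unix"
)

// waitKilled polls for process exit after SIGKILL with a doubling
// sleep: 1ms, 2ms, ..., capped at ~2s (2048 ms = 2^11 ms). The
// doublings sum to ~4s, the total wait described above. Signal 0
// performs error checking only: ESRCH means the pid no longer exists
// (a zombie that has exited but not been reaped still counts as
// existing for this check).
func waitKilled(pid int) bool {
	delay := time.Millisecond
	deadline := time.Now().Add(4 * time.Second)
	for time.Now().Before(deadline) {
		if err := unix.Kill(pid, 0); err == unix.ESRCH {
			return true
		}
		time.Sleep(delay)
		if delay < 2*time.Second {
			delay *= 2
		}
	}
	return false
}
```

The doubling steps 1 ms, 2 ms, ..., 2048 ms sum to 4095 ms, which is where the "approximately 4 s" total above comes from.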
@cyphar (Member) commented Nov 5, 2024

If the process is going to take 1s to die, then speeding up the rate of sending signals won't make a difference (processes can't ignore SIGKILL, so it's all down to how long it takes the kernel to kill everything).

Checking whether the process is dead at a constant rate seems like it'll be more consistent on average if the death time is randomly distributed from 0 to a few seconds. I could see an argument that we should check a bit more often at the very start (I expect that process deaths happen more quickly soon after the signal is sent, with a long-ish right tail), but I don't think switching entirely to exponential backoff is the correct approach.
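
To illustrate the alternative being suggested here, a front-loaded schedule could check densely right after the signal while keeping the steady-state rate constant. Continuing the sketch above (same file and imports; the specific intervals are invented for illustration):

```go
// Hypothetical front-loaded schedule: dense checks right after SIGKILL,
// where most deaths are expected, then a constant 100ms interval.
var earlyChecks = []time.Duration{
	1 * time.Millisecond, 2 * time.Millisecond, 5 * time.Millisecond,
	10 * time.Millisecond, 25 * time.Millisecond, 50 * time.Millisecond,
}

func nextInterval(attempt int) time.Duration {
	if attempt < len(earlyChecks) {
		return earlyChecks[attempt]
	}
	return 100 * time.Millisecond // constant steady state, no backoff
}
```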

@kolyshkin (Contributor) commented

Ideally, for new kernels we should poll on pidfd (kernel marks pidfd readable when the process is terminated).
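
On kernels with pidfd support (pidfd_open is Linux 5.3+), the wait can be event-driven instead of polled. A rough sketch using golang.org/x/sys/unix (the function name and error handling are illustrative, not runc's code):

```go
package main

import (
	"fmt"
	"time"

	"golang.org/x/sys/unix"
)

// waitPidfd blocks until pid terminates or the timeout expires. No
// sleep loop is needed: the kernel marks the pidfd readable (POLLIN)
// once the process has exited.
func waitPidfd(pid int, timeout time.Duration) error {
	// pidfd_open fails with ESRCH if the process is already gone and
	// with ENOSYS on kernels older than 5.3 (fall back to polling then).
	pidfd, err := unix.PidfdOpen(pid, 0)
	if err != nil {
		return fmt.Errorf("pidfd_open %d: %w", pid, err)
	}
	defer unix.Close(pidfd)

	fds := []unix.PollFd{{Fd: int32(pidfd), Events: unix.POLLIN}}
	n, err := unix.Poll(fds, int(timeout.Milliseconds()))
	if err != nil {
		return err // real code should retry on EINTR
	}
	if n == 0 {
		return fmt.Errorf("pid %d still running after %v", pid, timeout)
	}
	return nil // pidfd readable: process has terminated
}
```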

@abel-von (Author) commented

Sorry, I think my description was unclear. The issue is that the container can actually exit very quickly (perhaps in several milliseconds), but runc has to wait at least 100 ms to confirm that it has exited. @cyphar

@abel-von (Author) commented

> Ideally, for new kernels we should poll on pidfd (kernel marks pidfd readable when the process is terminated).

Agreed, but for old kernels the baseline wait still seems too long: every container takes at least 100 ms to kill, no matter how quickly it actually exits. @kolyshkin

@lifubang (Member) commented

> Sorry, I think my description was unclear. The issue is that the container can actually exit very quickly (perhaps in several milliseconds), but runc has to wait at least 100 ms to confirm that it has exited. @cyphar

@abel-von Could you please test whether #4517 meets your requirement on a lightly loaded machine? If not, I think making the first check after 10 ms would be enough.
On a heavily loaded machine, I think we should not send signals to the process at high frequency.
