Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature: Throttling job creation #161

Open
1337andre opened this issue Apr 13, 2022 · 6 comments
Open

Feature: Throttling job creation #161

1337andre opened this issue Apr 13, 2022 · 6 comments
Assignees
Labels
done Code pushed to develop branch feature New feature
Milestone

Comments

@1337andre
Copy link

We have a medium cluster with 20 nodes. When we try to cache e.g. CRD with 50 images kube-fledged will span 1000 jobs to pull that images. We can see that API server get in trouble and some workloads e.g. redis will have problems to get HA redis-cluster working. Does anyone have similar problems? Is it possible to throttling job creation?

@bkupidura
Copy link

I observed same behavior in my home lab. When bumping version of all my images (~21 managed by CRD), my kube api starts beeing unresponsible.

It would be nice if we can limit how many images can be downloaded in same time.

@senthilrch
Copy link
Owner

This is a much needed feature. Agreed.

@linuraj
Copy link

linuraj commented Apr 28, 2022

we tried throttling pod counts via ResourceQuota in the kube-fledged namespace. That didn't help either. Appreciate your help!

@senthilrch senthilrch changed the title Feature/Question possible to throttling job creation? Feature: Throttling job creation Nov 11, 2022
@senthilrch senthilrch self-assigned this Nov 11, 2022
@senthilrch senthilrch added this to the v0.11.0 milestone Nov 11, 2022
@senthilrch
Copy link
Owner

I've implented a solution for throttling jobs created by kube-fledged.
This will be delivered in v0.11.0 release

@senthilrch senthilrch added the done Code pushed to develop branch label Mar 5, 2023
@aledeulo
Copy link

Hi, I would like to ask when you've planned to release the v0.11.0 version? Thank you very much.

@thomson131
Copy link

@senthilrch , firstly thank you for your work on this project, looking forward to the next release.

For those using this project, and waiting for the next release to better handle job throttling... a potential workaround is to apply both a pod quota and count/job.batch quota to the namespace. The operator seems to handle the job creation gracefully, waiting for job resources to become available and working through the imageCache list until completion within the resource constraints.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
done Code pushed to develop branch feature New feature
Projects
None yet
Development

No branches or pull requests

6 participants