deck-chores should trigger rediscovery of containers when task errors out with critical exception container not running #57
Comments
am i understanding correctly that this was (probably?) the chain of events:
please correct me if i got a false impression. also, these questions come to mind:
Thanks for the questions, let me do my best to explain the setup and events.
Environment:
Activities:
Errors appearing: container is not running
Was the job executed five minutes later? - No
it's clear that the stop of the containers that serve your api service went unnoticed to
i have no clue whether the fact that you deploy in Swarm mode is relevant here. in any case, i will not try to recreate it, as i don't intend to support it.
the only thing i can suggest for now is, that you run
if we fail to figure out the root cause, an option might be to exit
Sorry I missed this update. I tried figuring out the root cause, but with no luck. I think an option to exit the container when a critical error is encountered may be a good idea.
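To make the exit option discussed above concrete, here is a minimal sketch, not deck-chores' actual code. The names `CriticalJobError`, `run_job`, and `exit_on_critical` are hypothetical, and the sketch assumes that something outside the process (for example a restart policy) starts it again, so that containers are discovered afresh on startup.

```python
import sys


class CriticalJobError(Exception):
    """Hypothetical stand-in for a critical error such as 'container is not running'."""


def run_job(execute, exit_on_critical=False):
    """Run one job callable; optionally terminate the process on a critical error.

    Returns True on success and False when a critical error was logged
    but the process was kept alive.
    """
    try:
        execute()
    except CriticalJobError as exc:
        print(f"critical error while running job: {exc}", file=sys.stderr)
        if exit_on_critical:
            # Assumption: a supervisor restarts the process afterwards,
            # and the fresh process rediscovers containers at startup.
            raise SystemExit(1) from exc
        return False
    return True
```

With `exit_on_critical=False` the behaviour matches what is described in this thread: the error is reported and the stale job stays registered.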
We ran into an issue where the deck-chores instance was holding on to a container that had been restarted, and it was not able to recover from the failure. I am not sure whether docker emitted an event and the container didn't receive it, or whether there was a race condition and the event was lost. At a minimum, if deck-chores already determines that the container is not running, can it remove that container from its tasks and rediscover the containers?
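The behaviour requested here could look roughly like the sketch below. It is an illustration only, under stated assumptions: `ContainerNotRunning`, `JobRegistry`, and the `discover` callable are hypothetical names and do not reflect deck-chores' internals or the Docker API.

```python
from typing import Callable, Dict, List


class ContainerNotRunning(Exception):
    """Hypothetical stand-in for the 'container is not running' error."""


class JobRegistry:
    """Maps container ids to job names and repairs itself on stale entries."""

    def __init__(self, discover: Callable[[], Dict[str, List[str]]]):
        # discover() is assumed to return {container_id: [job names]}
        # for the containers that are running right now.
        self._discover = discover
        self.jobs: Dict[str, List[str]] = dict(discover())

    def run(self, container_id: str, job_name: str,
            execute: Callable[[str, str], None]) -> bool:
        try:
            execute(container_id, job_name)
            return True
        except ContainerNotRunning:
            # Drop the stale container, then pick up any containers
            # that appeared in the meantime (e.g. after a restart).
            self.jobs.pop(container_id, None)
            for cid, names in self._discover().items():
                self.jobs.setdefault(cid, names)
            return False
```

The point of this design is that a missed or lost event no longer leaves a job bound to a dead container forever: the first failed execution triggers the cleanup and the rediscovery.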