Add a post-start script to check node and cluster health #89
We have created an issue in Pivotal Tracker to manage this. Unfortunately, the Pivotal Tracker project is private, so you may be unable to view the contents of the story. The labels on this GitHub issue will be updated when the story is started.
As the PR currently stands,
I want to review the initial scenario that led to this issue before I make specific proposals for improving it. cc @michaelklishin for visibility, no action required
Hi @gerhard, as I understand you, there are some valid cases where a node can come up but the cluster is not fully healthy (meaning we cannot run the cluster check in What do you think about running only the node check (and not the cluster check) in So I agree we should figure out why the nodes failed in the first place. Do you have a starting point for investigating this?
I will have more time for this on Monday. Some relevant context for that conversation:
Thank you, Michael. If I recall correctly, the Erlang distribution has to be running for Mnesia to be able to synchronise, so it's a safe
I agree that it is safe but
Hey @MatthiasWinzeler, thanks for the PR, and apologies for the late reply. At the moment, For that guarantee, we need to wait until Mnesia has successfully synchronised and then execute the Coming back to your PR, as far as I understand, adding a The interesting part is to understand why, in your case, the node is failing; from the logs you posted, nothing seems to be wrong. Do you think you could provide other logs about this issue? cc @gerhard
@nodo Thanks, sounds reasonable to me. Which additional logs would be of interest for the investigation?
@MatthiasWinzeler - it would be interesting to check the Also, are you OK to close the PR for now and continue the discussion on the issue you opened here?
fix for #87