Only delay the restart of fdbserver if the process exited with an exit code other than 0 #11802
base: main
…t code other than 0
Result of foundationdb-pr-clang-ide on Linux CentOS 7
LGTM
Result of foundationdb-pr-clang on Linux CentOS 7
Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x
Result of foundationdb-pr-clang-arm on Linux CentOS 7
Result of foundationdb-pr-cluster-tests on Linux CentOS 7
Result of foundationdb-pr-macos on macOS Ventura 13.x
Result of foundationdb-pr on Linux CentOS 7
Change looks good, but can you add how you tested this change to the PR description?
> Change looks good. But can you add how you tested this change in the pr description?

I haven't tested the changes yet. I'll run the tests manually and provide the logs afterwards.
Sounds good. Let me know once your testing is done, and we can merge then.
Tested the code with a small FDB cluster. Here are the logs from fdbmonitor:
Time="1732523616.910170" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 9 exited 0, restarting in 0 seconds
Time="1732523616.910459" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Launching /usr/bin/fdbserver (207) for fdbserver.1
Time="1732523617.000728" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": FDBD joined cluster.
Time="1732523622.951968" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 207 exited 0, restarting in 0 seconds
Time="1732523622.952255" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Launching /usr/bin/fdbserver (317) for fdbserver.1
Time="1732523623.043013" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": FDBD joined cluster.
Time="1732523630.953343" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 317 exited 0, restarting in 0 seconds
Time="1732523630.953650" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Launching /usr/bin/fdbserver (427) for fdbserver.1
Time="1732523631.044001" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": FDBD joined cluster.
For testing, I ran the following command in fdbcli:
kill; kill <address:port> ; sleep 3; kill <address:port> ; sleep 5; kill <address:port>
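A manual test like this can also be checked mechanically. The sketch below is not part of the PR. It assumes the exit message always has the shape seen in the logs above (`Process <pid> exited <code>, restarting in <n> seconds`), and the names `parse_restarts` and `delayed_clean_exits` are made up for illustration. The sample lines are copied from the logs in this comment.

```python
import re

# Assumption: exit messages follow the format seen in the logs above.
EXIT_RE = re.compile(
    r'Process="(?P<name>[^"]+)": '
    r'Process (?P<pid>\d+) exited (?P<code>-?\d+), '
    r'restarting in (?P<delay>\d+) seconds'
)

def parse_restarts(lines):
    """Return (process name, pid, exit code, restart delay) per exit message."""
    restarts = []
    for line in lines:
        m = EXIT_RE.search(line)
        if m:
            restarts.append(
                (m["name"], int(m["pid"]), int(m["code"]), int(m["delay"]))
            )
    return restarts

def delayed_clean_exits(restarts):
    """Exits with code 0 that were still delayed; should be empty with this change."""
    return [r for r in restarts if r[2] == 0 and r[3] != 0]

sample = [
    'Time="1732523616.910170" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 9 exited 0, restarting in 0 seconds',
    'Time="1732523616.910459" Severity="10" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Launching /usr/bin/fdbserver (207) for fdbserver.1',
    'Time="1732523622.951968" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 207 exited 0, restarting in 0 seconds',
    'Time="1732523630.953343" Severity="20" LogGroup="jscheuermann-jdev" Process="fdbserver.1": Process 317 exited 0, restarting in 0 seconds',
]

restarts = parse_restarts(sample)
print(len(restarts), "exit messages,", len(delayed_clean_exits(restarts)), "delayed clean exits")
```

Lines that do not match the pattern, such as the `Launching` and `FDBD joined cluster.` messages, are skipped.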
Fixes: #11775
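
As a rough illustration of the behavior named in the PR title, the rule can be sketched as below. This is not the fdbmonitor implementation. `restart_delay` and `configured_delay` are hypothetical names, and how the delay for a failed process is actually computed is not covered here.

```python
def restart_delay(exit_code: int, configured_delay: int) -> int:
    """Hypothetical rule: restart at once after a clean exit, otherwise wait.

    exit_code: exit status of the fdbserver process.
    configured_delay: delay in seconds that would apply after a failure
    (an assumed input, not a name taken from fdbmonitor).
    """
    return 0 if exit_code == 0 else configured_delay

# A clean exit, such as the kills issued from fdbcli in the test, gets no delay.
print(restart_delay(0, 60))
# A non-zero exit code keeps the delay.
print(restart_delay(1, 60))
```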