Fix azure sbd stonith-timeout and add concurrent-fencing #290
LGTM
There is still the typo [sdb] instead of [sbd] on line 42.
2428605 to 3a41d6e
LGTM
3a41d6e to fa9a362
fa9a362 to b54faed
LGTM
As discovered by @jankohoutek, stonith-timeout was set incorrectly for both the Azure SBD and native fencing cases. This PR attempts to fix that.
This PR also enables concurrent fencing, as suggested in https://learn.microsoft.com/en-us/azure/sap/workloads/high-availability-guide-suse-pacemaker?tabs=msi#create-a-fencing-device-on-the-pacemaker-cluster
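For reviewers who want to see roughly what these two properties look like on the cluster, here is a minimal sketch that builds the corresponding `crm configure property` commands per fencing type. The helper name `fencing_property_commands`, the `ASSUMED_STONITH_TIMEOUT` table, and the timeout values (144 for SBD, 900 for the native Azure fence agent) are assumptions for illustration, believed to match the linked Microsoft guide; they are not taken from this PR's diff and should be checked against it.

```python
# Hypothetical helper, not part of this PR: it only assembles command strings.
# The timeout values below are ASSUMPTIONS based on the linked Microsoft guide.
ASSUMED_STONITH_TIMEOUT = {
    "sbd": 144,     # assumed value for SBD-based fencing
    "native": 900,  # assumed value for the Azure fence agent (MSI/SPN)
}


def fencing_property_commands(fencing_type, enable_concurrent_fencing=True):
    """Return the crm commands that set the fencing-related cluster properties."""
    if fencing_type not in ASSUMED_STONITH_TIMEOUT:
        raise ValueError(f"unknown fencing type: {fencing_type!r}")
    timeout = ASSUMED_STONITH_TIMEOUT[fencing_type]
    commands = [f"crm configure property stonith-timeout={timeout}"]
    if enable_concurrent_fencing:
        # Lets Pacemaker run several fencing operations in parallel.
        commands.append("crm configure property concurrent-fencing=true")
    return commands


if __name__ == "__main__":
    for kind in ("sbd", "native"):
        print(kind, fencing_property_commands(kind))
```

Keeping the timeout per fencing type in one table makes the kind of mix-up described above (one value applied to both cases) easier to spot in review.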
Verification Runs:
SBD:
https://openqa.suse.de/tests/15879961 PASS
https://openqa.suse.de/tests/15882210 PASS
https://openqa.suse.de/tests/15882212 PASS
https://openqa.suse.de/tests/15882213 PASS
MSI:
https://openqa.suse.de/tests/15879960 PASS
https://openqa.suse.de/tests/15882208 PASS
https://openqa.suse.de/tests/15882209 FAIL (crash site A)
https://openqa.suse.de/tests/15882211 PASS
SPN:
https://openqa.suse.de/tests/15882224 FAIL (crash site A)
https://openqa.suse.de/tests/15882225 PASS
https://openqa.suse.de/tests/15882226 FAIL (kill site A - primary)
https://openqa.suse.de/tests/15892130 PASS
VRs for the concurrent fencing:
PASS (23):
https://openqa.suse.de/tests/15893608
https://openqa.suse.de/tests/15893612 (failed at the end for an unrelated reason)
https://openqa.suse.de/tests/15893618
https://openqa.suse.de/tests/15894470
https://openqa.suse.de/tests/15894472
https://openqa.suse.de/tests/15894494
https://openqa.suse.de/tests/15894495
https://openqa.suse.de/tests/15894496
https://openqa.suse.de/tests/15894765
https://openqa.suse.de/tests/15894834
https://openqa.suse.de/tests/15894768
https://openqa.suse.de/tests/15894770
https://openqa.suse.de/tests/15894771
https://openqa.suse.de/tests/15894772
https://openqa.suse.de/tests/15894767
https://openqa.suse.de/tests/15895221
https://openqa.suse.de/tests/15895222
https://openqa.suse.de/tests/15895479
https://openqa.suse.de/tests/15895480
https://openqa.suse.de/tests/15895481
https://openqa.suse.de/tests/15895483
https://openqa.suse.de/tests/15898297
https://openqa.suse.de/tests/15898327
https://openqa.suse.de/tests/15898791
https://openqa.suse.de/tests/15898792
https://openqa.suse.de/tests/15898793
FAIL (1) at crash_replica due to ssh issues: https://openqa.suse.de/tests/15895479