-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
health check fails in check_group0_tokenring_consistency
if one of the nodes is down
#9599
Comments
It's also a problem to investigate core dump of node-1 since no logs available.
|
check_group0_tokenring_consistency
if one of the nodes is down
the coredump is a known issue, and is dealt with in SCT code, I would advise to rebase. as for logs missing, seems like:
it's issue being chased at in #9444, and not don't yet know the root cause of it, it's not related to this issue. as the the issue at hand, it's has nothing todo with full utilization of the disks. |
Packages
Scylla version:
2024.3.0~dev-20241218.42cc7a4f12de
with build-id0ee8a26c08783c18bd6dead5ba27a9e622efa885
Kernel Version:
6.8.0-1021-aws
Issue description
Describe your issue in detail and steps it took to produce it.
node-1 gets no-space-left (following a core dump:
The node fails to restart scylla service as well:
The nemesis thread cannot get host id and fails permanently (the test keeps running without nemesis):
The
get_nodetool_info
probably failed to run, in this state, and returned None, so theget()
failed.Impact
Describe the impact this issue causes to the user.
How frequently does it reproduce?
Describe the frequency with how this issue can be reproduced.
Installation details
Cluster size: 3 nodes (i4i.large)
Scylla Nodes used in this run:
OS / Image:
ami-0f14cab4bda57c2b2
(aws: undefined_region)Test:
byo-longevity-test-yg2
Test id:
90bfa08f-2a3d-4ba9-b443-13ce00925638
Test name:
scylla-staging/yarongilor/byo-longevity-test-yg2
Test method:
longevity_test.LongevityTest.test_custom_time
Test config file(s):
Logs and commands
$ hydra investigate show-monitor 90bfa08f-2a3d-4ba9-b443-13ce00925638
$ hydra investigate show-logs 90bfa08f-2a3d-4ba9-b443-13ce00925638
Logs:
Jenkins job URL
Argus
The text was updated successfully, but these errors were encountered: