Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable rasdaemon on all bare metal Linux hosts #523

Merged
merged 1 commit into from
Dec 23, 2024
Merged

Conversation

mweinelt
Copy link
Member

Part of #518

[root@haumea:~]# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No ARM processor errors.

No Extlog errors.

No devlink errors.

No disk errors.

No Memory failure errors.

MCE events:
1 2024-12-19 03:52:38 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x67639886, cpuid=0x00870f10, bank=0x00000011
2 2024-12-20 09:33:01 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x676539ce, cpuid=0x00870f10, bank=0x00000011
3 2024-12-20 10:00:20 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x67654034, cpuid=0x00870f10, bank=0x00000011
4 2024-12-20 20:44:46 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x6765d73e, cpuid=0x00870f10, bank=0x00000011
5 2024-12-21 14:40:39 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x6766d367, cpuid=0x00870f10, bank=0x00000011
6 2024-12-21 16:18:57 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2041000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x6766ea71, cpuid=0x00870f10, bank=0x00000011
7 2024-12-22 00:03:10 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x6767573e, cpuid=0x00870f10, bank=0x00000011
8 2024-12-22 02:30:37 +0000 error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=17), mcg mcgstatus=0, mci CECC, memory_channel=0,csrow=1, mcgcap=0x0000011c, status=0x9c2040000000011b, addr=0x319deb440, misc=0xd01b0fff01000000, walltime=0x676779ce, cpuid=0x00870f10, bank=0x00000011

@mweinelt mweinelt requested a review from a team as a code owner December 23, 2024 03:24
@mweinelt mweinelt enabled auto-merge December 23, 2024 03:47
@mweinelt mweinelt disabled auto-merge December 23, 2024 03:48
@mweinelt mweinelt merged commit 2347389 into master Dec 23, 2024
3 checks passed
@mweinelt mweinelt deleted the rasdaemon branch December 23, 2024 03:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant