Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VIC Management randomly failed to start. It won't recover. #2578

Open
vitaprimo opened this issue Aug 24, 2020 · 1 comment
Open

VIC Management randomly failed to start. It won't recover. #2578

vitaprimo opened this issue Aug 24, 2020 · 1 comment

Comments

@vitaprimo
Copy link

vitaprimo commented Aug 24, 2020

Summary

After authentication, vSphere Integrated Containers Management fails to load with error {"message":"Service not found: https://127.0.0.1:8282/","statusCode":404,"documentKind":"com:vmware:xenon:common:ServiceErrorResponse","errorCode":-2147483648}.

Restarting individual services fails, systemctl reset-failed appears to work, systemctl is-system-running will show running but it's really not.

Details

Doing systemctl without arguments showed systemd-modules-load.serviceto be failing, then journalctl showed:

-- Unit systemd-modules-load.service has begun starting up.
Aug 24 19:15:33 blablah.tld kernel: rdrand_rng: Neither RDSEED nor RDRAND is available.
Aug 24 19:15:33 blablah.tld systemd-modules-load[1003399]: Failed to insert 'rdrand_rng': No such device
Aug 24 19:15:33 blablah.tld systemd[1]: systemd-modules-load.service: Main process exited, code=exited, status=1/FAILURE
Aug 24 19:15:33 blablah.tld systemd[1]: Failed to start Load Kernel Modules.

harbor.service, armiral.service, fileserver.service and vic-machine-server.service are all active though only fileserver.service doesn't have red in systemctl status <service>.service. The plugin in vSphere appears to be working except that it won't deploy VCHs, doing it from a random computer still works and the Appliance's config help on port 9443 still is showing up. So at least Re-Initializing the appliance is probably an option. I'm not a programmer so that's my best bet I guess.

Appliance is running on vSphere 7 which has been a hassle deploying as setting any of the options in the OVA will prevent the appliance from starting. This was fixed by manually adding the vApp Options in the VM's Configure tab. Before vSphere 7 there were no issues deploying the OVA.

I attached a few screenshots.

See Also

Screen_Shot_2020-08-24_at_13_52_05
Screen Shot 2020-08-24 at 13 42 22
Screen_Shot_2020-08-24_at_13_41_28

@vitaprimo
Copy link
Author

Again, not a programmer here, but RDSEED and RDRAND sound like CPU related, I remember the last one from VPNs, so I killed the appliance and changed CPU/MMU Virtualization from Automatic to Hardware CPU and MMU booted the thing up, rushed to the portal and I got the same JSON with something about not yet ready. Eventually it redirected me to SSO and after authenticating it worked again.

Hopefully it doesn't freaks out later and I shorts circuit again. In the console I shortly saw the RDSEED and RDRAND not being present message again but it was gone from the teal screen this time around.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant