Automated fault tolerance test #297
sonallux started this conversation in Show and tell
Replies: 1 comment
-
Really nice work! If GitHub runs its GitHub Actions runners inside, e.g., Kubernetes, uses containers to create the jobs, and you then use containers inside such a job to run services, how many levels deep are we, and when do we reach limbo? 😄
-
Hey everyone,
To support and track the progress of my thesis on fault tolerance in microservice-based architectures, I have written some automated fault tolerance tests for the ODS. They run on GitHub Actions, so this status badge will always reflect the latest status of my fault tolerance tests:
Current test cases
Restart test
Tests that every service can be restarted without causing problems. After starting the ODS, the test performs some requests to create state. Then one service at a time is stopped and restarted. Once the service is running again, checks validate that no data has been lost and that the ODS is still fully functional. Because the services are currently not replicated, the test ensures that no requests are sent between stopping a service and its successful restart. Persistent storage is not erased either, so no state inconsistencies can arise.
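The flow of the restart test can be sketched roughly as follows. This is only an illustration, not the actual test code: `FakeService` and `run_restart_test` are hypothetical names, the services are simulated in memory instead of being real ODS containers, and the service names in the usage part are placeholders.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FakeService:
    """Hypothetical stand-in for one service with persistent storage."""
    name: str
    storage: Dict[str, str] = field(default_factory=dict)  # survives restarts
    running: bool = True

    def stop(self) -> None:
        self.running = False

    def start(self) -> None:
        self.running = True

    def put(self, key: str, value: str) -> None:
        if not self.running:
            raise RuntimeError(f"{self.name} is down")
        self.storage[key] = value

    def get(self, key: str) -> str:
        if not self.running:
            raise RuntimeError(f"{self.name} is down")
        return self.storage[key]


def run_restart_test(services: List[FakeService]) -> List[str]:
    """Restart each service in turn and return the names that passed."""
    # 1. Add some state to every service before any restart.
    for svc in services:
        svc.put("probe", f"state-of-{svc.name}")

    passed = []
    for target in services:
        # 2. Stop and restart one service; no requests are sent in between.
        target.stop()
        target.start()
        # 3. Validate that no data was lost and every service still answers.
        ok = all(s.get("probe") == f"state-of-{s.name}" for s in services)
        if ok:
            passed.append(target.name)
    return passed


if __name__ == "__main__":
    # Placeholder service names, assumed for illustration only.
    names = ["adapter", "scheduler", "storage"]
    print(run_restart_test([FakeService(n) for n in names]))
```

In a real setup, `stop` and `start` would presumably control the service containers, and the checks would be HTTP requests against the running system.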
The following bugs have been detected by this test:
Scheduler initialization test
This test is for issue #284.