You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 17, 2024. It is now read-only.
As discussed in Agent team meeting, and in this issue: elastic/beats#27299
we want to implement a 'soak' test, where we leave Agent running for a number of hours and where we do some repeat manipulations and then repeatedly check various status points.
I'll offer the following explicit details that come to mind off the top:
a separate run would be nice, as compared to the regular nightly runs because these will be in play for 8 hours or 16 or 24 (or even more maybe?). Let's start assuming we want to make it configurable based in hours.
probably a separate feature file would be easiest.
the big question is how is it easiest to create a loop of the desired noted tasks, all from within the same run
Agent can be spun up with the available building blocks... like below:
1 install FS
2 install Agent
3 set policy to desired
4 add or remove an Integration
5 check health of the processes
6 check health status in API
7 check data is still being ingested after the current time-frame
8 check memory usage stats + cpu usage stats and record them
repeat from step 4 until the time delay is completed for the test, looping from step 4 every 30 seconds, or as possible when last iteration finishes (can we guarantee new data will have been added in the last 30 seconds? maybe not...)
it could work for more supported OSes/environments but may be easier to target just one to start.
If we can modify the environment over time, we could reduce the specs of the Agent host to see when it may start to fail, which can give us minimum requirements recommendations, maybe?
As discussed in Agent team meeting, and in this issue: elastic/beats#27299
we want to implement a 'soak' test, where we leave Agent running for a number of hours and where we do some repeat manipulations and then repeatedly check various status points.
I'll offer the following explicit details that come to mind off the top:
it could work for more supported OSes/environments but may be easier to target just one to start.
If we can modify the environment over time, we could reduce the specs of the Agent host to see when it may start to fail, which can give us minimum requirements recommendations, maybe?