misc: Align tc_iterate behavior #312

jkoeppeler · 2024-10-28T14:34:29Z

tc_iterate.sh behaves differently depending on whether the tc_iterate binary is installed. The tc_iterate program collects tc statistics "count" times, whereas the bash loop runs for "length" seconds. This leads to different measurement durations.

This patch replaces the "count" parameter with a "length" parameter in the tc_iterate.c CLI. tc_iterate.c then calculates "count" from the length and the interval, so that the run covers the requested duration.
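
The count calculation described here could look roughly like the following. This is a minimal sketch, not the code from the patch: `compute_count` is a hypothetical helper, and rounding to the nearest whole sample is an assumption.

```shell
# Hypothetical helper (not taken from tc_iterate.c): number of samples
# needed to cover LENGTH seconds at one sample every INTERVAL seconds.
# Usage: compute_count LENGTH INTERVAL
compute_count() {
    awk -v l="$1" -v i="$2" 'BEGIN {
        # Reject a non-positive interval instead of dividing by it.
        if (i <= 0) exit 1
        # Assumption: round to the nearest whole sample.
        printf "%d\n", int(l / i + 0.5)
    }'
}

# compute_count 10 0.01    prints 1000
```

The function fails (non-zero exit status, no output) when the interval is zero or negative, so a caller can check the status before using the result.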

Signed-off-by: Jonas Köppeler <[email protected]>

tohojo · 2024-10-29T17:50:32Z

Hmm, so the problem with this is that with this change, flent will no longer work with an old version of tc_iterate, which can be a problem especially when running it on remote hosts. I think this may actually be why tc_iterate itself wasn't updated back when the script was.

Rather than dealing with doing this in a backwards-compatible way, I wonder if it isn't better to just deprecate the use of the tc_iterate binary entirely. I'm not sure the accuracy is actually that much worse for the shell script version. Did you observe a case where this was significant?

jkoeppeler · 2024-10-30T09:40:03Z

I compared the script and the binary by running both for 10 seconds with an interval of 0.01. This should capture 1000 data points. The bash script captured only 470, while the binary recorded all 1000. Looking at the timestamps, the binary triggers quite precisely every 0.01 seconds, whereas the bash script only achieves an interval of around 0.02.
The tc_iterate binary is also able to capture everything down to a 0.001 s interval.
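
The arithmetic behind those figures can be checked with a small sketch. It uses only the numbers quoted in this comment; `effective_interval` is a hypothetical helper name.

```shell
# Hypothetical helper: average spacing between samples, given the run
# duration in seconds and the number of samples actually captured.
# Usage: effective_interval DURATION SAMPLES
effective_interval() {
    awk -v d="$1" -v n="$2" 'BEGIN { printf "%.4f\n", d / n }'
}

# effective_interval 10 1000   prints 0.0100  (binary: all 1000 samples)
# effective_interval 10 470    prints 0.0213  (script: 470 samples)
```

470 samples in 10 seconds corresponds to an average spacing of about 0.021 s, which matches the roughly 0.02 s seen in the script's timestamps.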

tohojo · 2024-11-01T13:34:54Z

flent/scripts/tc_iterate.sh

@@ -23,7 +21,7 @@ buffer=""


 command_string=$(cat <<EOF
-which tc_iterate >/dev/null && exec tc_iterate $buffer -i $interface -c $count -I $interval -C $command;
+which tc_iterate >/dev/null && exec tc_iterate $buffer -i $interface -l $length -I $interval -C $command;


OK, so one thing that comes to mind as a way of keeping backwards compatibility is that, instead of unconditionally exec'ing when the binary exists, we do a regular call to the binary and exit the script on success. That way, if it's an old binary without the -l option, we will fall back to the script. The only potential problem with this is that if the binary runs but still exits non-zero, we'll get duplicate results. This could happen if it is explicitly killed; however, it seems that that results in a different exit code, so maybe if we only react to the return value in usage() (255), that would work?
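
A rough sketch of that fallback idea, for illustration only. It assumes, as suggested in this comment, that usage() exits with status 255; `try_binary` is a hypothetical helper, and the sketch ignores that the real script builds a command string rather than calling the binary directly.

```shell
# Hypothetical helper: run BINARY with the given arguments and report its
# exit status; report 255 if the binary is not installed at all.
# Usage: try_binary BINARY [ARGS...]
try_binary() {
    binary=$1
    shift
    # Missing binary: treat it like the "unsupported option" case.
    command -v "$binary" >/dev/null 2>&1 || return 255
    status_code=0
    "$binary" "$@" || status_code=$?
    return "$status_code"
}

# Sketch of the call site (flag names as in the diff above):
#   rc=0
#   try_binary tc_iterate -i "$interface" -l "$length" -I "$interval" -C "$command" || rc=$?
#   [ "$rc" -ne 255 ] && exit "$rc"
#   ... otherwise continue with the shell loop ...
```

Only status 255 triggers the fallback, so a binary that ran and failed for another reason does not produce duplicate results.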

jkoeppeler · 2024-11-01

Another option could be to check the output of the binary when called with --help (or similar) and grep for -l or -c in the output. This way we could distinguish between the binaries.
Or we just keep -c and calculate the count in the script instead of in the C code. This would probably be the minimal fix. Either way, depending on your preference, I am happy to implement it :)
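
The first option could be sketched as follows. This assumes the binary prints usage text that lists its options; whether tc_iterate accepts `--help` is not established here, and `usage_mentions_length` is a hypothetical helper.

```shell
# Hypothetical helper: read usage text on stdin and succeed only if it
# lists a standalone "-l" option (so "--length" or "-lx" do not count).
usage_mentions_length() {
    grep -Eq -e '(^|[^[:alnum:]-])-l([^[:alnum:]-]|$)'
}

# Sketch of the call site; "--help" is an assumption about the binary:
#   if tc_iterate --help 2>&1 | usage_mentions_length; then
#       ... call the binary with -l "$length" ...
#   else
#       ... call the binary with -c "$count" ...
#   fi
```

Matching on the usage text avoids running a measurement just to find out which options the installed binary understands.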

tohojo · 2024-11-01

Oh yeah, that's a good point actually: why is this change needed at all? You're basically just moving the same calculation from Python to C?

jkoeppeler · 2024-11-01

I thought doing it in C would make it easier for anyone who (like me) wants to execute the C binary directly. It is then more comfortable to pass a length parameter than to derive the count from the duration. But if preferred, I would just add the calculation to the Python code (or to the bash script?).
And as far as I can see, the Python code just passes the length as the count parameter, which results in different behavior because the tc_iterate binary will capture 10 data points instead of capturing for 10 seconds.

tohojo · 2024-11-01

You're executing it manually? Why? :)

And, well, the line this patch removes from the Python file already contains the exact same calculation? :D

jkoeppeler · 2024-11-01

Well, :D you are correct, sorry, I misread the code. Will close this PR then.

jkoeppeler force-pushed the align-tc-iterate branch from 0e6ad4a to ef496af Compare October 28, 2024 14:40

jkoeppeler force-pushed the align-tc-iterate branch from ef496af to 634603c Compare October 28, 2024 14:43

misc: tc_iterate: ensure interval is greater 0

58ac3ce

Signed-off-by: Jonas Köppeler <[email protected]>

tohojo reviewed Nov 1, 2024

jkoeppeler closed this Nov 1, 2024
