
Handle cases where signals are missed. #12

Open
stephenworsley wants to merge 1 commit into master

Conversation

stephenworsley
Contributor

As described here: https://docs.python.org/3/library/signal.html

A long-running calculation implemented purely in C (such as regular expression matching on a large body of text) may run uninterrupted for an arbitrary amount of time, regardless of any signals received. The Python signal handlers will be called when the calculation finishes.

Because of this, it is possible for many signals to be received while a log entry is written only once. This pull request attempts to fix that by estimating how many signals should have been received since the last one was handled. To make that estimate more accurate, the way signals are jittered has also been changed.
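Roughly, the estimate works like this (a minimal sketch, not the actual Kapture code; the jitter bounds and the `log_frame` helper are illustrative):

```python
import math
import time

# Illustrative jitter bounds: each signal is scheduled between 0.5 s and 1.0 s
# after the previous one, so n consecutive intervals always span between
# 0.5 * n and 1.0 * n seconds.
JITTER_MIN = 0.5
JITTER_MAX = 1.0

_last_handled = None


def log_frame(frame):
    """Stand-in for Kapture's real frame logging."""


def handle_signal(signum, frame):
    """Log the current frame, compensating for signals missed while C code ran."""
    global _last_handled
    now = time.monotonic()
    if _last_handled is None:
        missed = 0
    else:
        gap = now - _last_handled
        # At least ceil(gap / JITTER_MAX) intervals must have elapsed, so any
        # beyond the first correspond to signals that were sent but not handled.
        missed = max(math.ceil(gap / JITTER_MAX) - 1, 0)
    _last_handled = now
    for _ in range(missed + 1):
        log_frame(frame)
```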

@bjlittle
Member

@stephenworsley It's not really easy to determine whether this is an improvement or not. Without testing can you offer any evidence that this is a good PR?

@bjlittle bjlittle self-assigned this Oct 30, 2019
@bjlittle bjlittle added this to the v0.6.0 milestone Oct 30, 2019
@stephenworsley
Contributor Author

@bjlittle This came up because I was testing Kapture with a file that had a long-running call during which nothing was logged for about 20 seconds. On the resulting graph, that call took up only a small amount of space, which seems to me a tad misleading. With my fix, the call appeared on the graph as appropriately wide.

@rhattersley
Member

To avoid having to change the sampling pattern, have you considered adding the time-since-last-signal to the _Frame that gets logged? The visualisation could then be tweaked to weight the widths accordingly?
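Something like this, perhaps (the field and function names are illustrative, not Kapture's actual `_Frame`):

```python
from dataclasses import dataclass
import time


@dataclass
class _Frame:
    name: str   # whatever the sample already records
    dt: float   # seconds since the previous handled signal (the new field)


_last_handled = None


def record(name):
    """Build a sample that carries the time elapsed since the previous one."""
    global _last_handled
    now = time.monotonic()
    dt = 0.0 if _last_handled is None else now - _last_handled
    _last_handled = now
    return _Frame(name=name, dt=dt)


def width(frame, nominal_interval=0.75):
    """Scale a sample's width in the visualisation by the time it represents."""
    return max(frame.dt / nominal_interval, 1.0)
```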

@stephenworsley
Contributor Author

@rhattersley I'm not sure that would get around the need to change the sampling pattern. If signals are being handled immediately, I don't think there should be any difference in the weight given to one handled ~0 seconds after the last handled signal and one handled ~2 seconds after it. I only want to add weight when the time since the last handled signal is large enough that another signal must have been sent but was ignored.

The thought behind changing the sampling pattern is that, with my pattern, the time between two consecutive signals is always less than the time between, say, the j-th signal and the (j+2)-th signal. With the current pattern, the first of those gaps can be as large as 2 while the second can be as small as 1. I suppose it would still be possible to approximate the number of missed signals, but with a tendency to underestimate by 1 (perhaps around half of the time). Whether the weighting happened before or after logging, I feel the issue is still tied to the sampling pattern.
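To make the overlap point concrete (illustrative numbers only):

```python
import math

# Old pattern: each interval is drawn from [0, 2].  One interval can last up
# to 2 s while two intervals can total as little as ~0 s, so a gap of, say,
# 1.8 s is consistent with one signal or with several -- the gap alone cannot
# tell you whether anything was missed.
#
# New pattern: each interval is drawn from [0.5, 1].  One interval never
# exceeds 1 s, and n intervals always total at least 0.5 * n seconds, so any
# gap longer than 1 s proves at least one signal was missed, and
# ceil(gap / 1.0) is a sound lower bound on how many intervals elapsed.


def min_intervals_elapsed(gap, jitter_max=1.0):
    """Lower bound on the number of sampling intervals that fit in `gap`."""
    return max(math.ceil(gap / jitter_max), 1)


print(min_intervals_elapsed(0.8))   # 1 -> nothing missed
print(min_intervals_elapsed(3.2))   # 4 -> at least 3 signals went unhandled
```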

@rhattersley
Member

If signals are being handled immediately, I don't think there should be any difference in the weight given to one handled ~0 seconds after the last handled signal and one handled ~2 seconds after it.

Completely agree - hence the original implementation. 👍

And I understand why inferring the number of lost signals from the previous and current signal times requires that the possible signal intervals can't overlap. My only concern is that changing the sampling interval to [0.5, 1] might exchange one sampling bias for another. The original jitter pattern is deliberately designed to avoid bias in exchange for increased noise, so longer runs converge to an accurate answer.

On balance I suspect the new trade-off in this PR is worth it.

@rhattersley
Member

For future reference, a couple of (totally not thought through!) ideas that probably aren't worth exploring right now:

  1. With a bit of fiddling it might be possible to augment the signalling to make use of the value argument to sigqueue to supply a sequential counter.
  2. Could we move the timing of the signals from the main process to the wrap_user.py process, e.g. by using signal.setitimer? This would let the wrap_user.py process maintain its own concept of how many sampling intervals have passed (a rough sketch follows below).
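
For idea 2, the sketch would be something along these lines (illustrative only; the 0.75 s interval is arbitrary and per-tick jitter is ignored here):

```python
import signal

_intervals_seen = 0


def _on_tick(signum, frame):
    """Count sampling intervals inside the profiled process itself."""
    global _intervals_seen
    _intervals_seen += 1
    # ... log `frame` here, much as the existing handler does ...


# ITIMER_REAL makes the kernel deliver SIGALRM to this process every 0.75 s,
# so wrap_user.py keeps its own notion of elapsed intervals.  Note that timer
# expirations are not queued, so ticks coalesced while C code runs would
# still have to be inferred from timing.
signal.signal(signal.SIGALRM, _on_tick)
signal.setitimer(signal.ITIMER_REAL, 0.75, 0.75)
```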

@CLAassistant

CLAassistant commented Oct 26, 2023

CLA assistant check
All committers have signed the CLA.

@trexfeathers trexfeathers removed this from the v0.6.0 milestone Feb 21, 2024
@bjlittle bjlittle removed their assignment Apr 9, 2024