[queue] create PendingCount (to replace Lag) #160

philandstuff · 2024-12-09T16:10:45Z

This adds a new PendingCount command which returns the aggregate number of pending messages in a queue.

I tried working with Lag, but it has the problem that it is fundamentally heuristic, based on keeping track of two counters:

- how many messages have ever been added to the stream
- how many messages have ever been read by the given consumer group

"Lag" is then the difference between these counters - the number of messages that have been added to the stream but not read by the consumer group.

Unfortunately, these counters can desync horribly if you ever XDEL a message before it gets read by a consumer group. This is because deleting a message does not decrement the "added to stream" counter, nor does it increment the "read by consumer group" counter.
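The desync can be reproduced with a small in-memory model. This is only a sketch: `ToyStream` and its methods are hypothetical stand-ins for the two counters described above, not Redis itself and not the code in this PR.

```python
class ToyStream:
    """Toy model of a stream with one consumer group (not real Redis)."""

    def __init__(self):
        self.unread = []        # IDs in the stream not yet delivered to the group
        self.entries_added = 0  # messages ever added to the stream
        self.entries_read = 0   # messages ever read by the consumer group

    def xadd(self):
        self.entries_added += 1
        self.unread.append(self.entries_added)
        return self.entries_added

    def xreadgroup(self):
        if not self.unread:
            return None
        self.entries_read += 1
        return self.unread.pop(0)

    def xdel(self, msg_id):
        # Deleting an unread message touches neither counter.
        if msg_id in self.unread:
            self.unread.remove(msg_id)

    def counter_lag(self):
        # The heuristic: "added" minus "read".
        return self.entries_added - self.entries_read

    def true_backlog(self):
        return len(self.unread)


def demo_desync():
    s = ToyStream()
    ids = [s.xadd() for _ in range(3)]
    s.xdel(ids[1])   # delete message 2 before anyone reads it
    s.xreadgroup()   # reads message 1
    s.xreadgroup()   # reads message 3
    return s.counter_lag(), s.true_backlog()


if __name__ == "__main__":
    print(demo_desync())  # (1, 0): the heuristic reports 1, but nothing is left to read
```

In this model the heuristic stays off by one forever after a single early delete, and each further early delete widens the gap.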

Currently we rely on being able to XDEL messages, and removing that dependency would be nontrivial.

But we can measure the size of the pending entries list (PEL) with XPENDING, and we can measure the length of the stream with XLEN, so we can calculate lag as Len() - PendingCount().
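As a rough illustration of that arithmetic, here is a toy model in which finished messages are both acknowledged and deleted. The class `ToyQueue` and its method names are assumptions made for the example; only the `Len() - PendingCount()` relationship comes from the description above.

```python
class ToyQueue:
    """Toy model: `stream` is what XLEN would count, `pel` what XPENDING would summarise."""

    def __init__(self):
        self.stream = []   # IDs still in the stream
        self.pel = set()   # IDs delivered to a consumer but not yet acknowledged
        self._next_id = 0

    def add(self):
        self._next_id += 1
        self.stream.append(self._next_id)
        return self._next_id

    def read(self):
        # Deliver the oldest message that is not already pending.
        for msg_id in self.stream:
            if msg_id not in self.pel:
                self.pel.add(msg_id)
                return msg_id
        return None

    def done(self, msg_id):
        # Model of XACK followed by XDEL once processing has finished.
        self.pel.discard(msg_id)
        self.stream.remove(msg_id)

    def length(self):
        return len(self.stream)

    def pending_count(self):
        return len(self.pel)

    def waiting(self):
        # Messages in the stream that no consumer has picked up yet.
        return self.length() - self.pending_count()


def demo_waiting():
    q = ToyQueue()
    for _ in range(5):
        q.add()
    first = q.read()
    q.read()
    q.done(first)
    return q.length(), q.pending_count(), q.waiting()


if __name__ == "__main__":
    print(demo_waiting())  # (4, 1, 3)
```

Here four messages remain in the stream, one of them is in flight, and three are still waiting for a consumer.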


evilstreak · 2024-12-09T16:13:55Z

> so we can calculate lag as PendingCount() - Len().

This should be Len() - PendingCount(), right?

evilstreak

Thinking about your PR description, this has the opposite problem: it requires that we call XDEL on every message once we're done with it. But since that's how our system currently behaves, and our autoscaler relies on that behaviour to work out how many pods to run, I think that's fine!
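To make that trade-off concrete, the sketch below compares the result with and without the delete. The function `waiting_after_one_done` and its flag are hypothetical, and the stream and PEL are plain Python collections rather than Redis structures.

```python
def waiting_after_one_done(delete_when_done):
    """Return Len() - PendingCount() after one of three messages has been fully processed."""
    stream = [1, 2, 3]  # IDs in the stream (what XLEN would count)
    pel = set()         # IDs delivered but not acknowledged (what XPENDING would count)

    pel.add(1)          # message 1 is delivered to a consumer
    pel.discard(1)      # ... and acknowledged once processed (XACK)
    if delete_when_done:
        stream.remove(1)  # ... and deleted from the stream (XDEL)

    return len(stream) - len(pel)


if __name__ == "__main__":
    print(waiting_after_one_done(True))   # 2: messages 2 and 3 are waiting
    print(waiting_after_one_done(False))  # 3: the finished message is still counted
```

Without the delete, every finished message stays in the stream and inflates the result, so the metric only matches the real backlog when every message is deleted after processing.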

philandstuff · 2024-12-09T16:17:42Z

> > so we can calculate lag as PendingCount() - Len().
>
> This should be Len() - PendingCount(), right?

er.. yes

This is just like #160, but we calculate length and pending count atomically. Before this change, I was subtracting the pending count from the length to get a "waiting messages" metric; but as these numbers could not be measured atomically I sometimes saw negative waiting messages. This changes PendingCount() into Stats() which returns both length and pending count.
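The negative readings can be reproduced with a deterministic interleaving. In this sketch a `threading.Lock` stands in for whatever makes the two reads atomic; the description does not say which mechanism is used, so the lock, the class `ToyQueue`, and its method names are all assumptions.

```python
import threading


class ToyQueue:
    """Toy model holding only the two numbers of interest."""

    def __init__(self, length, pending):
        self._lock = threading.Lock()
        self._length = length
        self._pending = pending

    def length(self):
        with self._lock:
            return self._length

    def pending_count(self):
        with self._lock:
            return self._pending

    def add_and_claim(self):
        # A producer adds one message and a consumer reads it straight away.
        with self._lock:
            self._length += 1
            self._pending += 1

    def stats(self):
        # Both values are taken from the same snapshot.
        with self._lock:
            return self._length, self._pending


def demo_race():
    q = ToyQueue(length=2, pending=2)

    # Two separate reads with queue activity in between.
    length = q.length()
    q.add_and_claim()
    pending = q.pending_count()
    non_atomic_waiting = length - pending

    # One combined read.
    snap_length, snap_pending = q.stats()
    atomic_waiting = snap_length - snap_pending

    return non_atomic_waiting, atomic_waiting


if __name__ == "__main__":
    print(demo_race())  # (-1, 0)
```

The two separate reads straddle a change and yield -1 waiting messages, while the single snapshot yields 0.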

philandstuff requested a review from a team as a code owner December 9, 2024 16:10

evilstreak approved these changes Dec 9, 2024


philandstuff merged commit 6c095f7 into main Dec 9, 2024
2 checks passed

philandstuff deleted the change-lag-to-pending-count branch December 9, 2024 16:18

philandstuff mentioned this pull request Dec 10, 2024

PLAT-604 [queue] create Stats (to replace PendingCount) #161

Merged
