Skip to content

Possible Duplicate Job Execution in Cluster After DB Connection loss and resume #585

Answered by kagkarlsson
KBA2024 asked this question in Q&A
Discussion options

You must be logged in to vote

If you loose the db-connection to such a degree that heartbeating also stops (which runs in another thread on another db-connection), then most likely the ongoing transaction will never commit, since the app probably need to re-establish all database-connections.

There are scenarios where it still might happen, like stop-the-world events (gc or similar) with a duration longer than the limit for dead-executions discovery. You could in theory get the execution from the database before committing and checking that the version number is unchanged. I don't know of anyone doing this though (it is I think overly paranoid), but you could

(the version-number is incremented on most updates on the e…

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@KBA2024
Comment options

@kagkarlsson
Comment options

Answer selected by KBA2024
@KBA2024
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants