
Fix NodeStateManager lost update problem #25069

Open
wants to merge 2 commits into master

Conversation

@brybacki (Contributor) commented Feb 18, 2025

Description

The NodeStateManager's drain implementation suffered from a lost update problem. A rapid sequence of drain/active/drain requests could cause the worker to prematurely exit a new draining state, ignoring in-flight tasks, because it failed to distinguish between old and new drain operations.

This was resolved by adding versioning to the node state, ensuring each state change request is uniquely identified and processed to completion. Each time the worker state changes, the version number increases, so even if the state flips quickly from (DRAINING, 1) to (ACTIVE, 2) and back to (DRAINING, 3), the handling code can tell which versioned state it was invoked for. As a result, it only completes a drain action when it is still working on the latest version of the request.
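
To illustrate the idea, here is a minimal sketch of a versioned state with a version-aware completion check. The enum values, class, and method names below are simplified assumptions for illustration, not the actual NodeStateManager code.

import java.util.concurrent.atomic.AtomicReference;

// Sketch only: each transition produces a new (state, version) pair, and an
// asynchronous drain handler remembers the version it was started for.
enum NodeState { ACTIVE, DRAINING, DRAINED }

record VersionedState(NodeState state, long version) {}

class VersionedStateSketch
{
    private final AtomicReference<VersionedState> current =
            new AtomicReference<>(new VersionedState(NodeState.ACTIVE, 0));

    // Every transition bumps the version, even when the same state is requested again.
    VersionedState transitionTo(NodeState newState)
    {
        return current.updateAndGet(old -> new VersionedState(newState, old.version() + 1));
    }

    // A drain that started at version v may only complete if no newer transition happened.
    void drainingComplete(long versionDrainStartedAt)
    {
        VersionedState now = current.get();
        if (now.state() == NodeState.DRAINING && now.version() == versionDrainStartedAt) {
            // compareAndSet so a concurrent transition cannot be clobbered by a stale drain
            current.compareAndSet(now, new VersionedState(NodeState.DRAINED, now.version() + 1));
        }
        // otherwise a newer request superseded this drain; do nothing
    }
}

In this sketch, the task draining and grace-period sleep described above would happen between transitionTo(DRAINING) and the corresponding drainingComplete(...) call.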

Additional context and related issues

Example:

  1. NodeStateManager is in the DRAINING state, handling drain -> waitActiveTasksToFinish.
  2. It transitions to ACTIVE - this finishes waitTasksToFinish(activeTasks); and moves on to sleepUninterruptibly(gracePeriod.toMillis(), MILLISECONDS);
  3. It receives some new tasks.
  4. It transitions to DRAINING again, so a new drain is started.
  5. The original drain finishes sleepUninterruptibly(gracePeriod.toMillis(), MILLISECONDS); and proceeds to drainingComplete() - it simply starts the transition to DRAINED, even though it should not.

The intermediate ACTIVE state was not observed by all of the checks along this path. The worker is now in a new DRAINING state, with new tasks to drain, and it should not continue executing the previous drain's completion logic. A sketch contrasting the two completion checks is shown below.
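
To make the failure at step 5 concrete, here is a hedged before/after sketch of the completion check, reusing the NodeState and VersionedState types from the sketch above; the method names are illustrative only.

class DrainCompletionChecks
{
    // Buggy shape: at step 5 the state is DRAINING again (the *new* drain), so a
    // state-only check passes and the stale drain wrongly proceeds to DRAINED.
    static boolean mayCompleteStateOnly(NodeState observed)
    {
        return observed == NodeState.DRAINING;
    }

    // Fixed shape: the stale drain started at version 1, but the current state is
    // (DRAINING, 3), so the version comparison fails and the stale drain backs off.
    static boolean mayCompleteVersioned(VersionedState observed, long drainStartedAtVersion)
    {
        return observed.state() == NodeState.DRAINING
                && observed.version() == drainStartedAtVersion;
    }
}

With the version comparison, the drain started at (DRAINING, 1) cannot complete once the state has moved on to (ACTIVE, 2) and then (DRAINING, 3).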

Release notes

(x) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
( ) Release notes are required, with the following suggested text:

## Section
* Fix some things. ({issue}`issuenumber`)

Changing "NodeState" to "Worker State change" in the log messages clarifies that the logs refer to the worker's state transitions and makes them consistent with the messages in ServerInfoResource. Consistent logs are easier to grep.

private record VersionedState(NodeState state, long version)
{
    private static final AtomicLong versionProvider = new AtomicLong(0);
    // ...
}
Member: put it in NodeStateManager and make non-static

Member: Also, it does not need to be atomic as it is accessed from synchronized blocks.
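
For context, a sketch of the shape these two suggestions point toward: a plain, non-static counter owned by the manager and touched only inside synchronized blocks. The class and method names are illustrative assumptions, not the actual NodeStateManager implementation, and the NodeState enum is reused from the first sketch above.

class NodeStateManagerSketch
{
    private record VersionedState(NodeState state, long version) {}

    // A plain long is enough here: every read and write happens while holding
    // the same monitor, so no extra atomicity is needed.
    private long versionCounter;
    private VersionedState currentState = new VersionedState(NodeState.ACTIVE, 0);

    synchronized VersionedState transitionTo(NodeState newState)
    {
        versionCounter++;
        currentState = new VersionedState(newState, versionCounter);
        return currentState;
    }

    synchronized boolean isStillCurrent(long observedVersion)
    {
        return currentState.version() == observedVersion;
    }
}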
