Rack-aware load balancing #31

nyh · 2024-08-28T08:24:47Z

Currently, all our Alternator load-balancing implementations in this repository ignore rack (a.k.a. Amazon availability zone, AZ) information: We use the "/localnodes" API to get a list of all live Scylla servers in this data center (a.k.a. Amazon region), and send the request to one of them.

But when the Scylla DC has multiple racks on different Amazon AZs, cross-AZ traffic costs money. It is cheaper for the client running on a specific AZ to send the request to a random node on the same AZ - and not to nodes on other AZs. This issue requests that the load balancers do this: Prefer to send requests to a node on the client's rack, not a node on other racks.

See scylladb/scylladb#12147 for a server-side modification to "/localnodes" that can help us get the list of nodes in the current AZ.

Beyond server-side modifications, the following two points will also need to be considered:

- If the client itself is not balanced across the different AZs (e.g., is only running on a single AZ), using current-AZ-only load balancing would be inefficient, as it would lead the Scylla nodes on this AZ to be more loaded than the others. If we can't recognize this situation automatically, we should at least make rack-aware load balancing optional.
- If the cluster has just 3 nodes across 3 AZs, then each AZ has just a single node. If this node is temporarily down, listing the nodes on this AZ will return no nodes. So whenever the current-AZ node list is empty, we should fall back to getting all the nodes in the whole DC, and use that list.
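
The two points above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the code of any load balancer in this repository: `choose_node`, the `rack_aware` flag, and the injected `fetch` callable are hypothetical names, and the `rack` query parameter is assumed from the title of scylladb/scylladb#20915. A real client would implement `fetch` as an HTTP GET to "/localnodes" and parse the returned list of node addresses.

```python
import random
from typing import Callable, Dict, List, Optional

# Hypothetical fetcher type: receives the query parameters for "/localnodes"
# and returns the list of live node addresses reported by the server.
FetchNodes = Callable[[Dict[str, str]], List[str]]


def choose_node(fetch: FetchNodes,
                rack: Optional[str] = None,
                rack_aware: bool = True,
                rng: Optional[random.Random] = None) -> str:
    """Pick a random live node, preferring the client's own rack."""
    chooser = rng if rng is not None else random
    nodes: List[str] = []
    # Rack-aware mode is optional, so an unbalanced client can switch it off.
    if rack_aware and rack:
        nodes = fetch({"rack": rack})
    # An empty rack list (e.g. the rack's only node is down) falls back to
    # the list of all nodes in the data center.
    if not nodes:
        nodes = fetch({})
    if not nodes:
        raise RuntimeError("no live nodes reported")
    return chooser.choice(nodes)
```

With this shape, a client deployed in a single AZ would pass `rack_aware=False` and keep today's DC-wide behavior.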

dkropachev · 2024-10-01T18:04:11Z

@nyh, I don't know the details of the Alternator implementation, but I know that it uses LWT under the hood. Recently we were asked to make sure that the regular gocql drivers avoid load-balancing logic for LWT, because it creates extra congestion on the server side: if two queries that target the same PK land on two different replicas, the replicas have to go through a reconciliation process to serialize the queries properly.
So I wonder if the same applies to Alternator.

If so, then we need to do token-aware load balancing as well to get more performance out of the cluster.

nyh · 2024-10-01T18:38:34Z

@dkropachev this is true - we have #11 for token-aware load balancer, but as I noted there, there is a difficulty: it will mean we'll need to monkey-patch the AWS SDK at a different place than we do today, to let it see the full query - and parse it (unfortunately) - to decide where to route a write request (for reads, LWT is not relevant).

We also have scylladb/scylladb#5703 on the Scylla side, which says that if the AWS SDK isn't token-aware (like it isn't today), we can rescue the contention problem by forwarding writes to the "right" node.

But you're right - if the load balancer is rack-aware (as this issue proposes), and different racks will send writes to different nodes, we will end up with more LWT contention. I don't know what to do about this - other than making rack-aware load balancing optional. Personally, I think the LWT whole-partition-contention problems need to be fixed (scylladb/scylladb#16261) instead of trying to work around them in the load balancer.
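
To make the contention concern concrete, here is a minimal Python sketch of key-affinity routing: every client that sees the same node list sends writes for a given partition key to the same node. It is a simplified stand-in and not real token-aware routing, because it hashes the key with SHA-256 modulo the node count instead of consulting Scylla's token ring and replica sets. `node_for_key` and the example key and addresses are hypothetical.

```python
import hashlib
from typing import List


def node_for_key(partition_key: str, nodes: List[str]) -> str:
    """Map a partition key to one node, deterministically.

    Simplified illustration only: a real token-aware balancer would use the
    cluster's partitioner and replica information instead of this hash.
    """
    if not nodes:
        raise ValueError("node list is empty")
    # Sort so that clients holding the same set of nodes in a different
    # order still agree on the choice.
    ordered = sorted(nodes)
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(ordered)
    return ordered[index]
```

If each client first restricts the node list to its own rack, clients in different racks pass different lists and so pick different nodes for the same key, which is the extra LWT contention described above.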

mykaul · 2024-10-02T08:29:26Z

CC @kostja - thoughts?

dkropachev · 2024-10-02T10:37:46Z

If we have plans to enable LWT on tablets, then either we need to bring drivers to the Alternator load balancers, or we need to expose routing info via an API.

mykaul · 2024-10-29T09:52:22Z

@dkropachev - any progress on this? (regardless of tablets)

mykaul · 2024-11-04T10:17:02Z

@dkropachev ?

dkropachev · 2024-11-04T17:07:29Z

@mykaul, it is scheduled to the next sprint

mykaul · 2024-11-05T13:34:10Z

> @mykaul, it is scheduled to the next sprint

It'd be great if it can be prioritized and delivered sooner. It has a material impact.

mykaul · 2024-11-19T14:01:36Z

> > @mykaul, it is scheduled to the next sprint
>
> It'd be great if it can be prioritized and delivered sooner. It has a material impact.

@dkropachev , @roydahan - what's the status of this?

nyh · 2024-11-19T14:23:52Z

> @dkropachev , @roydahan - what's the status of this?

@mykaul do you know which of the languages and SDK versions that we support you want this feature to appear in first?

mykaul · 2024-11-19T15:31:45Z

> > @dkropachev , @roydahan - what's the status of this?
>
> @mykaul do you know which of the languages and SDK versions that we support you want this feature to appear in first?

Java for sure. Not sure about SDK version.

roydahan · 2024-11-19T16:03:33Z

The first PRs are only for Java.

dkropachev · 2024-11-19T16:47:36Z

> > @dkropachev , @roydahan - what's the status of this?
>
> @mykaul do you know which of the languages and SDK versions that we support you want this feature to appear in first?

Java, both SDK versions.

nyh · 2024-12-01T13:24:27Z

Implemented in Java, see pull request #40.
Not yet implemented in other languages.

nyh closed this as completed Aug 28, 2024

nyh reopened this Aug 28, 2024

mykaul added the enhancement label Aug 28, 2024

nyh mentioned this issue Sep 11, 2024

Alternator: a better, and rack-aware, /localnodes request scylladb/scylladb#12147

Closed

nyh mentioned this issue Oct 1, 2024

alternator: add "dc" and "rack" options to "/localnodes" request scylladb/scylladb#20915

Closed

roydahan assigned dkropachev Nov 11, 2024

roydahan added the type/epic label Nov 11, 2024

nyh mentioned this issue Nov 24, 2024

Support token aware feature. #11

Open
