
Shard selecting load balancing #791

Closed
wants to merge 6 commits

Conversation

wprzytula
Collaborator

Motivation

Most of our drivers, being inherited from Cassandra, load balance only over nodes, not specific shards. Multiple ideas have arisen that could benefit from shard-selecting load balancing. Among them:

  • shard-aware batching (Shard aware batching - add Session::shard_for_statement & Batch::enforce_target_node #738);
  • tablets support:
    with tablets enabled (at the moment experimental in ScyllaDB), the target shard is not derived from the token (computed from the partition key), but rather read from system.tablets. Therefore, a load balancer should be able to decide a target shard on its own, abstracting over either the token ring or tablets being used for cluster topology.
  • overloaded shard optimisation:
    some tests have shown that sometimes, when a shard is particularly overloaded, it may be beneficial (performance-wise) to send the request to the proper node but a wrong shard. That shard would then do part of the work that the overloaded shard would otherwise have to do itself.

Design

  • The LB policy now returns a (NodeRef, Shard) pair, enabling finer-grained control over targeted shards.
  • regarding tablets support: ReplicaLocator is the place where the abstraction over either token ring or tablets is to be implemented. Ideally, the LB policy does not have to be aware of the actual mechanism (token ring or tablets) being used for a particular query.

What's done

  • internal and public load-balancing-related interfaces are changed from NodeRef to (NodeRef, Shard) pair,
  • shard selection logic is removed from NodeConnectionPool; a method is added there that returns a connection to a specific shard,
  • Session's logic propagates the load balancing policy's target shard down to the connection pool,
  • a stub implementation of shard selection is added to ReplicaLocator. At the moment, it simply computes the shard based on the token, the same way as it was done in the connection pool layer before.
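To make the data flow above concrete, here is a minimal, self-contained sketch of how a (NodeRef, Shard) pair might travel from a load-balancing policy down to the connection pool. All types here (`Node`, `NodeRef`, `Shard`, `TokenModuloPolicy`, `connection_for_shard`) are simplified stand-ins invented for illustration, not the driver's real API:

```rust
// Sketch of the (NodeRef, Shard) flow described above, using simplified
// stand-in types; the real driver's RoutingInfo, ClusterData and
// connection pool are far richer than this.

/// Stand-in for the driver's shard identifier.
type Shard = u32;

/// Stand-in for a node known to the cluster metadata.
#[derive(Debug)]
struct Node {
    id: u8,
    shard_count: u32,
}

/// Borrowed reference to a node, as a policy would return it.
type NodeRef<'a> = &'a Node;

/// Hypothetical trimmed-down policy trait: the key point is that `pick`
/// now returns a (node, shard) pair instead of just a node.
trait LoadBalancingPolicy {
    fn pick<'a>(&self, cluster: &'a [Node], token: i64) -> Option<(NodeRef<'a>, Shard)>;
}

/// Toy policy: picks the first node and derives the shard from the token,
/// mimicking the stub shard selection added to ReplicaLocator.
struct TokenModuloPolicy;

impl LoadBalancingPolicy for TokenModuloPolicy {
    fn pick<'a>(&self, cluster: &'a [Node], token: i64) -> Option<(NodeRef<'a>, Shard)> {
        let node = cluster.first()?;
        let shard = (token.unsigned_abs() as u32) % node.shard_count;
        Some((node, shard))
    }
}

/// Stand-in for the pool: instead of choosing a shard itself, it now
/// receives the target shard decided by the policy.
fn connection_for_shard(node: &Node, shard: Shard) -> String {
    format!("conn(node={}, shard={})", node.id, shard)
}

fn main() {
    let cluster = vec![
        Node { id: 1, shard_count: 4 },
        Node { id: 2, shard_count: 4 },
    ];
    let (node, shard) = TokenModuloPolicy.pick(&cluster, 10).unwrap();
    // The session layer would propagate the policy's shard choice here.
    println!("{}", connection_for_shard(node, shard));
}
```

The point of the sketch is only the signature change: shard selection moves out of the pool and into whatever sits behind the policy.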

Pre-review checklist

  • I have split my patch into logically separate commits.
  • All commit messages clearly explain what they change and why.
  • I added relevant tests for new features and bug fixes.
  • All commits compile, pass static checks and pass tests.
  • PR description sums up the changes and reasons why they should be introduced.
  • I have provided docstrings for the public items that I want to introduce.
  • I have adjusted the documentation in ./docs/source/.
  • I added appropriate Fixes: annotations to PR description.

```rust
    &'a self,
    query: &'a RoutingInfo,
    cluster: &'a ClusterData,
) -> Option<(NodeRef<'a>, Shard)>;
```
Contributor
I'm more for hiding the (NodeRef, Shard) pair in some struct, e.g.

```rust
pub struct PlanElement<'n> {
    node_ref: NodeRef<'n>,
    shard: Shard,
}
```

Converting the load_balancing interfaces to use it instead of a plain tuple would give us greater flexibility in adding/removing PlanElement's fields without breaking the API.

Collaborator Author

If we hide them, then how could a user implement their own policies?

Contributor

By using PlanElements produced by the locator.

Collaborator Author

What operations do you propose to be pub on PlanElement? Will it be possible to examine its contents? Will it be possible to alter them? What about crafting one's own PlanElement?

Contributor

What operations do you propose to be pub on PlanElement? Will it be possible to examine its contents?

Viewing shard and node_ref should be user-accessible.

What about crafting one's own PlanElement?

I don't immediately see why one would like to craft their own PlanElement. The objects required to do so (NodeRef + Shard) can only be obtained through the locator.

Will it be possible to alter them?

Allowing the user to influence shard selection sounds like a good reason for altering the shard field.
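The access rules discussed here (read both fields, write only the shard) could be sketched roughly as follows. This is a hypothetical illustration with placeholder `Node`/`NodeRef`/`Shard` types, not a proposal from the PR itself:

```rust
// Hypothetical PlanElement with the accessors discussed above:
// read access to both fields, write access only to `shard`.

type Shard = u32;

#[derive(Debug)]
struct Node {
    id: u8,
}

type NodeRef<'a> = &'a Node;

pub struct PlanElement<'n> {
    node_ref: NodeRef<'n>,
    shard: Shard,
}

impl<'n> PlanElement<'n> {
    /// Viewing node_ref is user-accessible.
    pub fn node_ref(&self) -> NodeRef<'n> {
        self.node_ref
    }

    /// Viewing shard is user-accessible.
    pub fn shard(&self) -> Shard {
        self.shard
    }

    /// Write access only to `shard`, letting a user policy influence
    /// shard selection without being able to forge a node reference.
    pub fn set_shard(&mut self, shard: Shard) {
        self.shard = shard;
    }
}

fn main() {
    let node = Node { id: 7 };
    // In the real design only the locator would construct this;
    // direct construction works here because we are in the same module.
    let mut element = PlanElement { node_ref: &node, shard: 0 };
    element.set_shard(3);
    println!("node={}, shard={}", element.node_ref().id, element.shard());
}
```

Keeping the fields private while exposing getters and a single setter is one way to preserve the "fields can change without breaking the API" flexibility argued for earlier in the thread.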

Collaborator Author

I don't immediately see why one would like to craft their own PlanElement. Objects required to do so (NodeRef + Shard) can only be obtained through locator.

Precisely the case of #738 is one where a user would like to craft their own PlanElement, or at least a case close to crafting one's own.

Collaborator Author

BTW, is ReplicaLocator only responsible for locating replicas, or is it, from the LB policy's point of view, also the only source of knowledge about not-necessarily-replica nodes in the cluster?

@Lorak-mmk
Collaborator

I'll close this - we won't be merging this PR on its own; it will be part of the tablets PR.

3 participants