replica selector refactor #1142

crazycs520 · 2024-01-26T02:01:30Z

What changed and how does it work?

Since the old version replicaSelector is hard to test and understand, This PR introduces a new replica selector implementation replicaSelectorV2, which is less code and easer to understand.

For rollback, I don't remove the old version replicaSelector, and add a new config EnableReplicaSelectorV2 whose default value is true, which means use replicaSelectorV2 by default, set it's value to false will use old version replicaSelector. After replicaSelectorV2 is stable, maybe we can consider remove old version replicaSelector and config EnableReplicaSelectorV2.

How to test?

All the old tests are pass.
I add many test case in replica_selector_test.go to make sure EnableReplicaSelectorV2 has the same behavior with old version replicaSelector.
- TestReplicaReadAccessPathByCase
- TestReplicaReadAccessPathByCase2
- TestReplicaReadAccessPathByBasicCase
- TestReplicaReadAccessPathByLeaderCase
- TestReplicaReadAccessPathByFollowerCase
- TestReplicaReadAccessPathByMixedAndPreferLeaderCase
- TestReplicaReadAccessPathByStaleReadCase
- TestReplicaReadAccessPathByTryIdleReplicaCase
- TestReplicaReadAccessPathByFlashbackInProgressCase
- TestReplicaReadAccessPathByProxyCase
- TestReplicaReadAccessPathByLearnerCase
I write a TestReplicaReadAccessPathByGenError test to generate various combinations of region errors to verify that EnableReplicaSelectorV2 has the same behavior with old version replicaSelector. From the following log, you can see this test generated a total 2200723 cases.
```
[2024/03/05 10:27:52.562 +08:00] [INFO] [replica_selector_test.go:2511] ["TestReplicaReadAccessPathByGenError Finished"] [total-case=2200723] [valid-case=857630] [invalid-case=1343093]
```
The test coverage of replicaSelectorV2 is more than 98%.

Advice when reviewing PR

If you have any questions about the logic about replicaSelector, you can try to modify the logic code, and then run the test to see whether the test results are as expected.

cd internal/locate
go test -run=TestReplicaReadAccessPathBy.*Case   # run some test cases, this is fast and less than 1s on my m1 mbp.
go test > test.log   # run all pkg test, this may cost 230s on my m1 mbp.

Signed-off-by: crazycs520 <[email protected]>

…ad request Signed-off-by: crazycs520 <[email protected]>

Signed-off-by: crazycs520 <[email protected]>

zyguan

Overall LGTM. We may need to do some basic benchmarks and stablility tests to make sure there is no obvious regression.

internal/locate/region_cache.go

internal/locate/replica_selector.go

internal/locate/region_request.go

internal/locate/replica_selector.go

Signed-off-by: crazycs520 <[email protected]>

internal/locate/region_cache.go

MyonKeminta · 2024-03-05T03:34:29Z

internal/locate/replica_selector.go

+) (*replicaSelectorV2, error) {
+	cachedRegion := regionCache.GetCachedRegionWithRLock(regionID)
+	if cachedRegion == nil || !cachedRegion.isValid() {
+		return nil, errors.New("cached region invalid")


Ok. It's fine to me for this PR to keep the logic consistent, but actually I think it doesn't make sense to let this error be recorded as an rpcError, which marks the RPC's result undetermined:

// (in getRPCContext) if err != nil { s.rpcError = err return nil, nil }

I think it would be good if the error handling can be refined in the future

internal/locate/replica_selector.go

MyonKeminta · 2024-03-05T08:51:12Z

internal/locate/replica_selector.go

+	metrics.TiKVReplicaSelectorFailureCounter.WithLabelValues("exhausted").Inc()
+	for _, r := range replicas {
+		if r.deadlineErrUsingConfTimeout {
+			// when meet deadline exceeded error, do fast retry without invalidate region cache.


Is it sure that the deadlineErrUsingConfTimeout is produced during the current attempt?

No. The logic is if have meet deadlineErrUsingConfTimeout error in this round, don't invalidate region to let up layer to remove kv_read_timeout then do fast retry.

This logic is same with the old version.

MyonKeminta · 2024-03-05T09:23:57Z

internal/locate/replica_selector.go

+				// when has match labels, prefer leader than not-matched peers.
+				score += scoreOfPreferLeader


Should a not-slow normal follower (NotSlow + NormalPeer -> 2) have the same priority as a slow leader in prefer-leader mode (PreferLeader -> 2) ?

Is there an assumption that in cases that leader must be expected (such as the first attempt in prefer-leader mode and the second attempt in stale read mode) must be done with the ReplicaSelectLeaderStrategy instead of using the calculateScore? If so, I think there should be some comments to explain that this function doesn't handle those cases.

If LabelMatch always has higher priority than PreferLeader, and PreferLeader always has higher priority than NormalPeer and NotSlow, is it possible that we use different bits for these factors, so that the more significant bit a factor is at, the more dominant status the factor is? For factors that has the same priority, we reserve more bits for them. For example, we can have such definition:

const ( flagLabelMatches = 1 << 3 flagPreferLeader = 1 << 2 flagNormalPeer = 1 flagNotSlow = 1 // So that we have definition of the final score: // MSB LSB // [unused bits][1 bit: LabelMatches][1 bit: PreferLeader][2 bits: NormalPeer + NotSlow] )

I'm not sure if this is possible to give the same result as your current code, but I think in this way, we can have less cases like:

Factor A < Factor C

Factor B < Factor C

Factor A + Factor B > Factor C

which I'm afraid may make the priority calculation less comprehensible. Another benefit is that we can mostly know how a score comes from by reading its bits, which may help in diagnosing.

Great catch, done as your advice.

Is there an assumption that in cases that leader must be expected (such as the first attempt in prefer-leader mode and the second attempt in stale read mode) must be done with the ReplicaSelectLeaderStrategy instead of using the calculateScore? If so, I think there should be some comments to explain that this function doesn't handle those cases.

I'm not very sure. If so, please consider adding some comments to note that to avoid misunderstanding of the scoring rule.

Signed-off-by: crazycs520 <[email protected]>

internal/locate/region_request.go

internal/locate/replica_selector.go

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta

Mostly LGTM but I'm not sure if I've understood it totally... I need some more time

MyonKeminta · 2024-03-06T08:16:56Z

internal/locate/region_request3_test.go

+	sender := NewRegionRequestSender(s.cache, s.regionRequestSender.client)
+	sender.regionCache.enableForwarding = true


Is there any reason to create a new sender instead of using the prepared one?

I was worried that sharing the same sender for each test might have concurrency issues, so I create a new sender for each test.

🤔 I assume that the fields initialized in SetupTest should be done for each single test

// SetupTestSuite has a SetupTest method, which will run before each // test in the suite. type SetupTestSuite interface { SetupTest() }

MyonKeminta · 2024-03-06T08:49:10Z

internal/locate/region_request.go

@@ -633,7 +645,7 @@ func (state *accessByKnownProxy) onSendFailure(bo *retry.Backoffer, selector *re
 }

 func (state *accessByKnownProxy) onNoLeader(selector *replicaSelector) {
-	selector.state = &invalidLeader{}
+	selector.state = &tryFollower{leaderIdx: state.leaderIdx, lastIdx: state.leaderIdx, fromAccessKnownLeader: true}


So is it that it won't trigger upper-layer's backoff in this case?

This is for #1207

Since I need to test replicaSelectorV2 has some beha behavior with old replicaSelector, so I just change the old replicaSelector behavior here. If I don't change this, I'll have to modify the replicaSelectorV2 code to accommodate this strange behavior.

internal/locate/replica_selector.go

MyonKeminta · 2024-03-06T09:03:38Z

internal/locate/replica_selector.go

+				// when has match labels, prefer leader than not-matched peers.
+				score += scoreOfPreferLeader


Is there an assumption that in cases that leader must be expected (such as the first attempt in prefer-leader mode and the second attempt in stale read mode) must be done with the ReplicaSelectLeaderStrategy instead of using the calculateScore? If so, I think there should be some comments to explain that this function doesn't handle those cases.

I'm not very sure. If so, please consider adding some comments to note that to avoid misunderstanding of the scoring rule.

internal/locate/region_request.go

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta

Rest LGTM

internal/locate/replica_selector.go

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta

LGTM

Signed-off-by: crazycs520 <[email protected]>

internal/locate/replica_selector.go

Signed-off-by: crazycs520 <[email protected]>

you06

rest LGTM

internal/locate/replica_selector_test.go

Signed-off-by: crazycs520 <[email protected]>

crazycs520 added 18 commits January 23, 2024 14:16

init

9920ba3

Signed-off-by: crazycs520 <[email protected]>

add ReplicaSelector interface

1a3f367

Signed-off-by: crazycs520 <[email protected]>

add replica_selector_v2, todo: fix test

31f9f5d

Signed-off-by: crazycs520 <[email protected]>

fix all test

5e57170

Signed-off-by: crazycs520 <[email protected]>

fix test in another way to compatible with old version about stale-re…

fc9b192

…ad request Signed-off-by: crazycs520 <[email protected]>

tiny refactor

b60f4e1

Signed-off-by: crazycs520 <[email protected]>

refactor to remove duplicate code

0df7353

Signed-off-by: crazycs520 <[email protected]>

add more test

42d67bd

Signed-off-by: crazycs520 <[email protected]>

refine onServerIsBusy region error handing logic

36cd5c1

Signed-off-by: crazycs520 <[email protected]>

support forwarding by proxy

3cf3cdd

Signed-off-by: crazycs520 <[email protected]>

support busyThreshold and tiny reractor code

3ff9542

Signed-off-by: crazycs520 <[email protected]>

add config

1265452

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' of https://github.com/tikv/client-go into cs-dev1

6a11f6e

fix test

ce864de

Signed-off-by: crazycs520 <[email protected]>

fix proxy bug

2b7c636

Signed-off-by: crazycs520 <[email protected]>

fix test and tiny refactor

fafb520

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' of https://github.com/tikv/client-go into cs-dev1

d97a061

add some test

c6a10bf

Signed-off-by: crazycs520 <[email protected]>

zyguan reviewed Feb 7, 2024

View reviewed changes

crazycs520 added 11 commits February 18, 2024 14:37

add more test

0309452

Signed-off-by: crazycs520 <[email protected]>

tidy refine

dc55c5b

Signed-off-by: crazycs520 <[email protected]>

address comment

055204f

Signed-off-by: crazycs520 <[email protected]>

fix test in v1

6514588

Signed-off-by: crazycs520 <[email protected]>

add backoff-cnt check and timeout test

b411664

Signed-off-by: crazycs520 <[email protected]>

refine test

a6bda0c

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' of https://github.com/tikv/client-go into cs-dev1

f03a51f

Signed-off-by: crazycs520 <[email protected]>

refine code

62e52e0

Signed-off-by: crazycs520 <[email protected]>

refine code

190e0cb

Signed-off-by: crazycs520 <[email protected]>

add benchmark test

6f5b52c

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' of https://github.com/tikv/client-go into cs-dev1

29bed25

move test

88a19a1

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta reviewed Mar 5, 2024

View reviewed changes

crazycs520 added 3 commits March 5, 2024 19:45

address comment

cd38d95

Signed-off-by: crazycs520 <[email protected]>

use new score calculation by flag bit

db60472

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' into cs-dev1

4fd6021

you06 reviewed Mar 6, 2024

View reviewed changes

internal/locate/region_request.go Outdated Show resolved Hide resolved

internal/locate/replica_selector.go Outdated Show resolved Hide resolved

crazycs520 added 2 commits March 6, 2024 10:18

make test stable

fb0419f

Signed-off-by: crazycs520 <[email protected]>

refine comment,license and fix lint

6f55235

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta reviewed Mar 6, 2024

View reviewed changes

crazycs520 added 2 commits March 6, 2024 18:39

refine comment

b7f9933

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' into cs-dev1

1225db1

cfzjywxk requested review from zyguan, you06 and MyonKeminta March 7, 2024 02:43

zyguan approved these changes Mar 7, 2024

View reviewed changes

MyonKeminta reviewed Mar 7, 2024

View reviewed changes

internal/locate/replica_selector.go Show resolved Hide resolved

internal/locate/replica_selector.go Outdated Show resolved Hide resolved

refine comment

8557bfd

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta approved these changes Mar 7, 2024

View reviewed changes

MyonKeminta mentioned this pull request Mar 7, 2024

Performance/health feedback to client and the Unified Health Controller tikv/tikv#16297

Closed

8 tasks

fix race test timeout

e4d576c

Signed-off-by: crazycs520 <[email protected]>

MyonKeminta reviewed Mar 7, 2024

View reviewed changes

internal/locate/replica_selector.go Outdated Show resolved Hide resolved

crazycs520 added 3 commits March 8, 2024 13:56

add flagNotAttemp in score

da0b829

Signed-off-by: crazycs520 <[email protected]>

Merge branch 'master' into cs-dev1

f98282c

Merge branch 'master' of https://github.com/tikv/client-go into cs-dev1

b92fc47

you06 reviewed Mar 11, 2024

View reviewed changes

internal/locate/replica_selector_test.go Outdated Show resolved Hide resolved

internal/locate/replica_selector_test.go Outdated Show resolved Hide resolved

internal/locate/replica_selector_test.go Outdated Show resolved Hide resolved

address comment

6e64263

Signed-off-by: crazycs520 <[email protected]>

you06 approved these changes Mar 11, 2024

View reviewed changes

cfzjywxk approved these changes Mar 11, 2024

View reviewed changes

cfzjywxk merged commit 8d6a95f into tikv:master Mar 11, 2024
10 checks passed

crazycs520 mentioned this pull request Apr 7, 2024

remove old replica selector #1266

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replica selector refactor #1142

replica selector refactor #1142

crazycs520 commented Jan 26, 2024 •

edited

Loading

zyguan left a comment

MyonKeminta Mar 5, 2024

MyonKeminta Mar 5, 2024

crazycs520 Mar 5, 2024

crazycs520 Mar 5, 2024

MyonKeminta Mar 5, 2024

crazycs520 Mar 5, 2024

MyonKeminta Mar 6, 2024

MyonKeminta left a comment

MyonKeminta Mar 6, 2024

crazycs520 Mar 6, 2024

MyonKeminta Mar 7, 2024 •

edited

Loading

MyonKeminta Mar 6, 2024

crazycs520 Mar 6, 2024

MyonKeminta Mar 6, 2024

MyonKeminta left a comment

MyonKeminta left a comment •

edited

Loading

you06 left a comment

		// when has match labels, prefer leader than not-matched peers.
		score += scoreOfPreferLeader

		sender := NewRegionRequestSender(s.cache, s.regionRequestSender.client)
		sender.regionCache.enableForwarding = true

replica selector refactor #1142

replica selector refactor #1142

Conversation

crazycs520 commented Jan 26, 2024 • edited Loading

What changed and how does it work?

How to test?

Advice when reviewing PR

zyguan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MyonKeminta left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MyonKeminta Mar 7, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MyonKeminta left a comment

Choose a reason for hiding this comment

MyonKeminta left a comment • edited Loading

Choose a reason for hiding this comment

you06 left a comment

Choose a reason for hiding this comment

crazycs520 commented Jan 26, 2024 •

edited

Loading

MyonKeminta Mar 7, 2024 •

edited

Loading

MyonKeminta left a comment •

edited

Loading