Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

controller: try use atomic replace the lock #8968

Draft
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

nolouch
Copy link
Contributor

@nolouch nolouch commented Jan 3, 2025

What problem does this PR solve?

Issue Number: Close #8918

What is changed and how does it work?

Check List

Tests

  • Unit test
  • Integration test
  • Manual test (add detailed scripts or steps below)
  • No code

Code changes

Side effects

  • Possible performance regression
  • Increased code complexity
  • Breaking backward compatibility

Related changes

Release note

None.

Copy link
Contributor

ti-chi-bot bot commented Jan 3, 2025

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@ti-chi-bot ti-chi-bot bot added release-note-none Denotes a PR that doesn't merit a release note. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. dco-signoff: yes Indicates the PR's author has signed the dco. labels Jan 3, 2025
Copy link
Contributor

ti-chi-bot bot commented Jan 3, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from nolouch, ensuring that each of them provides their approval before proceeding. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Jan 3, 2025
@nolouch
Copy link
Contributor Author

nolouch commented Jan 3, 2025

The benchmark shows that the atomic version does not perform better than the lock version:

  • Local MPB (CPU(s): 12):
controller git:(optimize-locks) go test -run=^$ -benchmem -bench ^BenchmarkRequestAndResponseConsumption ./ -count=3 --tags=intest -cpu=2,4,8,16,1024,2048 -v
goos: darwin
goarch: arm64
pkg: github.com/tikv/pd/client/resource_group/controller
cpu: Apple M2 Max
BenchmarkRequestAndResponseConsumptionLockVer
BenchmarkRequestAndResponseConsumptionLockVer-2                 12052750               100.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2                 12201801               100.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2                 11879060                99.62 ns/op          128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  8846917               138.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  8755874               138.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  8746006               136.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  6519382               175.4 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  6812086               172.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  6747466               178.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 5521766               195.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 5893826               195.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 6167022               196.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               4183626               279.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               4416201               258.6 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               4511895               270.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               4261791               284.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               3896812               277.4 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               4593768               274.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer
BenchmarkRequestAndResponseConsumptionAtomicVer-2               18196218                64.43 ns/op          128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2               17640043                61.28 ns/op          128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2               18291357                62.72 ns/op          128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4               10625161               112.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4               10201930                99.93 ns/op          128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4               10800978               110.4 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                3272788               365.4 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                3302931               370.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                3450266               320.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               1738362              1126 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               1000000              1134 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               1750414              1099 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1000000              1016 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1000000              1085 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1000000              1076 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1000000              1068 ns/op             128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1878140               645.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1763342              1088 ns/op             129 B/op          2 allocs/op
PASS
  • cpu: Intel(R) Xeon(R) Gold 6240 CPU @ 2.60GHz. (CPU(s): 72)
➜  controller git:(optimize-locks)  go test -run=^$ -benchmem -bench ^BenchmarkRequestAndResponseConsumption ./ -count=3 --tags=intest -cpu=2,4,8,16,1024,2048 -v
goos: linux
goarch: amd64
pkg: github.com/tikv/pd/client/resource_group/controller
cpu: Intel(R) Xeon(R) Gold 6240 CPU @ 2.60GHz
BenchmarkRequestAndResponseConsumptionLockVer
BenchmarkRequestAndResponseConsumptionLockVer-2                  8621672               133.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2                  8597760               133.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2                  8630604               134.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  7042933               167.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  6869895               165.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-4                  7071890               167.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  5608933               213.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  5561970               228.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-8                  5012782               241.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 3368307               398.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 2986978               389.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-16                 4168164               316.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               3739953               412.6 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               3014074               391.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-1024               3042794               355.9 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               3114734               425.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               3019845               443.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionLockVer-2048               3350582               350.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer
BenchmarkRequestAndResponseConsumptionAtomicVer-2                3692481               302.4 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2                4055090               372.3 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2                4726200               250.8 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4                3653085               321.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4                3776768               337.6 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-4                3859587               311.1 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                3380364               693.6 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                2895282               433.0 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-8                2670873               389.0 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               1639359               732.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               2004878               546.0 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-16               3183615               739.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1214854               935.7 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1540971               983.6 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-1024             1538190               776.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1567233               921.2 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1218561               975.5 ns/op           128 B/op          2 allocs/op
BenchmarkRequestAndResponseConsumptionAtomicVer-2048             1566546               760.2 ns/op           128 B/op          2 allocs/op
PASS

@nolouch
Copy link
Contributor Author

nolouch commented Jan 14, 2025

Here is a benchmark like:

package main

import (
	"sync"
	"sync/atomic"
)

type AtomicCounter struct {
	A int64
	B int64
	C int64
	D int64
}

type MutexCounter struct {
	A  int64
	B  int64
	C  int64
	D  int64
	mu sync.Mutex
}

// Atomic Increment
func IncrementAtomic(c *AtomicCounter, delta int64) {
	atomic.AddInt64(&c.A, delta)
	atomic.AddInt64(&c.B, delta)
	atomic.AddInt64(&c.C, delta)
	atomic.AddInt64(&c.D, delta)
}

// Mutex Increment
func IncrementMutex(c *MutexCounter, delta int64) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.A += delta
	c.B += delta
	c.C += delta
	c.D += delta
}

// Benchmark for Atomic Increment
func BenchmarkIncrementAtomic(b *testing.B) {
	counter := &AtomicCounter{}
	b.RunParallel(func(pb *testing.PB) {
		for pb.Next() {
			IncrementAtomic(counter, 1)
		}
	})
}

// Benchmark for Mutex Increment
func BenchmarkIncrementMutex(b *testing.B) {
	counter := &MutexCounter{}
	b.RunParallel(func(pb *testing.PB) {
		for pb.Next() {
			IncrementMutex(counter, 1)
		}
	})
}

the results show mutex performance is better:

 go test -run=^$ -benchmem -bench ^Benchmark ./ -count=3 --tags=intest -cpu=2,4,8,16,1024,2048 -v
goos: darwin
goarch: arm64
pkg: t
cpu: Apple M2 Max
BenchmarkIncrementAtomic
BenchmarkIncrementAtomic-2              47868838                25.02 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-2              44583422                25.07 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-2              46245669                24.83 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-4              22093156                50.63 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-4              22180888                51.15 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-4              21932145                49.07 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-8              16764277                73.96 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-8              16094618                72.94 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-8              16233921                70.96 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementAtomic-16              9643352               149.1 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-16              9222926               157.1 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-16              9830589               152.1 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-1024            9880392               136.6 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-1024           10059806               101.4 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-1024           10251189               103.5 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-2048           14877484               101.5 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-2048           10221018               116.7 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementAtomic-2048           13793964                87.44 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex
BenchmarkIncrementMutex-2               28088242                52.05 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-2               17808486                68.88 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-2               22694785                68.41 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-4               13536346               132.1 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementMutex-4               10546023               121.0 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementMutex-4               12471565               128.4 ns/op             0 B/op          0 allocs/op
BenchmarkIncrementMutex-8               15427584                80.62 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-8               14986394                79.86 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-8               15414992                79.47 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-16              15839230                76.11 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-16              15756336                76.09 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-16              16016398                75.84 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-1024            13441694                83.53 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-1024            13345411                83.49 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-1024            13679526                83.46 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-2048            12699543                83.21 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-2048            13008399                81.19 ns/op            0 B/op          0 allocs/op
BenchmarkIncrementMutex-2048            12804404                80.50 ns/op            0 B/op          0 allocs/op

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dco-signoff: yes Indicates the PR's author has signed the dco. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. release-note-none Denotes a PR that doesn't merit a release note. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Optimize the lock usage in resource controller
1 participant