Replace dict with new hashtable: sorted set datatype by rainsupreme · Pull Request #1427 · valkey-io/valkey

rainsupreme · 2024-12-11T17:23:58Z

This PR replaces dict with hashtable in the ZSET datatype. Instead of mapping key to score as dict did, the hashtable maps key to a node in the skiplist, which contains the score. This takes advantage of hashtable performance improvements and saves 15 bytes per set item - 24 bytes overhead before, 9 bytes after.

Closes #1096

zuiderkwast · 2024-12-17T23:10:56Z

This PR is ready for review! Needed changes have been merged and I've rebased

Great! You could mark it as not draft then. :) We use the top comment and PR title for the final commit message when it gets merged, so you could update those to concisely describe the change (i.e. like a commit message).

It's late in my time zone so I'll look tomorrow.

codecov · 2024-12-18T06:59:32Z

Codecov Report

Attention: Patch coverage is 92.55663% with 23 lines in your changes missing coverage. Please review.

Project coverage is 70.75%. Comparing base (b3b4bdc) to head (8316c90).
Report is 11 commits behind head on unstable.

Files with missing lines	Patch %	Lines
src/module.c	0.00%	10 Missing ⚠️
src/defrag.c	84.37%	5 Missing ⚠️
src/object.c	37.50%	5 Missing ⚠️
src/db.c	89.47%	2 Missing ⚠️
src/debug.c	92.30%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #1427      +/-   ##
============================================
- Coverage     70.83%   70.75%   -0.09%     
============================================
  Files           120      120              
  Lines         64911    64959      +48     
============================================
- Hits          45982    45962      -20     
- Misses        18929    18997      +68

Files with missing lines	Coverage Δ
src/aof.c	`80.23% <100.00%> (+0.11%)`	⬆️
src/evict.c	`98.47% <100.00%> (-0.38%)`	⬇️
src/geo.c	`93.58% <100.00%> (+0.02%)`	⬆️
src/rdb.c	`76.75% <100.00%> (+0.67%)`	⬆️
src/server.c	`87.61% <100.00%> (+0.14%)`	⬆️
src/server.h	`100.00% <ø> (ø)`
src/sort.c	`94.82% <100.00%> (-0.34%)`	⬇️
src/t_zset.c	`96.80% <100.00%> (+1.13%)`	⬆️
src/debug.c	`52.12% <92.30%> (+0.13%)`	⬆️
src/db.c	`89.48% <89.47%> (-0.07%)`	⬇️
... and 3 more

... and 37 files with indirect coverage changes

zuiderkwast

This looks awesome. Just a few nits.

ranshid

great work @SoftlyRaining !

Some minor comments and I feel somewhat uncomfortable about the way we implemented the union.

zuiderkwast

Just fix the naming converion (compareNodeScoreEle -> zslCompareNodeScoreEle, etc.). Then I think it's good to merge.

I don't really get why you want to delete zslGetRank. If it's an optimization beyond the replacement of dict, then it can as well be a follow up, right? As you want.

rainsupreme · 2025-01-02T19:23:55Z

I poked around and found that all uses of zslGetRank could more efficiently use zslGetRankByNode, and it became dead code. I already made the revision at any rate but you're right, it might've been a separate small cleanup PR 😅

Signed-off-by: Rain Valentine <rsg000@gmail.com>

…etRank Signed-off-by: Rain Valentine <rsg000@gmail.com>

ranshid · 2025-01-05T06:59:08Z

I poked around and found that all uses of zslGetRank could more efficiently use zslGetRankByNode, and it became dead code. I already made the revision at any rate but you're right, it might've been a separate small cleanup PR 😅

@SoftlyRaining My only concern is a potential small degradation in performance of zcount and zlexcount. can we just make sure to verify we have no impact on these operations?

rainsupreme · 2025-01-06T21:43:35Z

I poked around and found that all uses of zslGetRank could more efficiently use zslGetRankByNode, and it became dead code. I already made the revision at any rate but you're right, it might've been a separate small cleanup PR 😅

@SoftlyRaining My only concern is a potential small degradation in performance of zcount and zlexcount. can we just make sure to verify we have no impact on these operations?

I will remove that aspect of the change so it can be more thoroughly investigated and benchmarked as a separate PR. I'd prefer to avoid delaying the core hashtable work. :)

Signed-off-by: Rain Valentine <rsg000@gmail.com>

zuiderkwast

LGTM

@ranshid The zslGetRank changes have been removed, so I guess it's safe to merge. WDYT?

ranshid

Small comment which I think we could skip handling for now and maybe only extend the comment.
I did not rescan the entire change so LGTM

ranshid · 2025-01-08T08:03:43Z

+    if ((node->backward == NULL || node->backward->score < newscore) &&
+        (node->level[0].forward == NULL || node->level[0].forward->score > newscore)) {
+        node->score = newscore;
+        return NULL;
+    }


Small mark: it is still possible that a node position change will NOT take place after this check. For example in case we update the score to something that exactly matches the score of the prev or next node.
The check can be extended to check also equality of the score (but will also need to compare the key order).
I guess this is fine for now, but maybe extend the comment above to explain that?

Right, the edge cases. Now, we remove and re-insert the node at the same position in this case, which is OK. It's not introduced in this PR anyway. We can convert the comment to a follow-up issue.

We could use zslCompareNodes here, but then we'd need to set the new score before we compare and then revert it if the check fails. Or use a stack-allocated temporary node for comparing, just to be able to use zslCompareNodes.

Suggested change

if ((node->backward == NULL || node->backward->score < newscore) &&

(node->level[0].forward == NULL || node->level[0].forward->score > newscore)) {

node->score = newscore;

return NULL;

}

double oldscore = node->score;

node->score = newscore;

if ((node->backward == NULL || zslCompareNodes(node->backward, node) <= 0) &&

zslCompareNodes(node->level[0].forward, node) >= 0) {

return NULL;

} else {

/* Restore score to restore skiplist order. */

node->score = oldscore;

}

about edge cases: allowing score to be equal to pref or next node and also compare ele in these cases. Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech>

This PR replaces dict with hashtable in the ZSET datatype. Instead of mapping key to score as dict did, the hashtable maps key to a node in the skiplist, which contains the score. This takes advantage of hashtable performance improvements and saves 15 bytes per set item - 24 bytes overhead before, 9 bytes after. Closes valkey-io#1096 --------- Signed-off-by: Rain Valentine <rsg000@gmail.com> Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech> Co-authored-by: Viktor Söderqvist <viktor.soderqvist@est.tech> Signed-off-by: proost <jwalag87@gmail.com>

This PR replaces dict with hashtable in the ZSET datatype. Instead of mapping key to score as dict did, the hashtable maps key to a node in the skiplist, which contains the score. This takes advantage of hashtable performance improvements and saves 15 bytes per set item - 24 bytes overhead before, 9 bytes after. Closes valkey-io#1096 --------- Signed-off-by: Rain Valentine <rsg000@gmail.com> Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech> Co-authored-by: Viktor Söderqvist <viktor.soderqvist@est.tech>

* Embed sds element inside skiplist nodes: Changed zset dict to store zskiplistNode* as keys (with no_value=1) instead of storing sds keys and double* values, eliminating redundant sds storage and enabling single-allocation nodes * Single allocation for skiplist nodes: Each node now contains: fixed fields + level[] array + embedded sds, reducing memory fragmentation and allocation overhead. This optimization is based on valkey-io/valkey#1427 * Optimize lookups with dictFindLink: Use dictFindLink in zsetAdd to avoid double hash table lookup when inserting new elements (find + add becomes single operation) * Simplify score updates

rainsupreme marked this pull request as draft December 11, 2024 17:24

rainsupreme force-pushed the zset-datatype branch 2 times, most recently from d2d854d to 6a8ee44 Compare December 17, 2024 22:49

rainsupreme force-pushed the zset-datatype branch from 6a8ee44 to 382d0ba Compare December 18, 2024 00:21

rainsupreme marked this pull request as ready for review December 18, 2024 00:36

rainsupreme force-pushed the zset-datatype branch from 382d0ba to 37811f6 Compare December 18, 2024 00:43

rainsupreme changed the title ~~[draft] replace dict with hashtable: ZSET datatype~~ replace dict with hashtable: ZSET datatype Dec 18, 2024

enjoy-binbin added release-notes This issue should get a line item in the release notes run-extra-tests Run extra tests on this PR (Runs all tests from daily except valgrind and RESP) labels Dec 18, 2024

zuiderkwast requested review from ranshid and zuiderkwast December 18, 2024 16:52

zuiderkwast reviewed Dec 18, 2024

View reviewed changes

Comment thread src/db.c Outdated

Comment thread src/t_zset.c Outdated

Comment thread src/t_zset.c Outdated

Comment thread src/t_zset.c Outdated

Comment thread src/defrag.c

Comment thread src/server.c Outdated

Comment thread src/t_zset.c Outdated

ranshid reviewed Dec 19, 2024

View reviewed changes

Comment thread src/db.c Outdated

Comment thread src/defrag.c Outdated

Comment thread src/t_zset.c Outdated

zuiderkwast changed the title ~~replace dict with hashtable: ZSET datatype~~ Replace dict with new hashtable: sorted set datatype Dec 21, 2024

ranshid reviewed Jan 1, 2025

View reviewed changes

Comment thread src/t_zset.c Outdated

Comment thread src/t_zset.c Outdated

rainsupreme mentioned this pull request Jan 2, 2025

Replace dict with new hashtable: hash datatype #1502

Merged

zuiderkwast reviewed Jan 2, 2025

View reviewed changes

Comment thread src/defrag.c Outdated

rainsupreme force-pushed the zset-datatype branch from b87e508 to e35c608 Compare January 2, 2025 19:20

rainsupreme added 4 commits January 4, 2025 00:19

convert ZSET from dict -> hashtable (squashed)

511bb86

Signed-off-by: Rain Valentine <rsg000@gmail.com>

PR feedback

8b91917

Signed-off-by: Rain Valentine <rsg000@gmail.com>

PR feedback, no temp hashtable for unions, misc cleanup

a5a234b

Signed-off-by: Rain Valentine <rsg000@gmail.com>

zslCompareNodes to define order, optimize zslUpdateScore, remove zslG…

6644ac5

…etRank Signed-off-by: Rain Valentine <rsg000@gmail.com>

rainsupreme force-pushed the zset-datatype branch from e35c608 to 6644ac5 Compare January 4, 2025 00:20

ranshid mentioned this pull request Jan 5, 2025

Improve readability and efficiency for ZUNION operation #1506

Closed

restore alt rank method for later investigation

c8e1ba7

Signed-off-by: Rain Valentine <rsg000@gmail.com>

zuiderkwast approved these changes Jan 8, 2025

View reviewed changes

ranshid approved these changes Jan 8, 2025

View reviewed changes

zuiderkwast reviewed Jan 8, 2025

View reviewed changes

Comment thread src/t_zset.c Outdated

Add TODO comment in zslUpdateScore

8316c90

about edge cases: allowing score to be equal to pref or next node and also compare ele in these cases. Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech>

zuiderkwast merged commit ab627d6 into valkey-io:unstable Jan 8, 2025

rainsupreme deleted the zset-datatype branch January 31, 2025 23:33

moticless mentioned this pull request Jan 18, 2026

Optimize zset to use dict with no_value=1 redis/redis#14701

Merged

Conversation

rainsupreme commented Dec 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zuiderkwast commented Dec 17, 2024

Uh oh!

codecov Bot commented Dec 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

zuiderkwast left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ranshid left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zuiderkwast left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rainsupreme commented Jan 2, 2025

Uh oh!

ranshid commented Jan 5, 2025

Uh oh!

rainsupreme commented Jan 6, 2025

Uh oh!

zuiderkwast left a comment

Choose a reason for hiding this comment

Uh oh!

ranshid left a comment

Choose a reason for hiding this comment

Uh oh!

ranshid Jan 8, 2025

Choose a reason for hiding this comment

Uh oh!

zuiderkwast Jan 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rainsupreme commented Dec 11, 2024 •

edited

Loading

codecov Bot commented Dec 18, 2024 •

edited

Loading

ranshid left a comment •

edited

Loading