Optimize ZRANK by avoiding string comparisons during skiplist traversal #14636

moticless · 2025-12-21T18:13:04Z

This optimization is based on Valkey valkey-io/valkey#1389

Summary
This PR optimizes the ZRANK command by eliminating expensive string comparisons during skiplist traversal.

The issue
The original ZRANK implementation:

Locates the element in the sorted set hash table
Retrieves the score and uses it to search for the element in the skiplist
During skiplist traversal, performs string comparisons at each level to find the exact element
Sums spans along the path to calculate the rank

String comparisons are expensive as they require accessing multiple memory locations and comparing byte-by-byte. - Performance profiling showed these comparisons can consume up to 20% of the rank calculation time.

The Optimization
Leverages the fact that the hash table value points directly to &node->score in the skiplist node. Using pointer arithmetic, we can:

Get the skiplist node directly from the hash table entry (O(1) lookup)

Walk upward from the node to the end of the list using stored node heights
Sum spans at the highest level of each node
Calculate rank as list_length - sum_of_spans

Key insight: At level 0, the span is always 1 (or 0 for the last node), so we repurpose level[0].span to store the node's height. This enables O(1) access to node height without adding new fields.

This optimization improves ZRANK performance by 2-14% across different data distributions (asissted lua and randomize zrank scores. TPS):

scores	elements_per_score	opt	unstable	diff_%
1	1000000	250613	220587	-11.98%
10	100000	270210	231733	-14.24%
100	10000	268750	233641	-13.06%
1000	1000	276227	241396	-12.61%
10000	100	271355	247029	-8.96%
100000	10	275287	263154	-4.41%
1000000	1	267492	261516	-2.23%

This optimization is based on Valkey PR redis#1389. The key improvements are: 1. Repurpose level[0].span to store node height (since it's always 1 or 0) 2. Add helper functions to manage span and height access 3. Implement zslGetRankByNode() to calculate rank by walking upward from a node 4. Add zsetGetSLNodeByEntry() to get skiplist node directly from dict entry 5. Update zsetRank() to use the optimized path for skiplist encoding Performance improvements (based on Valkey benchmarks with 1M elements): - Throughput: 8-45% improvement (best with fewer unique scores) - Latency: 6-30% reduction (best with fewer unique scores) The optimization avoids expensive string comparisons by using pointer arithmetic to get the skiplist node directly from the dict entry, then walking upward through the skiplist using stored node heights. All existing tests pass. No changes to command syntax, RDB format, AOF format, or replication protocol.

src/t_zset.c

shahsb · 2025-12-28T10:02:12Z

src/server.h

+        /* Span is the number of elements between this node and the next node at this level.
+         * At level 0, span is always 1 (or 0 for the last node), so we repurpose it to store
+         * the node's level. This enables O(1) access to node level for rank calculations. */
        unsigned long span;


This assumes level-0 spans are always 1 or 0, which holds for a standard skiplist (every node connects forward at L0 unless it's the tail). But confirm: Does insertion/deletion ever set non-trivial L0 spans? If so, this could corrupt ranks.

The suggested code is designed to use level[0].span exclusively for the node’s metadata. Needs to be verified it meets the expected behavior.

shahsb · 2025-12-28T10:08:03Z

src/t_zset.c

+/* Get the skiplist node from a dict entry. The dict value points to &node->score,
+ * so we use pointer arithmetic to get the node address. This enables O(1) lookup
+ * of the skiplist node from the hash table, used by optimized ZRANK. */
+static inline zskiplistNode *zsetGetSLNodeByEntry(dictEntry *de) {


Unit Tests: New TCL tests for ZRANK on large zsets (e.g., 1M elements, random inserts) verifying ranks match pre-PR. Include tail-node and max-height cases.

I verified it manually and also got covered by benchmark tests. Not sure if tcl is the right place to maintain such tests. ... The good news is that this kind of corruption is expected to be revealed fairly quickly.

sundb reviewed Dec 22, 2025

View reviewed changes

src/t_zset.c Outdated Show resolved Hide resolved

moticless added 2 commits December 22, 2025 09:55

Modify terminology from height to level

9179ad3

Merge branch 'unstable' into opt-zset-zrank

be6b769

shahsb suggested changes Dec 28, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize ZRANK by avoiding string comparisons during skiplist traversal #14636

Optimize ZRANK by avoiding string comparisons during skiplist traversal #14636

moticless commented Dec 21, 2025 •

edited

Loading

Uh oh!

Uh oh!

shahsb Dec 28, 2025

Uh oh!

moticless Dec 28, 2025

Uh oh!

shahsb Dec 28, 2025

Uh oh!

moticless Dec 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimize ZRANK by avoiding string comparisons during skiplist traversal #14636

Are you sure you want to change the base?

Optimize ZRANK by avoiding string comparisons during skiplist traversal #14636

Conversation

moticless commented Dec 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

shahsb Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

moticless Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

shahsb Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

moticless Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

moticless commented Dec 21, 2025 •

edited

Loading