Fix: hybrid bench sort issues #209

ashbob999 · 2025-01-05T22:03:24Z

Fixes issues found in the hybrid bench sort function:

Initial string bytes were inserted in the wrong order.
The order vector was being erased (because offset_in_word was 0).

These bugs made the function appear to perform better than it actually did.

Also adds logic to split the single final sort into multiple smaller sorts; it may not be the most efficient or tidy approach. It is needed because, once fixed, these functions performed similarly to or worse than the equivalent stdvm version; they are now around 2x faster than their alternatives.

Fixes: #208

ashvardanian · 2025-01-05T22:09:33Z

scripts/bench_sort.cpp

+    for (size_t i = 0; i != strings.size(); ++i) {
+        size_t index = order[i];
+
+        for (size_t j = 0; j < std::min<std::size_t>(strings[(sz_size_t)index].size(), 4ul); ++j) {


This is just a memcpy and byte order reversal, right? Should be faster without loops.

Yes. Do you have an example of how to do it without loops?
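For reference, a loop-free version could look roughly like the sketch below. It is not the code from this PR: `load_prefix_key` and `pack_order_entry` are hypothetical helpers, and the layout (prefix in the upper 32 bits, string index in the lower 32 bits) is an assumption made for illustration. The shifts build a big-endian key, so the result does not depend on host endianness and no separate byte swap is needed.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string>

// Hypothetical helper (not from the PR): packs the first up-to-4 bytes of `s`
// into a 32-bit key whose integer order matches lexicographic byte order.
inline std::uint32_t load_prefix_key(std::string const &s) {
    unsigned char buf[4] = {0, 0, 0, 0}; // zero padding for strings shorter than 4 bytes
    std::memcpy(buf, s.data(), std::min<std::size_t>(s.size(), 4)); // one copy, no per-byte loop
    // First byte goes into the most significant position.
    return (std::uint32_t(buf[0]) << 24) | (std::uint32_t(buf[1]) << 16) |
           (std::uint32_t(buf[2]) << 8) | std::uint32_t(buf[3]);
}

// Hypothetical helper: assumed layout is key in the upper 32 bits, index in the lower 32 bits.
inline std::uint64_t pack_order_entry(std::uint32_t key, std::uint32_t index) {
    return (std::uint64_t(key) << 32) | index;
}
```

With this layout, comparing two packed entries as plain 64-bit integers orders them by prefix first and by index second.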

ashvardanian · 2025-01-05T22:13:07Z

scripts/bench_sort.cpp


    std::sort(order, order + strings.size(), [&](sz_u64_t i, sz_u64_t j) {
        char *i_bytes = (char *)&i;
        char *j_bytes = (char *)&j;
        return *(uint32_t *)(i_bytes + offset_in_word) < *(uint32_t *)(j_bytes + offset_in_word);
    });

-    for (size_t i = 0; i != strings.size(); ++i) std::memset((char *)&order[i] + offset_in_word, 0, 4ul);
+    const auto extract_bytes = [](sz_u64_t v) -> uint32_t {


I’m not sure if I understand the purpose of the following part. Can you please clarify?

Once I had fixed the byte order issues, these functions were performing similarly to their alternatives.

It turns out the final sort call was the slowest part. Since the strings are already partially sorted, we only need to sort each unsorted subgroup (all strings with the same first 4 chars).

To do this, we need to keep the prefix bytes in the order entries while we find the start and end of each group to sort. The extract_bytes lambda just makes it easier to get those first 4 string bytes.

I know it's not the prettiest/simplest code.
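To illustrate the idea, here is a simplified sketch of sorting only the runs that share a prefix. It is not the PR's implementation: `prefix_of`, `index_of`, and `sort_equal_prefix_groups` are hypothetical names, and it assumes each order entry holds the 4-byte prefix in its upper 32 bits and the string index in its lower 32 bits, with the entries already sorted by prefix.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for the `extract_bytes` lambda (assumed layout: prefix in the upper 32 bits).
inline std::uint32_t prefix_of(std::uint64_t entry) { return std::uint32_t(entry >> 32); }
// Hypothetical accessor for the string index stored in the lower 32 bits.
inline std::uint32_t index_of(std::uint64_t entry) { return std::uint32_t(entry & 0xFFFFFFFFu); }

// Assumes `order` is already sorted by prefix. Each run of equal prefixes
// is then sorted on its own using a full string comparison.
inline void sort_equal_prefix_groups(std::vector<std::uint64_t> &order,
                                     std::vector<std::string> const &strings) {
    std::size_t begin = 0;
    while (begin < order.size()) {
        // Find the end of the run that shares the prefix of order[begin].
        std::size_t end = begin + 1;
        while (end < order.size() && prefix_of(order[end]) == prefix_of(order[begin])) ++end;
        // Runs of length 1 are already in their final position.
        if (end - begin > 1)
            std::sort(order.begin() + begin, order.begin() + end,
                      [&](std::uint64_t a, std::uint64_t b) {
                          return strings[index_of(a)] < strings[index_of(b)];
                      });
        begin = end;
    }
}
```

When most prefixes are distinct, the majority of runs have length 1 and are skipped, so only the few colliding groups pay for full string comparisons.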

ashbob999 added 2 commits January 5, 2025 20:14

Fix: stable sort bench tests failing

821d19e

Fix: hybrid bench sorts loading initial stirng bytes incorrectly

455508f

ashvardanian reviewed Jan 5, 2025

Improve: hybrid bench sort performance

9880f26

ashbob999 force-pushed the fix-hybrid-sort branch from e6863bb to 9880f26 Compare January 5, 2025 22:26
