Error Handling, Update Stability, Improved Java SDK #402

ashvardanian · 2024-05-05T01:38:14Z

USearch implementation had 2 layers, the core HNSW structure implemented in index.hpp and the high-level wrapper for dense equidimensional vectors in index_dense.hpp. In this release, we've made the top layer thinner and cleaner, also making this APIs more similar, error-handling more consistent, and builds faster.

Reducing Dependencies & Accelerating Builds

Previously index_dense.hpp had the following STL includes:

#include <thread> // `std::thread`
#include <functional> // `std::function`
#include <vector> // `std::vector`
#include <numeric> // `std::iota`

Those are some of the most common includes in C++ projects, also the ones I like the least. Here are a couple of reasons to hate them, taken from the "C++ Compile Health Watchdog":

name	compilation time	lines of code	binary size
`<functional>`	82 .. 228 ms	12.9 .. 27.4 kLoC	0 .. 141 kB
`<vector>`	32 .. 48 ms	7.1 .. 8.0 kLoC	0 .. 8.2 kB
`<numeric>`	7 .. 13 ms	1.6 .. 2.1 kLoC	0 .. 3.3 kB
`<thread>`	110 .. 189 ms	17.5 .. 20.3 kLoC	0 .. 153 kB

Improved Java, C#, and Swift SDK

Thanks to @mccullocht, the Java SDK has been extended with get, loadFromPath, and viewFromPath APIs.

Usage examples to come :)

Other SDK improvements include:

Better docs for C#
Better test coverage for Swift

Reduced Memory Consumption for DBMS-like Users

Most databases using USearch, would prefer to have a smaller index at the cost of some functionality. As mentioned by @rschu1ze, if enable_key_lookups is disabled, and some external DBMS is responsible for "key to vector" mappings, the memory consumption of the index_dense_gt can be further reduced.

Improving Stability

In update-heavy workloads, it's possible to encounter an undefined behavior, leading to a corrupt index state.
In aligned-alloc an integer overflow was resolved by @antonio2368.
Search is now exhaustive by default, so if you have ≥10 items in the index and ask for 10 closest vectors, you are expected to get 10, not ≤10.
On GCC 12 and older the #pragma region warning was resolved by @mbautin.
Casts to uint40_t slot IDs have been fixed by @Ngalstyan4.
Casts to b1x8_t vectors have been fixed by @Ngalstyan4.
Python type annotation has been added by @jamesbraza.
C++ example was patched by @SheldonFung98.
JavaScript duplicate exports patched by @eryue0220 & @johnhorsema.

Fixes ClickHouse/ClickHouse#61780 Co-authored-by: Antonio Andelic <[email protected]>

Relates to the #377 and the comment: #377 (comment) This temporarily disables the failing CI pipeline to generate and update docs.

This commit drops `std::vector` dependency, making compilation time shorter and error handling universal across abstraction layers.

In the past, if we got "too lucky" traversing the graph, we could exit early before accumulating K top matches, even if the index had more than K entries. This patch changes that behavior, making output more predicatable.

This patch addresses the issue #399, originally observed in the Swift layer. Reimplementing it in C++ helped locate the issue and lead to refactoring the `update` procedure in the lowest-layer `index_gt`. Now, `add` and `update` share less code. The `add` is one branch shorter (not that it would be noticeable), and `update` brings additional logic to avoid spilling `updated_slot` into top-results and consequently introducing self-loops. Closes #399

… main-dev

Relates to #355

Both `view` and `load` would `reset` the thread contexts. After that, the very first `search` and `add` would fail, as no thread-local contexts are initialized. It would require a `reserve` call with a non-zero second arcgument to define the number of concurrent threads, for which the queues & buffers need to be allocated. That design is counter-intuitive, so this patch re-inits the same number of threads as before the `load` & `view` or one, if none existed.

Co-authored-by: Ash Vardanian <[email protected]>

Co-authored-by: Adolfo Garcia <[email protected]>

… main-dev

As noted by Robert Schulze, we can avoid populating `slot_lookup_` during insertions, if `enable_key_lookups` is not set. This would lead to lower memory consumption for large indexes of tiny vectors, particularly common in GIS. Co-authored-by: Robert Schulze <[email protected]>

… main-dev

rschu1ze · 2024-05-22T21:30:40Z

@ashvardanian I am happy to see this PR getting merged but it is getting bigger and bigger :-) Question: could we get the stability fixes + the reduced mem consumption commits merged as separate PRs into Usearch's main branch? None of them seem to be breaking, and it would be nice to accelerate things a bit.

ashvardanian · 2024-05-22T23:44:26Z

Agreed @rschu1ze, will merge soon.

ashvardanian · 2024-05-23T22:15:05Z

@rschu1ze, there is a weird behavior on very large queries, where the distance to matches isn't monotonically decreasing in some cases. The new extended tests catch it, but I am not sure if that issue was present in the past. I'd really like at least that one issue to be resolved before merging.

Everyone is welcome to join the bug hunt 🤗

Ngalstyan4 · 2024-05-25T03:37:13Z

#422 might be related. It had caused similar symptoms in lantern - distances would not be monotonically decreasing in index search results.
This was caught in the development branch of lantern, once we added quantization support and tests:

-- create an 8-bit quantized index
CREATE INDEX ind8 ON sift_base1k USING lantern_hnsw (v_scaled) WITH (dim=128, M=8, quant_bits=8);
-- look for vectors closest to the vector with id 42
SELECT id, v_scaled <-> :'v_scaled42' as dist FROM sift_base1k ORDER BY 2 LIMIT 10;

Below is the diff with the bug fix

diff -u /lantern_shared/test/expected/hnsw_sq.out /tmp/lantern/tmp_output/results/hnsw_sq.out
--- /lantern_shared/test/expected/hnsw_sq.out   2024-05-24 04:43:03.172237129 +0000
+++ /tmp/lantern/tmp_output/results/hnsw_sq.out 2024-05-25 03:18:11.796896512 +0000
@@ -106,15 +106,15 @@
  id  |   dist
 -----+-----------
   42 |         0
- 285 |   16.7111
- 261 |   16.0935
- 195 | 17.061296
-  48 | 5.1038003
-  50 | 11.509201
   36 |     1.053
-  46 | 10.790699
- 216 | 18.992302
+  48 | 5.1038003
   39 |  5.626501
+ 886 |  7.163699
+ 402 | 7.7013006
+ 518 |  8.502399
+ 331 |  8.779598
+ 340 |  9.726098
+  46 | 10.790699
 (10 rows)

Previously we were down-casting floats to the target type (e.g. int8_t), and then clamping to [-100, 100] range. This means that e.g. 129 would be cast to -127 and then converted to -100, in stead of becoming 100 The fix does clamping first, and then casts the resulting number (which is guaranteed to be in range [-100, 100], due to clamping) from source type to target int8_t. Given the clamping, this will never overflow. --------- Co-authored-by: Ash Vardanian <[email protected]>

ashvardanian · 2024-05-25T20:59:30Z

Thanks for the patch, @Ngalstyan4! I've merged it, but it doesn't solve the issue we have with tests right now. We still have a failing assertion at cpp/test.cpp:426 where we check that the output distances are monotonically increasing for long-tail queries, like getting top-500 closest neighbors. Help always appreciated 🤗

When calling `index_dense_gt`, the thread lock was not propagating with the `search_result_t`. That is a an error-prone API. When too many threads are running in parallel (ideally, more than physical CPU cores) another thread may start reusing the `context_t` before the original caller finishes exporting entries with `dump_to`. This solution is backwards compatible and passes the tests.

We can't yet rely on the SemVer tool semantic-release/release-notes-generator#633 (comment)

Tracing implicit conversions of `std::uint32_t` and other primitive types isn't always easy in concurrent apps. This commit adds support for `enum` types to be used for safer implementation of `index_gt` specializations.

Adding type annotation for Python native modules solves the `Skipping analyzing "usearch.index" module` warning due to `missing library stubs or py.typed marker`. Closes #424

When converting floating point arrays to binary, we use bitwise OR operations to set the relevant bits in the output buffer to 1. We do nothing if the bit is zero, so we assume that the bit is zero to start with. The `memset` statement makes sure this assumption holds.

Closes #418 Closes #426

Fix: build.gradle deprecations

Closes #426 Relates to #420

Closes #432

In high-connectivity graphs, the number of distance computations can be dominated by the number of "refine" heuristic computations performed by the core structure. The extended `add_result_t` now includes both: - `computed_distances_in_refines` - `computed_distances_in_reverse_refines` This commit also extends the documentation.

… main-dev

Co-authored-by: Ash Vardanian <[email protected]>

This indirectly fixes the crash in C# layer

… main-dev

jaysenmarais and others added 14 commits April 29, 2024 00:46

Improve: Swift test for issue #399 (#400)

244dea6

Fix: Integer overflow in aligned-alloc

9c16dba

Fixes ClickHouse/ClickHouse#61780 Co-authored-by: Antonio Andelic <[email protected]>

Make: Disable Windows NPM builds

291c8f1

Relates to the #377 and the comment: #377 (comment) This temporarily disables the failing CI pipeline to generate and update docs.

Fix: Going beyond level 0 in clustering

1e24eb9

Improve: Error handling in index_dense_gt

1b4de90

This commit drops `std::vector` dependency, making compilation time shorter and error handling universal across abstraction layers.

Improve: Remove std::function calls

6d4f43f

Improve: Remove std::thread from index_dense_gt

07300a6

Improve: std::vector -> buffer_gt in plugins

4aea1f7

Add: usearch_change_threads_search

fa09c8b

Fix: index_dense_t::make(path)

bcc27c4

Fix: Exhastive Search

ffa5986

In the past, if we got "too lucky" traversing the graph, we could exit early before accumulating K top matches, even if the index had more than K entries. This patch changes that behavior, making output more predicatable.

Fix: Misc warnings & compilation issues

8490f1d

Fix: Misc warnings & compilation issues

30a490b

ashvardanian force-pushed the main-dev branch from 8490f1d to 30a490b Compare May 8, 2024 01:15

ashvardanian and others added 15 commits May 8, 2024 02:49

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

3e01bb3

… main-dev

Fix: Detect ring_gt being full

8a2515f

Relates to #355

Fix: uint32_t to uint40_t cast (#404)

37b47b0

Co-authored-by: Ash Vardanian <[email protected]>

Docs: Mention b1 in README.md

a1cf33e

Co-authored-by: Adolfo Garcia <[email protected]>

Docs: Cover new users

f390b16

Improve: Updates stability & catch bug

35e7326

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

c3930b4

… main-dev

Fix: Dereferencing member_iterator_t

1f93994

Add: Java get API (#407)

e50d698

Fix: Compilation with uint40_t keys

d9daeec

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

1a38509

… main-dev

Add: AutoClosable using c_destroy for Java (#408)

70c2669

Fix: Rare deadlock on tiny collections

0e1c2a5

ashvardanian added 2 commits May 21, 2024 05:24

Docs: Rearrange

fef9234

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

ee63e64

… main-dev

ashvardanian added 3 commits May 23, 2024 06:44

Fix: Reserving contexts post-reload

8b28bfa

Improve: Detect more failures in tests

8428b7f

Improve: Log failing lines

c453582

ashvardanian and others added 13 commits May 25, 2024 23:36

Make: Manually bump version to 2.13.0

78c677a

We can't yet rely on the SemVer tool semantic-release/release-notes-generator#633 (comment)

Improve: Support enum slots

8d2e495

Tracing implicit conversions of `std::uint32_t` and other primitive types isn't always easy in concurrent apps. This commit adds support for `enum` types to be used for safer implementation of `index_gt` specializations.

Add: MANIFEST.in for py.typed (#425)

8fb6b9f

Adding type annotation for Python native modules solves the `Skipping analyzing "usearch.index" module` warning due to `missing library stubs or py.typed marker`. Closes #424

Fix: esm duplicate import bug in jest (#420)

782fb3d

Closes #418 Closes #426

Fix: build.gradle deprecations

8a628b5

Merge pull request #431 from MarkReedZ/fix_gradle_deprecation

81c4ccb

Fix: build.gradle deprecations

Fix: ESM build support (#433)

3ad49e7

Closes #426 Relates to #420

Fix: capacity() assertion in Rust (#436)

72f8ce0

Closes #432

Fix: Computing stats(i).max_edges

e678678

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

032556d

… main-dev

ashvardanian mentioned this pull request Jun 10, 2024

Bug: On the SIFT1M test set, the speed of Usearch is much slower than HNSWlib #439

Open

3 tasks

ashvardanian and others added 5 commits June 19, 2024 21:29

Docs: Fix spelling

5def153

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

2cc8fe5

… main-dev

Fix: Wolfram bindings (#437)

00032d0

Co-authored-by: Ash Vardanian <[email protected]>

Fix: Pre-reserve enough threads for C users

527e69f

This indirectly fixes the crash in C# layer

Merge branch 'main-dev' of https://github.com/unum-cloud/usearch into…

df20d05

… main-dev

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error Handling, Update Stability, Improved Java SDK #402

Error Handling, Update Stability, Improved Java SDK #402

ashvardanian commented May 5, 2024 •

edited

Loading

rschu1ze commented May 22, 2024

ashvardanian commented May 22, 2024

ashvardanian commented May 23, 2024

Ngalstyan4 commented May 25, 2024

ashvardanian commented May 25, 2024 •

edited

Loading

Error Handling, Update Stability, Improved Java SDK #402

Are you sure you want to change the base?

Error Handling, Update Stability, Improved Java SDK #402

Conversation

ashvardanian commented May 5, 2024 • edited Loading

Reducing Dependencies & Accelerating Builds

Improved Java, C#, and Swift SDK

Reduced Memory Consumption for DBMS-like Users

Improving Stability

rschu1ze commented May 22, 2024

ashvardanian commented May 22, 2024

ashvardanian commented May 23, 2024

Ngalstyan4 commented May 25, 2024

ashvardanian commented May 25, 2024 • edited Loading

ashvardanian commented May 5, 2024 •

edited

Loading

ashvardanian commented May 25, 2024 •

edited

Loading