feat: Generalised Minkowski Metric (L_p norm) by cbueth · Pull Request #287 · sdd/kiddo

cbueth · 2026-02-08T00:59:16Z

This PR builds on #286 and adds the power metric $\mathcal{L}_p^p$. If a user wants Minkowski distances, they can use $\mathcal{L}_p^p$ and then take the $\sqrt[p]{\text{dist}}$, similar to SquaredEuclidean which is $\mathcal{L}_2^2$. While this is not depending on the max-based changes from the previous PR, it uses the metric testing infrastructure to reassure correct behaviour.

The new metrics are:

Minkowski<const P: u32>: An implementation for integer powers using powi. It should be significantly faster than the floating-point version.
MinkowskiF64<const P_BITS: u64>: A flexible implementation for arbitrary floating-point powers (like $P=0.5$), utilizing a const-generic bit-representation to remain compatible with stable Rust.

I have written the docstrings for correct usage, guiding users toward Manhattan ($P=1$) and SquaredEuclidean ($P=2$) for common cases, as these might have a performance advantage. At compile time it's checked if the given P is 1 or 2. The new metrics easily satisfy the common_metric_tests suite from #286, verifying mathematical properties (non-negativity, identity, symmetry) across dimensions 1–5. They also work with the nearest-neighbor tests (Gaussian, ties, no-ties) for both integer and fractional powers. Furthermore, I added documentation aliases for "taxicab", "l1", "l2", and "euclidean" to help users find the correct metric via standard mathematical terminology.

depends on:

feat(WIP): Add Chebyshev distance ($L_∞$ norm) support #286

…ded in Rust 1.54 instead

fixes bug where nearest_n_within accessed self.content_items instead of remainder_items for remainder elements, causing incorrect results when dataset size % CHUNK_SIZE != 0. Also removed unnecessary unsafe code in best_n_within. Signed-off-by: Markus Zoppelt <markus.zoppelt@helsing.ai>

tests nearest_n_within with size-33 dataset to verify items in remainder region are found correctly. Before the fix, this would access self.content_items[0] instead of remainder_items[0], returning wrong items. Signed-off-by: Markus Zoppelt <markus.zoppelt@helsing.ai>

If leaf_items.len() exceeds u32::MAX (~4.3 billion), this silently truncates. For datasets with billions of points, this is realistic and causes severe corruption.

* release-plz checkout depth fixed so that full changelogs are generated * add commitlint with conventional commits config

…thin_unsorted_iter within_unsorted_iter is modified to decouple the lifetime of the iterator from that of the query by performing a generally very cheap copy just once at the start of the query

see RustCrypto/utils#1304

codecov · 2026-02-08T19:08:13Z

Codecov Report

❌ Patch coverage is 96.01911% with 25 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.03%. Comparing base (2056051) to head (77d5bfc).
⚠️ Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
src/fixed/distance.rs	84.68%	15 Missing and 2 partials ⚠️
src/float/distance.rs	98.36%	3 Missing and 5 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #287      +/-   ##
==========================================
+ Coverage   94.89%   95.03%   +0.13%     
==========================================
  Files          54       54              
  Lines        5705     6307     +602     
  Branches     5705     6307     +602     
==========================================
+ Hits         5414     5994     +580     
- Misses        273      288      +15     
- Partials       18       25       +7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

sdd · 2026-02-08T19:20:30Z

src/float/distance.rs

+                "MinkowskiF64<P as F64> with power that is basically integer. Consider using Minkowski<P as u32> instead.",
+            );
+        }
+    };


It may be worth adding a check here that P is finite and not NaN when cast back to f64

sdd · 2026-02-08T19:23:52Z

src/float/distance.rs

    #[inline]
+    #[allow(clippy::let_unit_value)]
    fn dist(a: &[A; K], b: &[A; K]) -> A {
+        let _ = Self::CHECK_P;


Can we add a small comment to indicate that this is a compile-time-only check thanks to the const implementation

sdd · 2026-02-08T19:26:55Z

Hi Carlson,

Another great PR 😄 Thanks for this one also. I agree, there are likely to be some optimisations that can be made for the more common L1 and L2 cases and so I like that you've kept separate definitions for those rather than them being re-implemented purely as type aliases to Minkowski<1> and Minkowski<2>. Regarding the float power variant, I'm going to experiment with some of these changes on the v6.x branch and see how well they fit in.

As with the other PR, can we raise this one targeting the v5.x.x branch rather than master as well if possible.

Thanks again!

…lidean distance metrics

- deprecate `rd_update` with `D::accumulate` for consistent handling of sum-based and max-based metrics - conditional logic for SIMD (L1/L2) and general L∞ - differentiate distance accumulation behaviour

- integration `nearest_n` tests (Chebyshev, Manhattan, SquaredEuclidean).

…doc, add Gaussian scenario to tests

… trait - improve test coverage

…fy usage scenarios

…rised power

sdd and others added 22 commits December 8, 2025 07:07

ci: Update CI workflow triggers to include PR and workflow_dispatch

aac7f18

style: remove unnecessary parentheses

46b0c56

ci: permit coverage to run for PRs as well

809d457

deps: remove doc-comment dependency and use doc attribute that was ad…

ed3b7d7

…ded in Rust 1.54 instead

chore: use doc attribute instead of doc_comment!

18c8bb3

style: fix formatting

b4a40ce

fix: use try_from() with error for leaf_items.len()

6c5bcbf

If leaf_items.len() exceeds u32::MAX (~4.3 billion), this silently truncates. For datasets with billions of points, this is realistic and causes severe corruption.

chore(deps): update actions/checkout action to v6

f3f2ec3

chore(deps): update codspeedhq/action action to v4

5e3ee0f

chore(deps): update ad-m/github-push-action action to v1

695c97c

chore(deps): update rust crate rstest to 0.26

9a1e218

chore(deps): update rust crate codspeed-criterion-compat to v4

db1fbe9

ci: fix release-plz and add commitlint

aa7b565

* release-plz checkout depth fixed so that full changelogs are generated * add commitlint with conventional commits config

Added WithinUnsortedIterOwned

6c2940b

fix: update to use transform function

ba36d8d

refactor: remove within_unsorted_iter_owned in favour of modifying wi…

7b88de8

…thin_unsorted_iter within_unsorted_iter is modified to decouple the lifetime of the iterator from that of the query by performing a generally very cheap copy just once at the start of the query

docs: update Cargo.toml, changelog and docs for 5.2.3

6e1afdd

deps: bump cmov to 0.4 as all other versions were yanked

e746edf

see RustCrypto/utils#1304

docs: update changelog, readme, and Cargo.toml for 5.2.4 release

bc89bc5

chore: update changelog

f87b965

sdd reviewed Feb 8, 2026

View reviewed changes

cbueth mentioned this pull request Feb 8, 2026

feat(WIP): Add Chebyshev distance ($L_∞$ norm) support #286

Open

cbueth added 3 commits February 12, 2026 22:56

test: Add coverage for Manhattan and Squared Euclidean distance metrics

47db176

feat: Add Chebyshev distance metric and test coverage

8178908

test: add integration tests for Chebyshev, Manhattan, and Squared Euc…

57dff79

…lidean distance metrics

cbueth added 13 commits February 12, 2026 22:56

inclusive radius matching and leaf note remainder loops

b1fe052

fix over-pruning for L_inf

4a59cb8

Add D::accumulate and D::IS_MAX_BASED

28859a9

- deprecate `rd_update` with `D::accumulate` for consistent handling of sum-based and max-based metrics - conditional logic for SIMD (L1/L2) and general L∞ - differentiate distance accumulation behaviour

feat: add fixed Chebyshev distance metric

c8dcb3e

- integration `nearest_n` tests (Chebyshev, Manhattan, SquaredEuclidean).

refactor: in-loop accumulation for max-based metrics

3f32f6b

unify distance accumulation logic with D::accumulate

93b98c1

remove D::IS_MAX_BASED, unify heap logic, improve DistanceMetric …

c934656

…doc, add Gaussian scenario to tests

change test comment & lint

fa3361f

refactor: make metric property tests reusable

7ebb80b

chore: add default implementation of accumulate to DistanceMetric…

44a0f1e

… trait - improve test coverage

feat: add Minkowski and MinkowskiF64 distance metrics with tests

5c424bb

compile-time validations and doc aliases for Minkowski metrics, clari…

cca6e57

…fy usage scenarios

integration nearest_n tests (Minkowski, MinkowskiF64) with paramete…

8970913

…rised power

cbueth force-pushed the feature/minkowski branch from 77d5bfc to 8970913 Compare February 12, 2026 23:55

cbueth added 2 commits February 13, 2026 01:04

chore: saturating add for fixed metrics

7adffe8

Merge branch 'feature/chebychev' into feature/minkowski

d7bebf0

cbueth mentioned this pull request Feb 13, 2026

feat: Generalised Minkowski Metric (L_p norm) #291

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: Generalised Minkowski Metric (L_p norm)#287

feat: Generalised Minkowski Metric (L_p norm)#287
cbueth wants to merge 40 commits intosdd:masterfrom
cbueth:feature/minkowski

cbueth commented Feb 8, 2026 •

edited

Loading

Uh oh!

codecov bot commented Feb 8, 2026

Uh oh!

sdd Feb 8, 2026

Uh oh!

sdd Feb 8, 2026

Uh oh!

sdd commented Feb 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Comments

Conversation

cbueth commented Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

depends on:

Uh oh!

codecov bot commented Feb 8, 2026

Codecov Report

Uh oh!

sdd Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

sdd Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

sdd commented Feb 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

cbueth commented Feb 8, 2026 •

edited

Loading