Optimize Msm #796

ashWhiteHat · 2023-09-06T06:05:14Z · edited by daira

Msm Optimization

Hi there.
I optimized the best_multiexp algorithm and switched it to a more efficient rayon method.

What I did

Task Size

Before
The current implementation divides the work into (coeffs length / number of threads) parts.
The greater the parallelism used by rayon (by default given by the number of logical cores as determined by std::thread::available_parallelism), the smaller the task each thread processes.
As a result, each task is too heavy on machines with few cores and too small on machines with many cores.
After
Flatten the task size: each task is the bucket addition and the sum of buckets for one segment.
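
A minimal sketch of what "one task per segment" could look like, not the code from this PR. It assumes a toy group (u64 under wrapping addition) in place of curve points, and the signatures of `get_at`, `segment_sum` and `multiexp` here are hypothetical. Each call to `segment_sum` does the same amount of work regardless of the number of threads.

```rust
/// Toy group for illustration: u64 under wrapping addition.
/// In the PR the group elements are curve points.
type Point = u64;

/// Hypothetical helper: the `segment`-th window of `c` bits of `scalar`.
fn get_at(segment: usize, c: usize, scalar: u64) -> usize {
    let shift = segment * c;
    if shift >= 64 {
        return 0;
    }
    ((scalar >> shift) & ((1u64 << c) - 1)) as usize
}

/// One uniform task: bucket additions, then the sum of buckets, for one segment.
fn segment_sum(segment: usize, c: usize, coeffs: &[u64], bases: &[Point]) -> Point {
    // Bucket j collects every base whose digit in this segment is j + 1.
    let mut buckets: Vec<Point> = vec![0; (1 << c) - 1];
    for (coeff, base) in coeffs.iter().zip(bases) {
        let digit = get_at(segment, c, *coeff);
        if digit != 0 {
            buckets[digit - 1] = buckets[digit - 1].wrapping_add(*base);
        }
    }
    // Running sum: computes sum_j (j + 1) * buckets[j] using additions only.
    let (mut running, mut acc) = (0u64, 0u64);
    for bucket in buckets.iter().rev() {
        running = running.wrapping_add(*bucket);
        acc = acc.wrapping_add(running);
    }
    acc
}

/// Combines the per-segment results, most significant segment first.
fn multiexp(coeffs: &[u64], bases: &[Point], c: usize) -> Point {
    assert!((1..=16).contains(&c), "window size assumed small in this sketch");
    let segments = (64 + c - 1) / c;
    (0..segments).rev().fold(0u64, |acc, segment| {
        // Multiply the accumulator by 2^c via c doublings.
        let shifted = (0..c).fold(acc, |a, _| a.wrapping_add(a));
        shifted.wrapping_add(segment_sum(segment, c, coeffs, bases))
    })
}
```

For example, `multiexp(&[3, 5], &[7, 11], 4)` should equal `3 * 7 + 5 * 11` in this toy group.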

Parallelization Scope

Before
The current implementation performs the final sum in a non-parallel process.
After
Include the final sum in the parallel scope.

Rayon Method

Before
According to the official rayon documentation, the cost relation is as follows: join ≤ par_iter ≤ scope
After
Replace the scope in best_multiexp with par_iter.
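
The shape of the parallel version can be sketched as follows. This is only an illustration under stated assumptions: it uses `std::thread::scope` as a standard-library stand-in so the example has no dependencies (the PR itself uses rayon's par_iter), the group is a toy one (u64 under wrapping addition), the per-segment work is abbreviated to a multiply-and-add instead of real bucket arithmetic, and `segment_task` and `parallel_multiexp` are hypothetical names. The point it shows is that each task returns an already-summed value for its segment, so only the cheap combination of segment results stays outside the parallel part.

```rust
use std::thread;

/// Hypothetical per-segment task on a toy group (u64 under wrapping addition).
/// The bucket addition and bucket sum are abbreviated to digit * base here.
fn segment_task(segment: usize, c: usize, coeffs: &[u64], bases: &[u64]) -> u64 {
    let mask = (1u64 << c) - 1;
    coeffs.iter().zip(bases).fold(0u64, |acc, (coeff, base)| {
        let digit = (*coeff >> (segment * c)) & mask;
        acc.wrapping_add(digit.wrapping_mul(*base))
    })
}

/// Runs one task per segment in parallel, then combines the results.
fn parallel_multiexp(coeffs: &[u64], bases: &[u64], c: usize) -> u64 {
    assert!(c > 0 && c < 64 && 64 % c == 0, "this sketch assumes c divides 64");
    let segments = 64 / c;

    // Each spawned task computes the complete sum for its own segment.
    let partials: Vec<u64> = thread::scope(|s| {
        let handles: Vec<_> = (0..segments)
            .map(|segment| s.spawn(move || segment_task(segment, c, coeffs, bases)))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    });

    // Only this combination remains sequential: c doublings per segment.
    partials.iter().rev().fold(0u64, |acc, part| {
        (0..c).fold(acc, |a, _| a.wrapping_add(a)).wrapping_add(*part)
    })
}
```

With rayon, the `thread::scope` block would presumably become a parallel iterator over the segment indices, which avoids spawning tasks by hand.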

Other

Refactored Bucket and get_at.

Benchmark

I ran the best_multiexp benchmark twice, once in the order after -> before and once in the order before -> after, so that machine conditions were the same for both.

First

k	After (low / estimate / high)	Before (low / estimate / high)	Change
8	2.0513 ms 2.1645 ms 2.3813 ms	3.2626 ms 3.3768 ms 3.4963 ms	+56.013%
9	4.3738 ms 4.8203 ms 5.3524 ms	5.4233 ms 5.8586 ms 6.2619 ms	+29.595%
10	6.3364 ms 6.7132 ms 7.1306 ms	8.5226 ms 9.1460 ms 9.9017 ms	+45.833%
11	10.845 ms 11.595 ms 12.519 ms	15.413 ms 16.760 ms 18.197 ms	+54.984%
12	19.057 ms 21.092 ms 22.951 ms	26.509 ms 28.703 ms 31.225 ms	+43.060%
13	32.293 ms 33.667 ms 35.468 ms	39.831 ms 40.534 ms 41.308 ms	+20.397%
14	56.018 ms 57.386 ms 58.956 ms	67.785 ms 70.827 ms 74.546 ms	+23.423%

Second

k	Before (low / estimate / high)	After (low / estimate / high)	Change
8	3.3917 ms 3.5141 ms 3.6436 ms	2.1795 ms 2.2404 ms 2.3090 ms	-36.243%
9	5.8065 ms 6.2340 ms 6.6611 ms	3.8219 ms 4.2643 ms 4.7574 ms	-34.941%
10	9.2216 ms 9.8577 ms 10.495 ms	6.2926 ms 6.6160 ms 6.9694 ms	-36.077%
11	13.582 ms 14.203 ms 15.074 ms	10.587 ms 11.265 ms 11.983 ms	-24.557%
12	24.982 ms 27.123 ms 29.894 ms	17.488 ms 17.926 ms 18.550 ms	-32.419%
13	48.558 ms 52.740 ms 57.494 ms	33.177 ms 34.301 ms 35.544 ms	-34.961%
14	71.452 ms 74.974 ms 78.797 ms	58.742 ms 61.173 ms 63.892 ms	-18.408%

I would appreciate it if you could confirm.
Thank you.

str4d

ACK 27fddaf. Thanks!

halo2_proofs/src/arithmetic.rs

str4d

ACK 7de8af3

str4d · 2025-12-04T18:58:07Z

halo2_proofs/src/arithmetic.rs

+                let mut acc = C::Curve::identity();
+                let mut sum = C::Curve::identity();
+                buckets.iter().rev().for_each(|b| {
+                    sum = b.add(sum);


That doesn't work, because sum has type C::Curve, not Bucket, and we can't impl<C: Curve> AddAssign<Bucket<C::AffineRepr>> for C because it would be a foreign impl.

EDIT: I now see you edited your suggestion, and that one works fine.
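
To illustrate the point about types, here is a small sketch with toy stand-ins; it is not the code under review. `Affine`, `Projective` and this non-generic `Bucket` are hypothetical simplifications (u64 wrappers with wrapping addition). In the real code the accumulator type comes from a generic parameter, so writing `sum += b` would need an `AddAssign` impl on a type that is not local to the crate, which Rust's coherence (orphan) rules reject. An inherent method on the local `Bucket` type avoids that.

```rust
/// Toy stand-ins for affine and projective curve points (hypothetical).
#[derive(Clone, Copy)]
struct Affine(u64);

#[derive(Clone, Copy)]
struct Projective(u64);

/// A bucket is empty, holds a cheap affine point, or holds a projective point.
#[derive(Clone, Copy)]
enum Bucket {
    None,
    Affine(Affine),
    Projective(Projective),
}

impl Bucket {
    /// Adds the bucket's contents to a projective value.
    /// This is an inherent method on a local type, so no trait impl on a
    /// foreign or generic type is needed.
    fn add(self, other: Projective) -> Projective {
        match self {
            Bucket::None => other,
            Bucket::Affine(a) => Projective(other.0.wrapping_add(a.0)),
            Bucket::Projective(p) => Projective(other.0.wrapping_add(p.0)),
        }
    }
}

/// Running sum over the buckets: computes sum_j (j + 1) * buckets[j].
fn running_sum(buckets: &[Bucket]) -> Projective {
    let mut acc = Projective(0);
    let mut sum = Projective(0);
    buckets.iter().rev().for_each(|b| {
        // `sum` stays a Projective; the bucket is folded into it.
        sum = b.add(sum);
        acc = Projective(acc.0.wrapping_add(sum.0));
    });
    acc
}
```

In this toy version `Projective` is a local type, so the failing impl cannot be reproduced directly; the sketch only shows the shape of the working alternative.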

str4d · 2025-12-04T19:05:36Z

halo2_proofs/src/arithmetic.rs

-                    }
-                    Bucket::Projective(a) => other + &a,
-                }
+    fn add(self, mut other: C::Curve) -> C::Curve {


Bucket::add was an existing function that is preserved in this PR, so in the interest of not needing to go through another round of CI, I'm going to defer this internal rename.

nuttycom

utACK 7de8af3

nuttycom · 2025-12-04T19:16:31Z

halo2_proofs/src/multicore.rs

+    type Item;
+
+    /// Combines the best of `std::iter` and `rayon` reductions.
+    fn the_best_reduce(


🙃 at the name, but this is fine.

I needed something that was not likely to clash in future with a std or rayon name 😄
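
For context, the mismatch is that `Iterator::reduce` takes only a combining function and returns an `Option`, while rayon's `ParallelIterator::reduce` takes an identity function plus a combining function and returns the item directly. A rough sketch of a helper with one call shape for both follows. Only the trait's `type Item` and the method name appear in the diff above; the trait name `TheBestReduce`, the parameter list, and the blanket impl for `Iterator` are assumptions made for illustration, and the rayon side is omitted to keep the example dependency-free.

```rust
/// Hypothetical trait name; the method name is taken from the diff.
trait TheBestReduce {
    type Item;

    /// Reduces with an identity and an operator (rayon-style arguments).
    fn the_best_reduce(
        self,
        identity: impl Fn() -> Self::Item,
        op: impl Fn(Self::Item, Self::Item) -> Self::Item,
    ) -> Self::Item;
}

/// Assumed sequential implementation on top of `Iterator::reduce`.
impl<I: Iterator> TheBestReduce for I {
    type Item = I::Item;

    fn the_best_reduce(
        self,
        identity: impl Fn() -> I::Item,
        op: impl Fn(I::Item, I::Item) -> I::Item,
    ) -> I::Item {
        // `Iterator::reduce` returns None for an empty iterator;
        // fall back to the identity in that case.
        self.reduce(op).unwrap_or_else(identity)
    }
}
```

With this in scope, `vec![1u64, 2, 3].into_iter().the_best_reduce(|| 0, |a, b| a + b)` yields the sum, and an empty iterator yields the identity.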

ashWhiteHat added 3 commits September 6, 2023 10:46

arithmetic::best_multiexp add benchmark

af1713f

arithmetic::best_multiexp parallelize bucket arithmetic

e00f0d1

arithmetic::best_multiexp refactor buckets

24e3ec3

kilic mentioned this pull request Jan 23, 2024

MSM optimisations: CycloneMSM privacy-ethereum/halo2curves#130

Merged

Add test for best_multiexp

27fddaf

str4d approved these changes Dec 4, 2025

str4d requested changes Dec 4, 2025

halo2_proofs: Add helper to resolve Iterator::reduce mismatch

7de8af3

str4d force-pushed the feature/msm-optimization branch from d897fd4 to 7de8af3 on December 4, 2025 16:40

str4d approved these changes Dec 4, 2025

str4d reviewed Dec 4, 2025

nuttycom approved these changes Dec 4, 2025

str4d merged commit 61cce59 into zcash:main Dec 4, 2025
22 checks passed
