Add Distance over Gradients Stepsize #505

Nimrais · 2025-09-05T13:25:12Z

This PR adding a new stepsize schedule — Distance over Gradients (RDoG). RDoG is a learning-rate-free stepsize that adapts automatically without hyperparameter tuning https://arxiv.org/pdf/2406.02296.

You can use it like any other stepsize schedule.

using Manopt
using Manifolds
using LinearAlgebra
using Random

Random.seed!(42)

# Minimize negative Rayleigh quotient on the sphere S^1
M = Sphere(1)
A = randn(2, 2); A = A' * A  # symmetric positive definite

f(M, p) = -p' * A * p
function grad_f(M, p)
    g = -2 * A * p
    return g - dot(g, p) * p  # project to tangent space
end

p0 = rand(M)

x = gradient_descent(
    M, f, grad_f, p0;
    stepsize = DistanceOverGradients(M; initial_distance = 1e-2, use_curvature = false),
    stopping_criterion = StopAfterIteration(200) | StopWhenGradientNormLess(1e-8),
)

println("final cost = ", f(M, x))

codecov · 2025-09-05T13:32:29Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.78%. Comparing base (a364659) to head (9b6a8ec).
⚠️ Report is 3 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #505      +/-   ##
==========================================
- Coverage   99.80%   99.78%   -0.03%     
==========================================
  Files          85       85              
  Lines        9375     9418      +43     
==========================================
+ Hits         9357     9398      +41     
- Misses         18       20       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

kellertuer · 2025-09-05T14:08:48Z

Nice, thanks!

At first glance it seem the current constructor is “the old one” with M necessary – maybe we can have a second ode employing the factory (I wrote that after our discussions at last years JuliaCon)?

I will try to find time to review your code, but it would also be nice to write a test case and have test coverage. Maybe just one further run in the gradient descent tests where you have a cost and gradient already...

kellertuer

Thanks for the PR!

I am (maybe I said tha already) not so much a fan of that paper. Illustrating that a gradient descent with a constant stepsize does not converge, is no wonder – one needs an Armijo linesearch for example.
Sure they could have compared to Armijo and argued why theirs is easier, ... that would have been the proper comparison. But well, that is not your fault of couse

Here are a few small comment, mainly on the documentation, which I rendered locally.
Test coverage would be nice.

docs/src/references.bib

src/Manopt.jl

src/plans/stepsize.jl

Co-authored-by: Ronny Bergmann <[email protected]>

mateuszbaran · 2025-09-08T08:09:15Z

Thanks for the contribution, it's good to have a wider choice of algorithms here.

I am (maybe I said tha already) not so much a fan of that paper. Illustrating that a gradient descent with a constant stepsize does not converge, is no wonder – one needs an Armijo linesearch for example.

Yes, doing better than a constant stepsize isn't a great achievement. On the other hand, it's stochastic optimization where AFAIK standard Armijo doesn't have any convergence guarantees. This paper proposes a modification that does have some guarantees: https://par.nsf.gov/servlets/purl/10208765 . It can be trivially adapted to the Riemannian setting, so the authors of the RDoG paper could have included it in their comparison.

Nimrais · 2025-09-08T08:59:02Z

I have read the feedback, thanks. it's not hard to address. However I have one problem about this one.

At first glance it seem the current constructor is “the old one” with M necessary – maybe we can have a second ode employing the factory (I wrote that after our discussions at last years JuliaCon)?

DoG algorithms are inherently dependent on your starting point, so that is why I do need the manifold argument M. It's essentially a poor man way to allocate a container for the starting position. Later, I am overwriting whatever is written there from the optimizer state on iteration 0. The only way how alternative constructor can be implemented is with a point on Manifold provided. Is it better? Any other ideas?

kellertuer · 2025-09-08T11:44:49Z

I have read the feedback, thanks. it's not hard to address. However I have one problem about this one.

At first glance it seem the current constructor is “the old one” with M necessary – maybe we can have a second ode employing the factory (I wrote that after our discussions at last years JuliaCon)?

DoG algorithms are inherently dependent on your starting point, so that is why I do need the manifold argument M. It's essentially a poor man way to allocate a container for the starting position. Later, I am overwriting whatever is written there from the optimizer state on iteration 0. The only way how alternative constructor can be implemented is with a point on Manifold provided. Is it better? Any other ideas?

Hm but the one and only idea of the factory (from the one without a suffix to the one with stepwise) is to “plug in” the manifold (and call the constructor then) to some later point. So that scheme has exactly use usage, namely to not write M in the constructor as you wrote above. You can but you do not have to.

Is that the case here?

test/plans/test_stepsize.jl

test: add Hyperbolic test for DoG

kellertuer · 2025-09-08T16:52:57Z

Thanks for your work today. Coverage is already on a good way.

Could you add an entry to the Changelog as well? Just roughly follow the format in there, I can fix it before doing the release.
If you feel this is enough of a contribution, we can also check whether you get an entry on the about.md and the zenodo metadata. For me this is on the brink, but if you like, sure we can add you :)

Nimrais · 2025-09-08T21:30:03Z

I am (maybe I said tha already) not so much a fan of that paper. Illustrating that a gradient descent with a constant stepsize does not converge, is no wonder – one needs an Armijo linesearch for example.

Yes, doing better than a constant stepsize isn't a great achievement. On the other hand, it's stochastic optimization where AFAIK standard Armijo doesn't have any convergence guarantees.

I agree with you both. To my taste, the paper does not have the proper scholarly tone :). But it does not change the fact that the method is really cheap — you really just need one additional number to store to outperform the constant stepsize. So once Armijo becomes computationally infeasible because it's just too slow to backtrack, at least you can try this as something cheap that is better than constant stepsize without playing with strange schedules for the stepsize.

This paper proposes a modification that does have some guarantees: https://par.nsf.gov/servlets/purl/10208765 . It can be trivially adapted to the Riemannian setting, so the authors of the RDoG paper could have included it in their comparison

Interesting! I will take a look. I haven't seen this paper; the authors of the RDoG probably haven't seen it as well, as I do not see them citing it.

Nimrais · 2025-09-08T21:57:46Z

If you feel this is enough of a contribution, we can also check whether you get an entry on the about.md and the zenodo metadata. For me this is on the brink, but if you like, sure we can add you :)

Ah no I am fine you don't need to add me. I will return later with RDoWG I just do not have time to add it into this PR.

kellertuer · 2025-09-09T04:20:35Z

Great!

And sure, shorter PRs that focus on one thing are better than super long PRs, so that decision is a very good one I think :)

Let me just check the rendered docs during the day somewhen, the rest looks already good I think.

kellertuer

This overall looks very nice already!

I have two small comments on the docs and one for test coverage.

src/plans/stepsize.jl

kellertuer · 2025-09-09T11:21:05Z

since we are nearly finished here, I will wait with #503 for this one and release both together as a new version.

Mainly because by waiting I have to merge the changelog, then you do not have to worry about that.

src/plans/stepsize.jl

Co-authored-by: Ronny Bergmann <[email protected]>

src/plans/stepsize.jl

Co-authored-by: Ronny Bergmann <[email protected]>

Nimrais · 2025-09-09T16:38:47Z

@kellertuer I think it's in a good shape now, do you see smt that still needs to be addressed?

kellertuer

Looks good! Thanks!

Nimrais added 13 commits August 18, 2025 23:15

feat: add RDoG

29b95d4

fix(example): compare with Armijo as well

4147596

fix: clean Armijo parameters

d4729a0

test: add distance over gradient tests

d4c9129

test: test add Hyperbolic RDoG

df9ba68

test(fix): correct point on the manifold

f1dc73f

fix: raname to DoG

007b47f

fix: remove example from the code

11e2f5b

Merge branch 'JuliaManifolds:master' into master

d8944ce

style: 🎨

8bfeba1

Merge branch 'master' of github.com:Nimrais/Manopt.jl

1027ba5

style: runic style

7b74eef

docs: update RDoG cite

c1a8b2d

kellertuer reviewed Sep 7, 2025

View reviewed changes

Nimrais and others added 3 commits September 7, 2025 11:07

Update docs/src/references.bib

88806f7

Co-authored-by: Ronny Bergmann <[email protected]>

style: fix citation style

3855ae4

docs: add DoG in stepsize.md

ac7ed55

Nimrais added 2 commits September 8, 2025 11:35

docs: correct ref for geometric_curvature_function

3518ffe

style: two lines for export

eb3af83

Nimrais added 2 commits September 8, 2025 14:15

fix: promote types in constructor

25db631

style: 🎨

eb03f99

kellertuer reviewed Sep 8, 2025

View reviewed changes

test/plans/test_stepsize.jl Outdated Show resolved Hide resolved

Nimrais added 3 commits September 8, 2025 14:27

test(fix): remove M from DistanceOverGradients constructor

da3277c

fix: clamp 0

40cd58c

test(fix): remove magic constant

f30b2cc

Nimrais added 5 commits September 8, 2025 15:10

fix: remove @show

46465c5

docs: fix grammar in rtol comment

ec3ed83

test: get last stepsize tests

5b09abf

fix: style 🎨

8af6351

test: add Hyperbolic optimization test for DoG

823b972

test: add Hyperbolic test for DoG

Nimrais added 4 commits September 8, 2025 22:16

test: add repr test for DoG

ed415de

test(fix): remove hand crafted projections

77ee0c5

docs: update changelog

5755db7

style: 🎨

b941227

kellertuer reviewed Sep 9, 2025

View reviewed changes

src/plans/stepsize.jl Show resolved Hide resolved

src/plans/stepsize.jl Outdated Show resolved Hide resolved

src/plans/stepsize.jl Show resolved Hide resolved

Nimrais added 2 commits September 9, 2025 11:50

test: add stupid test for get_initial_stepsize

2682813

test: fix update get_initial_stepsize

c0b3252

kellertuer reviewed Sep 9, 2025

View reviewed changes

src/plans/stepsize.jl Outdated Show resolved Hide resolved

kellertuer reviewed Sep 9, 2025

View reviewed changes

src/plans/stepsize.jl Outdated Show resolved Hide resolved

Update src/plans/stepsize.jl

46104f6

Co-authored-by: Ronny Bergmann <[email protected]>

kellertuer reviewed Sep 9, 2025

View reviewed changes

src/plans/stepsize.jl Outdated Show resolved Hide resolved

Nimrais and others added 2 commits September 9, 2025 15:00

Update src/plans/stepsize.jl

3e79b81

Co-authored-by: Ronny Bergmann <[email protected]>

fix: remove untested numerical trick

9b6a8ec

kellertuer approved these changes Sep 9, 2025

View reviewed changes

kellertuer merged commit a91e67a into JuliaManifolds:master Sep 9, 2025
13 of 14 checks passed

kellertuer mentioned this pull request Sep 9, 2025

Adding RDoG #502

Closed

Add Distance over Gradients Stepsize #505

Add Distance over Gradients Stepsize #505

Uh oh!

Conversation

Nimrais commented Sep 5, 2025

Uh oh!

codecov bot commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

kellertuer commented Sep 5, 2025

Uh oh!

kellertuer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mateuszbaran commented Sep 8, 2025

Uh oh!

Nimrais commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kellertuer commented Sep 8, 2025

Uh oh!

Uh oh!

kellertuer commented Sep 8, 2025

Uh oh!

Nimrais commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Nimrais commented Sep 8, 2025

Uh oh!

kellertuer commented Sep 9, 2025

Uh oh!

kellertuer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kellertuer commented Sep 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Nimrais commented Sep 9, 2025

Uh oh!

kellertuer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Sep 5, 2025 •

edited

Loading

Nimrais commented Sep 8, 2025 •

edited

Loading

Nimrais commented Sep 8, 2025 •

edited

Loading