Automatic step sizes for SVRG #207

bagibence · 2024-08-08T19:53:46Z

Attempt to automatically determine the batch size and step size for SVRG when fitting a GLM with Poisson observations and a softplus inverse link function.
Based on this paper.

… by SVRG

bagibence · 2024-08-09T17:55:40Z

This needs more exhaustive testing to make sure it works on real datasets.
I have even encountered toy examples where my current implementation didn't work.

bagibence · 2024-08-09T18:20:20Z

The regularization strength might have to be added to the computed L and L_max constants.
It could also be useful as a bound for the strong-convexity constant when using ridge.
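
The effect of a ridge penalty on these constants can be checked numerically. The sketch below uses a least-squares loss rather than the softplus-Poisson loss from this PR, because its Hessian is known in closed form. The function name smoothness_constants and the penalty scaling (strength / 2) * ||w||^2 are assumptions made for illustration, not nemos code.

```python
import numpy as np


def smoothness_constants(X, strength=0.0):
    """Return (L, L_max, mu) for the loss
    (1 / (2 n)) * ||X w - y||^2 + (strength / 2) * ||w||^2.

    Assumption: the ridge penalty is scaled as (strength / 2) * ||w||^2,
    so it adds strength * I to every Hessian.
    """
    n_samples = X.shape[0]
    # Eigenvalues (ascending) of the Hessian of the averaged data term.
    eigvals = np.linalg.eigvalsh(X.T @ X / n_samples)
    # Each per-sample Hessian is x_i x_i^T; its largest eigenvalue is ||x_i||^2.
    per_sample = np.sum(X**2, axis=1)
    l_smooth = eigvals[-1] + strength    # L: smoothness of the full loss
    l_max = per_sample.max() + strength  # L_max: worst per-sample smoothness
    mu = eigvals[0] + strength           # strong-convexity constant
    return l_smooth, l_max, mu


rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
L0, Lmax0, mu0 = smoothness_constants(X)
L1, Lmax1, mu1 = smoothness_constants(X, strength=0.1)
```

Under that scaling, both L and L_max shift by exactly the penalty strength, and the strong-convexity constant is at least the penalty strength even when X is rank deficient.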

BalzaniEdoardo · 2024-08-26T12:30:38Z

This PR provides the infrastructure for computing an optimal stepsize and batch_size for SVRG based on the GLM configurations.

The optimal hyperparameters depend on the L-smoothness of the loss function. This means that for each model configuration (observation noise, link function, regularization), one may need to compute a different estimate of the smoothness parameters.

Here, I implemented a look-up table that should be easy to extend whenever new estimates become available (for example, if we derive the L-smoothness for Gamma + softplus observations).
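
A look-up table of this kind might be sketched as follows. All names here (SMOOTHNESS_TABLE, register, lookup_smoothness_fn, the string keys) are hypothetical, and the softplus-Poisson entry is a placeholder that does not reproduce the estimate derived in this PR.

```python
from typing import Callable, Dict, Optional, Tuple

# Key: (observation model name, inverse link function name).
ConfigKey = Tuple[str, str]
# Value: a function mapping (X, y) to the smoothness constants (L, L_max).
SmoothnessFn = Callable[[list, list], Tuple[float, float]]

SMOOTHNESS_TABLE: Dict[ConfigKey, SmoothnessFn] = {}


def register(obs_model: str, inverse_link: str):
    """Decorator that adds a smoothness estimate to the table."""

    def decorator(fn: SmoothnessFn) -> SmoothnessFn:
        SMOOTHNESS_TABLE[(obs_model, inverse_link)] = fn
        return fn

    return decorator


@register("poisson", "softplus")
def softplus_poisson_smoothness(X, y):
    # Placeholder body: the real estimate from this PR is not reproduced here.
    raise NotImplementedError("illustrative placeholder")


def lookup_smoothness_fn(obs_model: str, inverse_link: str) -> Optional[SmoothnessFn]:
    """Return the registered estimate, or None if no default is available."""
    return SMOOTHNESS_TABLE.get((obs_model, inverse_link))
```

With this layout, lookup_smoothness_fn("gamma", "softplus") returns None until someone registers an estimate for that pair, so the caller can fall back to the solver's own defaults. Extending the table means adding one more decorated function.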

BalzaniEdoardo · 2024-10-08T18:09:28Z

With my edits, I added:

A new private module, solvers (not exposed in nemos.__init__), which includes:
- _svrg.py: the SVRG implementation;
- _svrg_defaults.py: functions to compute default parameters for SVRG (batch size and step size);
- _compute_defaults.py: the lookup table that receives a model as input and checks whether defaults are available.
An abstract method on BaseRegressor responsible for selecting the configuration used to optimize the solver parameters. This should be easy to extend.
A number of tests that check the behavior of the GLM defaults over all possible configurations of regularizers, observation models, and link functions.
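
The abstract-method hook described above could look roughly like the sketch below. Every name in it (BaseRegressorSketch, GLMSketch, _default_params_fn, resolve_solver_kwargs, placeholder_svrg_defaults) is hypothetical, the returned numbers are arbitrary placeholders, and the rule that user-provided values win over computed ones is an assumption rather than a confirmed detail of this PR.

```python
from abc import ABC, abstractmethod
from typing import Any, Callable, Dict, Optional


def placeholder_svrg_defaults(n_samples: int) -> Dict[str, Any]:
    # Arbitrary placeholder values; not the rule implemented in this PR.
    return {"batch_size": min(n_samples, 32), "stepsize": 1e-3}


class BaseRegressorSketch(ABC):
    def __init__(self, solver_name: str, solver_kwargs: Optional[dict] = None):
        self.solver_name = solver_name
        self.solver_kwargs = dict(solver_kwargs or {})

    @abstractmethod
    def _default_params_fn(self) -> Optional[Callable[[int], Dict[str, Any]]]:
        """Return a function computing solver defaults, or None if unavailable."""

    def resolve_solver_kwargs(self, n_samples: int) -> dict:
        compute = self._default_params_fn()
        if compute is None:
            # No defaults known for this configuration: leave kwargs untouched.
            return dict(self.solver_kwargs)
        # Assumption: values the user provided override computed defaults.
        return {**compute(n_samples), **self.solver_kwargs}


class GLMSketch(BaseRegressorSketch):
    def _default_params_fn(self):
        # Only SVRG has computed defaults in this sketch.
        return placeholder_svrg_defaults if self.solver_name == "SVRG" else None


svrg_kwargs = GLMSketch("SVRG", {"stepsize": 0.5}).resolve_solver_kwargs(100)
other_kwargs = GLMSketch("GradientDescent").resolve_solver_kwargs(100)
```

Keeping the selection logic in one overridable method means a new model class only has to say which defaults apply to it, while the merging with solver_kwargs stays in the base class.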

sjvenditto · 2024-10-10T14:16:11Z

tests/test_glm.py

+        else:
+            assert opt_state.stepsize > 0
+            assert isinstance(opt_state.stepsize, float)
+            model.fit(X, y)


I would consider removing the model.fit() calls in this test (as well as in test_glm_optimal_config_set_initial_state_pytree, and in both functions with the same names in the TestPopulationGLM class), since fit is not called consistently across all cases and no checks happen after the model is fit.

sjvenditto · 2024-10-10T14:36:15Z

tests/test_svrg_defaults.py

+    else:
+        assert (
+            "stepsize" in result and result["stepsize"] > 0
+        ), "Stepsize should be computed since it was not provided."


I don't know if I missed it in one of the previous tests, but I didn't notice any test that explicitly checks the value of a computed stepsize. They only check that stepsize > 0. Would it be worth adding a test that checks the result explicitly, similar to test_calculate_optimal_batch_size_svrg_all_config at the end?
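
A test of that kind might look like the sketch below. The function stepsize_from_smoothness and its 1 / L rule are stand-ins, chosen because the expected values are easy to verify by hand; they are not the formula used in _svrg_defaults.py.

```python
import math


def stepsize_from_smoothness(l_smooth: float) -> float:
    """Stand-in rule (1 / L), used only to make the test concrete."""
    if l_smooth <= 0:
        raise ValueError("l_smooth must be positive")
    return 1.0 / l_smooth


def test_stepsize_matches_expected_value():
    # Each case pairs an input with a hand-computed expected stepsize,
    # so the test fails if the formula changes, not only if the sign does.
    cases = [(1.0, 1.0), (4.0, 0.25), (10.0, 0.1)]
    for l_smooth, expected in cases:
        result = stepsize_from_smoothness(l_smooth)
        assert isinstance(result, float)
        assert math.isclose(result, expected, rel_tol=1e-12)


test_stepsize_matches_expected_value()
```

In the project's test suite the loop over cases would more naturally be a parametrized test, with the expected values derived independently from the smoothness constants of a small fixed dataset.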

bagibence added 2 commits August 7, 2024 17:07

Add automatic batch and step size for softplus-Poisson GLMs optimized…

0f9f1da

… by SVRG

Add docstrings

a88bc61

bagibence and others added 17 commits August 9, 2024 14:23

Handle stepsize not being in solver_kwargs

a021cbb

Add new way to calculate stepsize and also b_tilde

e606fc1

Merge branch 'development' into auto_stepsize_svrg

e94cbc8

started renaming vars

95870f8

added ref to algorithm

5661899

renamed function and generalized lookup

481220d

added the table calculations

68c3240

Merge branch 'development' into auto_stepsize_svrg

db5b70f

brought back maxiter to 10K.

730b789

moved pieces around

d15ad8d

improved doscrsrings and added test for config

ab4dbfd

improved naming and docstrings

886bdeb

started testing

d5baa02

changed naming

deae6a1

linted

496702a

added two missed lines for cov

e084844

added test all table cases

0d36aaf

BalzaniEdoardo marked this pull request as ready for review August 26, 2024 12:19

BalzaniEdoardo self-requested a review as a code owner August 26, 2024 12:19

BalzaniEdoardo added 2 commits August 26, 2024 14:31

linted

aada505

linted

e9028a8

BalzaniEdoardo marked this pull request as draft August 26, 2024 12:44

BalzaniEdoardo added 2 commits August 26, 2024 16:59

added glm tests

b8801b5

linted

7e8d576

BalzaniEdoardo marked this pull request as ready for review August 26, 2024 15:00

BalzaniEdoardo added 10 commits August 26, 2024 17:55

improved glm docstrings

06b9f53

removed args from docstrings

cae93ac

Merge branch 'development' into auto_stepsize_svrg

5eec1e3

added billy's comments

35622bd

Merge branch 'development' into auto_stepsize_svrg

e49603b

Merge branch 'main' into auto_stepsize_svrg

8dcc9b2

merged dev

2cadad0

added tests for auto-stepsize

8f33e7f

removed unused import

028952d

fixed warns in tests

7d766eb

BalzaniEdoardo assigned gviejo, billbrod and sjvenditto Oct 8, 2024

BalzaniEdoardo added 4 commits October 8, 2024 14:23

fix warn svrg default

93a7033

moved the methods around for re-usability

3e55ce7

fixed mockregressor

4fdcf32

fix comment

30934ca

sjvenditto reviewed Oct 10, 2024
