Add transforming adaptation with normalizing flows #154

Draft
wants to merge 34 commits into main
Conversation

@aseyboldt (Member) commented on Oct 17, 2024

Experimental new algorithm that uses a normalizing flow instead of a mass matrix.
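
For context (a minimal sketch, not this PR's implementation): normalizing flows are commonly built from invertible coupling layers, where half of the coordinates pass through unchanged and parameterize an affine transform of the other half. That construction makes both the inverse and the Jacobian log-determinant cheap. A numpy version with a hypothetical tiny conditioner network:

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 4
half = dim // 2
width = 8

# Random weights for a tiny conditioner network (illustration only).
w1 = rng.normal(size=(half, width)) * 0.5
b1 = np.zeros(width)
w2 = rng.normal(size=(width, 2 * half)) * 0.5
b2 = np.zeros(2 * half)

def conditioner(x1):
    # Maps the untouched half to a shift and log-scale for the other half.
    h = np.tanh(x1 @ w1 + b1)
    out = h @ w2 + b2
    return out[..., :half], out[..., half:]

def forward(x):
    # y1 = x1, y2 = x2 * exp(log_scale) + shift
    x1, x2 = x[..., :half], x[..., half:]
    shift, log_scale = conditioner(x1)
    y2 = x2 * np.exp(log_scale) + shift
    logdet = log_scale.sum(axis=-1)  # log |det J| of the coupling layer
    return np.concatenate([x1, y2], axis=-1), logdet

def inverse(y):
    # Invertible by construction: recompute shift/log_scale from y1 = x1.
    y1, y2 = y[..., :half], y[..., half:]
    shift, log_scale = conditioner(y1)
    x2 = (y2 - shift) * np.exp(-log_scale)
    return np.concatenate([y1, x2], axis=-1)

x = rng.normal(size=(5, dim))
y, logdet = forward(x)
assert np.allclose(inverse(y), x)  # exact round trip
```

Stacking several such layers (the `num_layers` option below) yields an expressive, still cheaply invertible map.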

Set up using pixi:

git clone https://github.com/pymc-devs/nutpie
cd nutpie
git fetch origin pull/154/head:transform
git switch transform

pixi run develop
pixi shell

This gives a shell with an appropriate Python setup.

Usage with PyMC:

import pymc as pm
import nutpie
import numpy as np
import jax

jax.config.update("jax_enable_x64", True)

with pm.Model() as model:
    log_sd = pm.Normal("log_sd")
    pm.Normal("y", sigma=np.exp(log_sd))

compiled = nutpie.compile_pymc_model(model, backend="jax", gradient_backend="jax")

compiled = (
    compiled
    .with_transform_adapt(
        # Neural network width, default is half the number of model parameters
        nn_width=None,
        # Number of normalizing flow layers
        num_layers=8,
        # Depth of the neural network in each flow layer
        nn_depth=1,
        # Print status updates of the optimizer
        verbose=False,
        # Number of gradients to use in each training phase
        window_size=5000,
        # Learning rate of the optimizer
        learning_rate=1e-3,
        # Print progress bars for the optimization. Very spammy...
        show_progress=False,
        # Number of initial windows with a diagonal mass matrix
        num_diag_windows=10,
    )
)

trace = nutpie.sample(
    compiled,
    transform_adapt=True,
    chains=2,
    tune=1000,
    draws=1000,
    cores=1,
    seed=123,
)
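
Why a flow instead of a mass matrix: the toy model above is a funnel, where the conditional scale of y varies with log_sd, so no single linear rescaling fits the whole posterior. A nonlinear transform does. A numpy sketch (illustration only, not part of nutpie) of the fixed reparameterization the flow has to learn:

```python
import numpy as np

rng = np.random.default_rng(123)
n = 100_000

# Draws from the funnel prior above: log_sd ~ N(0, 1),
# y | log_sd ~ N(0, exp(log_sd)^2).
log_sd = rng.normal(size=n)
y = rng.normal(size=n) * np.exp(log_sd)

# The conditional scale of y spans several orders of magnitude,
# which is what defeats any single (even dense) mass matrix:
print(np.exp(log_sd).min(), np.exp(log_sd).max())

# The nonlinear map z = y / exp(log_sd) makes both coordinates
# standard normal -- the kind of transform the flow has to learn.
z = y * np.exp(-log_sd)
print(z.std())  # close to 1.0
```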

Usage with Stan:

import nutpie
import jax
import os

os.environ["TBB_CXX_TYPE"] = "clang"
jax.config.update("jax_enable_x64", True)

code = """
parameters {
    real log_sigma;
    real x;
}
model {
    log_sigma ~ normal(0, 1);
    x ~ normal(0, exp(log_sigma));
}
"""


compiled = nutpie.compile_stan_model(code=code)

compiled = (
    compiled
    .with_transform_adapt(
        # Neural network width, default is half the number of model parameters
        nn_width=None,
        # Number of normalizing flow layers
        num_layers=8,
        # Depth of the neural network in each flow layer
        nn_depth=1,
        # Print status updates of the optimizer
        verbose=False,
        # Number of gradients to use in each training phase
        window_size=5000,
        # Learning rate of the optimizer
        learning_rate=1e-3,
        # Print progress bars for the optimization. Very spammy...
        show_progress=False,
        # Number of initial windows with a diagonal mass matrix
        num_diag_windows=10,
    )
)

trace = nutpie.sample(
    compiled,
    transform_adapt=True,
    chains=2,
    tune=1000,
    draws=1000,
    cores=1,
    seed=123,
)

The optimization can be quite expensive computationally (but luckily it doesn't need any extra gradient evaluations). A GPU is very helpful here. (JAX should pick up a CUDA device automatically.)
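
The flow replaces the mass matrix by changing variables: if x = f(z), the sampler can target the pulled-back density p(f(z)) |det J_f(z)| in z-space, where the geometry is closer to standard normal. A quick numpy check of this change-of-variables identity for an affine one-layer "flow" (illustration only, not nutpie's API):

```python
import numpy as np

# Target density: x ~ N(mu, sigma^2); "flow": x = f(z) = mu + sigma * z.
# (Illustration only -- nutpie's flow is a learned nonlinear map.)
mu, sigma = 2.0, 0.5

def log_p_x(x):
    return -0.5 * ((x - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2 * np.pi)

def log_p_z(z):
    # Pull-back of the target through f: log p(f(z)) + log |det J_f|,
    # with J_f = sigma for this affine map.
    return log_p_x(mu + sigma * z) + np.log(sigma)

# When the flow matches the target exactly, the pulled-back density
# is standard normal -- ideal geometry for the sampler:
z = np.linspace(-3.0, 3.0, 7)
assert np.allclose(log_p_z(z), -0.5 * z**2 - 0.5 * np.log(2 * np.pi))
```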

@aseyboldt added labels: help wanted (Extra attention is needed), normalizing-flows (Needed for adaptation through normalizing-flows) on Oct 17, 2024