Diffusion preconditioners refactor #1317
Conversation
No tests fixed yet.
physicsnemo.utils, launch.config is just gone. It was empty.
Greptile Overview
Greptile Summary
Refactors diffusion preconditioners by introducing a clean BasePreconditioner abstract class that standardizes how preconditioners wrap neural networks and apply preconditioning formulas. Implements four preconditioners (VPPreconditioner, VEPreconditioner, IDDPMPreconditioner, EDMPreconditioner) as standalone classes with comprehensive docstrings including mathematical formulas. Migrates legacy preconditioners to inherit from new base classes, eliminating code duplication while maintaining full backward compatibility with existing APIs.
Important Files Changed
File Analysis
| Filename | Score | Overview |
|---|---|---|
| physicsnemo/diffusion/preconditioners/preconditioners.py | 4/5 | Introduces clean BasePreconditioner architecture and four preconditioner implementations with comprehensive docstrings |
| physicsnemo/diffusion/preconditioners/legacy.py | 4/5 | Migrates legacy preconditioners to inherit from new base classes while maintaining full backward compatibility |
| test/diffusion/test_preconditioners.py | 4/5 | Comprehensive tests covering constructors, sigma/coefficients methods, forward pass, and checkpoint loading with non-regression testing |
condition : Dict[str, torch.Tensor]
    Dictionary of conditioning tensors. Each tensor must have shape
    :math:`(B, *)` where the batch size :math:`B` matches that of ``x``.
    These are passed to the wrapped ``model`` without modification.
This should be marked optional, no? What about unconditional diffusion models? Similarly, the underlying models themselves should not have to require a `condition` argument.
Yeah, I was thinking about making `condition` optional, but if we make it an optional argument, how do we separate it from those in `**model_kwargs`? There might be conflicts and mix-ups between the two.
Similarly the underlying models themselves should not have to require a condition argument
Right, but then there is a similar problem: how do we differentiate between models that require one and others that don't?
I found that making `condition` required was the cleanest solution to remove all types of confusion and ambiguity, even though this argument is not needed in many cases.
But if you have a better idea I am open to it.
Ugh, yeah, good points; the model kwargs make this annoying. It seems hard to avoid some sort of awkwardness here, apart from flat-out defining separate preconditioners for the conditional and unconditional cases. To me it just feels wrong to have to wrap underlying unconditional backbones with something to pass a dummy condition arg, and similarly for the top-level preconditioner, but I'm having trouble coming up with alternate solutions.
What do you think of a capability flag `conditional` passed to the preconditioner init, which would specify whether or not the preconditioner (and underlying model) are expected to use the `condition` arg? Then we could have
forward(x, condition: TensorDict | None = None, **model_kwargs)
where within the forward pass we call things based on the value of `self.conditional`.
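For concreteness, here is a rough sketch of what that flag could look like. The class name and internals below are hypothetical, not part of this PR, and the actual preconditioning math is omitted; only the branching on `self.conditional` is the point.

```python
from typing import Dict, Optional

import torch


class FlaggedPreconditioner(torch.nn.Module):
    """Hypothetical illustration of the `conditional` capability-flag idea."""

    def __init__(self, model: torch.nn.Module, conditional: bool = False):
        super().__init__()
        self.model = model
        self.conditional = conditional  # declared once, at init time

    def forward(
        self,
        x: torch.Tensor,
        t: torch.Tensor,
        condition: Optional[Dict[str, torch.Tensor]] = None,
        **model_kwargs,
    ) -> torch.Tensor:
        # Preconditioning coefficients omitted; this only shows the branching.
        if self.conditional:
            if condition is None:
                raise ValueError("conditional=True but no `condition` was passed")
            return self.model(x, t, condition=condition, **model_kwargs)
        # Unconditional path: the wrapped model never receives a `condition` kwarg.
        return self.model(x, t, **model_kwargs)
```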
An alternate, and possibly spicier, suggestion: we drop the mention of `condition` entirely from the forward signature. It is absorbed into `model_kwargs`, may or may not be passed, and it is up to the user to do the input validation (which is the only operation on `condition` within the forward of the preconditioner). This would also allow for flexible nomenclature of the conditioning in principle, i.e. if some bespoke model wants to call it `era5_condition` (and optionally add others like `time_of_day_condition`, etc.), then it is welcome to.
(That is, unless in the samplers there's some specific operation that needs the conditional fields explicitly.)
What do you think of a capability flag `conditional` passed to the preconditioner init, which would specify whether or not the preconditioner (and underlying model) are expected to use the `condition` arg?
@pzharrington if it were only for the preconditioner, I would say okay. But we would need this `conditional` flag everywhere the model is passed as a callback (loss, samplers, etc.). That would make things very heavy IMO. Also, the loss object is supposed to be purely functional, so the flag would need to be passed to `__call__` and not `__init__`.
I prefer the second alternative that you proposed. Another option I just thought of would be to make the `condition` argument keyword-only, with a signature `model(x, t, *, condition=None, **model_kwargs)`. That removes some possible ambiguity and mix-up, because by forcing `condition` to always be passed by name, we forbid calls such as `model(x, t, condition={...}, kwargs1=..., condition={...}, other_kwarg=...)`, which are anyway invalid.
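To illustrate the keyword-only option, here is a minimal sketch; the wrapper name and body are hypothetical and not from this PR, only the bare `*` marker in the signature matters.

```python
from typing import Dict, Optional

import torch


class KeywordOnlyPreconditioner(torch.nn.Module):
    """Hypothetical wrapper showing the keyword-only `condition` signature."""

    def __init__(self, model: torch.nn.Module):
        super().__init__()
        self.model = model

    def forward(
        self,
        x: torch.Tensor,
        t: torch.Tensor,
        *,  # everything after this marker must be passed by keyword
        condition: Optional[Dict[str, torch.Tensor]] = None,
        **model_kwargs,
    ) -> torch.Tensor:
        # `condition` can never be swallowed positionally or confused with an
        # entry of **model_kwargs, because it must be spelled out by name.
        if condition is not None:
            return self.model(x, t, condition=condition, **model_kwargs)
        return self.model(x, t, **model_kwargs)


# precond(x, t, {"y": y})                           -> TypeError (condition is keyword-only)
# precond(x, t, condition={"y": y}, some_kwarg=1.0) -> OK
```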
@NickGeneva's comment above reminded me of another reason why I wanted an explicit argument for `condition`. In multi-diffusion (whose implementation is still at the stage of philosophical reflection as of now), the model needs to know which argument is `condition`, because it needs to apply specific operations to the conditioning tensors (patching, sometimes interpolation). So, in multi-diffusion, `condition` cannot be treated the same as any other `model_kwargs`.
Roughly, it should look like:
multi_diffusion_model = MultiDiffusionModel(model, patching_options)
x0 = multi_diffusion_model(x, t, condition={"y": y, "z": z}, some_kwarg=some_val)
Under the hood, the forward pass of `multi_diffusion_model` applies patching (and optionally interpolation) and concatenation to the items in `condition`, but it leaves all other kwargs untouched.
In multi-diffusion, the model needs to know which argument is condition
And it would, right? The underlying model would define `condition` in its forward kwargs and handle it accordingly within its forward pass. Under my second suggestion, the preconditioner wrapping it would simply pass `condition` through as part of `**model_kwargs`; it doesn't need to know about or explicitly do anything with the condition (there is no condition-dependent preconditioning, and if there were, that would be crazy 😅).
E.g., in your snippet, we'd have
multi_diffusion_model = MultiDiffusionModel(model, patching_options) # Wrap a base backbone with multidiffusion
model_precond = EDMPreconditioner(multi_diffusion_model) # Wrapped by preconditioner, ready for training
x0 = model_precond(x, t, condition={"y": y, "z": z}, some_kwarg=some_val)
The preconditioner doesn't care about the conditioning or the patching/interpolation applied to it; it delegates that to the underlying models. This is fine, no? Or am I missing something?
Hmm, I'm trying to understand that...
In your snippet above, what would be the signatures of `model`, `model_precond`, and `multi_diffusion_model`? (And I mean the actual signatures, not just the way they are called, because `model(x, t, condition={}, kwarg=val)` may look the same whether `condition` is a separate keyword argument or part of the kwargs, but the signature signals intent.)
PhysicsNeMo Pull Request
Overview
This PR refactors the diffusion model preconditioners to introduce a clean, extensible architecture. The goal is to standardize how preconditioners wrap neural network models and apply the preconditioning formula, making it easier to implement new preconditioning schemes and maintain existing ones.
- New `BasePreconditioner` abstract base class in `physicsnemo/diffusion/preconditioners/preconditioners.py` that defines a standardized interface for wrapping diffusion models with preconditioning. The base class handles the common forward pass logic while subclasses implement `compute_coefficients()` to define their specific preconditioning scheme. Subclasses can optionally override `sigma()` to implement custom noise schedules (time-to-noise mappings). A key improvement of this refactor is to enable dependency-injection design patterns by wrapping an arbitrary `physicsnemo.Module` instance with a preconditioner (a minimal subclassing sketch is included after this list).
- Four preconditioner reimplementations based on the new `BasePreconditioner`: `VPPreconditioner`, `VEPreconditioner`, `IDDPMPreconditioner`, and `EDMPreconditioner`. These are cleaner, standalone versions of the existing legacy preconditioners, with comprehensive docstrings including mathematical formulas for the preconditioning coefficients and noise schedules.
- Migrated legacy preconditioners in `legacy.py` to inherit from the new base classes. This eliminates code duplication while maintaining full backward compatibility: all existing method signatures, attributes, and behaviors are preserved. Users of the legacy API do not need to change their code.
- Comprehensive CI tests in `test/diffusion/test_preconditioners.py` covering constructors, the `sigma()` and `compute_coefficients()` methods, the forward pass, and checkpoint loading with `physicsnemo.Module.from_checkpoint()`.

Closes 🚀[FEA]: Allow passing models as to EDMPrecond #796.
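As a rough illustration of the intended subclassing pattern, here is a hedged sketch. The class below is a stand-in, not the PR's actual implementation, and the abstract-method signatures may differ from what is merged; the EDM coefficient formulas themselves follow Karras et al. (2022).

```python
import torch


class SketchEDMPreconditioner:
    """Stand-in, not the PR's class: shows the sigma() / compute_coefficients() split."""

    def __init__(self, model, sigma_data: float = 0.5):
        self.model = model          # any callable backbone, e.g. a physicsnemo.Module
        self.sigma_data = sigma_data

    def sigma(self, t: torch.Tensor) -> torch.Tensor:
        # EDM identifies the time variable with the noise level directly.
        return t

    def compute_coefficients(self, sigma: torch.Tensor):
        # EDM preconditioning coefficients (Karras et al., 2022).
        s2, d2 = sigma**2, self.sigma_data**2
        c_skip = d2 / (s2 + d2)
        c_out = sigma * self.sigma_data / torch.sqrt(s2 + d2)
        c_in = 1.0 / torch.sqrt(s2 + d2)
        c_noise = torch.log(sigma) / 4.0
        return c_skip, c_out, c_in, c_noise

    def __call__(self, x, t, condition, **model_kwargs):
        # Common forward logic the base class is said to factor out:
        #   D(x) = c_skip * x + c_out * F(c_in * x, c_noise, condition, ...)
        sigma = self.sigma(t).reshape(-1, *([1] * (x.ndim - 1)))  # broadcast over (B, *)
        c_skip, c_out, c_in, c_noise = self.compute_coefficients(sigma)
        F_x = self.model(c_in * x, c_noise.flatten(), condition=condition, **model_kwargs)
        return c_skip * x + c_out * F_x
```

In the actual API, the dependency-injection pattern described above amounts to wrapping an existing `physicsnemo.Module` instance with one of the provided preconditioners (e.g. `EDMPreconditioner(my_module)`) and training or sampling against the wrapped callable.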
Checklist
Dependencies
Review Process
All PRs are reviewed by the PhysicsNeMo team before merging.
Depending on which files are changed, GitHub may automatically assign a maintainer for review.
We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.
AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.