
Conversation

@hyeok9855
Collaborator

  • I've read the .github/CONTRIBUTING.md file
  • My code follows the typing guidelines
  • I've added appropriate tests
  • I've run pre-commit hooks locally

Description

This PR relocates the conditions from elsewhere to inside the States. There are pros and cons to this design (see the sketch after the lists below).

Pros

  • Cleaner API, e.g., we can simply use env.reward(states) instead of env.reward(states, conditions)
  • Trajectory management becomes easier since conditions are handled in States

Cons

  • Redundancy: the shape of the condition tensor for a batch of trajectories is (max_length, batch_size, condition_dim), i.e., we now need to store max_length copies of the conditions (possibly increasing memory overhead)
  • States is now a bit messy
  • Less intuitive (personally), since conditions can exist without States
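
For concreteness, here is a minimal sketch of what this design could look like. All names here (ConditionedStates, ToyEnv, condition_dim, etc.) are illustrative assumptions, not the actual library API:

```python
import torch
from typing import Optional


class ConditionedStates:
    """Illustrative states container that carries its own conditioning tensor."""

    def __init__(self, tensor: torch.Tensor, conditions: Optional[torch.Tensor] = None):
        self.tensor = tensor            # e.g. (max_length, batch_size, state_dim)
        self.conditions = conditions    # e.g. (max_length, batch_size, condition_dim)

    @property
    def has_conditions(self) -> bool:
        return self.conditions is not None


class ToyEnv:
    """Illustrative env: reward no longer takes a separate `conditions` argument."""

    def reward(self, states: ConditionedStates) -> torch.Tensor:
        base = states.tensor.sum(-1)
        if states.has_conditions:
            # Conditioning is read directly from the states object.
            base = base * states.conditions.sum(-1)
        return base

    def log_reward(self, states: ConditionedStates) -> torch.Tensor:
        return torch.log(self.reward(states))


# The same condition is repeated along max_length, which is the redundancy
# mentioned in the cons above.
states = ConditionedStates(
    tensor=torch.rand(10, 4, 3),                      # (max_length, batch_size, state_dim)
    conditions=torch.rand(1, 4, 2).expand(10, 4, 2),  # condition broadcast over max_length
)
print(ToyEnv().reward(states).shape)  # torch.Size([10, 4])
```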

@hyeok9855 hyeok9855 self-assigned this Dec 5, 2025
Collaborator

@josephdviviano josephdviviano left a comment


Hey @hyeok9855

I actually like this. I'm curious to hear @younik 's feedback on this design.

It does not seem to influence the make_States_class factory, which was my main concern. Conditions live inside states, which keeps the APIs relatively clean, and conceptually having them live together makes sense to me.

What I don't love is having a distinct ConditionalEnv, as in the parent PR, because I still think the Env definition is the most confusing element of our library and I'd like to reduce that complexity as much as possible; that said, I think it would be worth a longer discussion.

Want to break it down on Slack or schedule a call?

A tensor of shape (batch_size,) containing the log rewards.
A tensor of shape (batch_size,) containing the rewards.
"""
return torch.log(self.reward(states, conditions))
Collaborator


Why can't we leave this in as a default? And why not log_reward like everything else?

Collaborator Author


This reward method can actually be removed, since it is exactly the same as the one in the parent class. I will remove it.
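
For context, the parent-class default presumably looks roughly like this (an illustrative sketch, not the exact library code), which is why the conditional override adds nothing once conditions live inside the states:

```python
import torch


class Env:
    """Illustrative parent class: log_reward already defaults to log(reward)."""

    def reward(self, states) -> torch.Tensor:
        raise NotImplementedError

    def log_reward(self, states) -> torch.Tensor:
        # Default implementation; a subclass that computes its reward from
        # conditions stored on `states` can simply inherit this.
        return torch.log(self.reward(states))
```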

# LogF is potentially a conditional computation.
if transitions.conditions is not None:
if transitions.states.has_conditions:
assert transitions.states.conditions is not None
Collaborator


This kind of thing (the `is not None` check), by the way, will break torch.compile. No action needed; I'll look at this in a different PR.
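
For illustration, one common way to work around this kind of Optional check under torch.compile is to keep the `is not None` branch in eager Python and compile only the tensor-level paths. This is only a hedged sketch with hypothetical function names, not a proposal for this PR:

```python
import torch
from typing import Optional


def _logF_conditional(states_tensor: torch.Tensor, conditions: torch.Tensor) -> torch.Tensor:
    # Purely tensor code: straightforward to compile.
    return (states_tensor * conditions.sum(-1, keepdim=True)).sum(-1)


def _logF_unconditional(states_tensor: torch.Tensor) -> torch.Tensor:
    return states_tensor.sum(-1)


compiled_cond = torch.compile(_logF_conditional)
compiled_uncond = torch.compile(_logF_unconditional)


def logF(states_tensor: torch.Tensor, conditions: Optional[torch.Tensor]) -> torch.Tensor:
    # The `is not None` check stays in eager Python, outside the compiled graphs.
    if conditions is not None:
        return compiled_cond(states_tensor, conditions)
    return compiled_uncond(states_tensor)


print(logF(torch.rand(4, 3), torch.rand(4, 2)).shape)  # torch.Size([4])
print(logF(torch.rand(4, 3), None).shape)              # torch.Size([4])
```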

@younik
Collaborator

younik commented Dec 12, 2025

I actually like this. I'm curious to hear @younik 's feedback on this design.

I also prefer this design; it looks cleaner, even if, as far as I understand, the conditioning cannot change during an episode.

@hyeok9855 hyeok9855 merged commit d000cbd into refactor-conditions Dec 12, 2025
3 checks passed
