Multinode experiments: improve wandb viz, fix n_modes computation, increase scoring function complexity #447

younik · 2025-12-10T15:54:17Z

I've read the .github/CONTRIBUTING.md file
My code follows the typing guidelines
I've added appropriate tests
I've run pre-commit hooks locally

Description

This is the latest code I am using for my experiments (https://wandb.ai/torchgfn/torchgfn/reports/Scaling-number-of-policies-total-batch-size-1024--VmlldzoxNTMzMjI1OA)

It includes the new scoring function work (from @josephdviviano), the fix for the number of modes, and improved visualization for wandb (using grouping).

josephdviviano

looks good - thank you!

josephdviviano · 2025-12-11T17:19:52Z

src/gfn/containers/replay_buffer.py

+        dist.recv(score_tensor, src=self.remote_manager_rank)
+        score_dict = Message.deserialize(score_tensor).message_data
+
+        return score_dict


why is this a dict?

josephdviviano · 2025-12-11T17:21:03Z

tutorials/examples/multinode/install_multinode_dependencies

-              'pytorch==2.2' cpuonly \
-              mkl -y
+
+conda install gcc_linux-64 gxx_linux-64 gfortran_linux-64 impi_rt impi-devel intel-opencl-rt intel-openmp mpi4py oneccl_bind_pt oneccl intel-extension-for-pytorch


thanks - i need to finally clean this up.

josephdviviano and others added 15 commits October 30, 2025 18:23

sync changes to cluster

3307fff

sync to mila

dbace9d

changes to sync for agent

cde64bb

simple prototype (need to add FAISS)

5f4033b

getting ready for sync

418f49f

merge

32df6c4

sketch of scroring is working

e78e363

remove perf tracker

1cd9cf6

fix score for single node

2766082

improve logging

b41878b

changes for train hypergrid

6b5d57c

Merge branch 'master' into multinode_experiments

6063f24

pre-commit

aedb7d8

fix pre-commit

2087ff0

fix isort

3ccb8db

younik requested a review from josephdviviano December 10, 2025 16:32

josephdviviano approved these changes Dec 11, 2025

View reviewed changes

josephdviviano merged commit 5ca6408 into master Dec 11, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Multinode experiments: improve wandb viz, fix n_modes computation, increase scoring function complexity #447

Multinode experiments: improve wandb viz, fix n_modes computation, increase scoring function complexity #447

younik commented Dec 10, 2025 •

edited

Loading

Uh oh!

josephdviviano left a comment

Uh oh!

josephdviviano Dec 11, 2025

Uh oh!

josephdviviano Dec 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Multinode experiments: improve wandb viz, fix n_modes computation, increase scoring function complexity #447

Multinode experiments: improve wandb viz, fix n_modes computation, increase scoring function complexity #447

Conversation

younik commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

josephdviviano left a comment

Choose a reason for hiding this comment

Uh oh!

josephdviviano Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

josephdviviano Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

younik commented Dec 10, 2025 •

edited

Loading