Add GRPOConfig for Arbor #8882

zhassan223 · 2025-10-01T14:19:50Z

new data class for GRPO train_kwargs instead of dictionary. Makes it cleaner and has in-data-class argument checks.

…or only

zhassan223 · 2025-10-01T14:21:38Z

@Ziems

Ziems

Can you update the tutorials as well and test that they still work?

Those are in docs/docs/tutorials/rl_multihop and docs/docs/tutorials/rl_papillon

Ziems · 2025-10-01T14:31:10Z

dspy/teleprompt/arbor_grpo/arbor_grpo_config.py

@@ -0,0 +1,138 @@
+from dataclasses import dataclass, field


Is it possible for us to make a telepromp/grpo/ directory where we can put grpo.py and grpo_config.py? The GEPA optimizer does this so I think there is already a precedent allowing us to do this.

Ziems · 2025-10-01T14:34:53Z

@zhassan223 Great work! Requested some small changes but wonderful progress!

…ted GRPOConfig for rl multihop notebook

…d parameters

docs/docs/tutorials/rl_multihop/index.ipynb

Ziems · 2025-10-02T02:44:29Z

docs/docs/tutorials/rl_papillon/index.ipynb

    "    multitask=True,\n",
    "    num_dspy_examples_per_grpo_step=4,\n",
-    "    num_samples_per_input=8,\n",
+    "    num_rollouts_per_grpo_step=8,#changed from num_generations since that parameter doesn't exist anymore\n",


Good catch!

docs/docs/tutorials/rl_papillon/index.ipynb

… backwards compatibility to only use GRPOConfig dataclass

docs/docs/tutorials/rl_multihop/index.ipynb

docs/docs/tutorials/rl_papillon/index.ipynb

dspy/teleprompt/grpo/grpo.py

Ziems

Just a few more things!

Ziems · 2025-10-06T20:33:38Z

@zhassan223 it looks like a few tests failed too, could you take a look?

…arnings

…cordingly

made new GRPO data class for train_kwargs, integrated to lm_local_arb…

733d891

…or only

Ziems requested changes Oct 1, 2025

View reviewed changes

zhassan223 added 2 commits October 1, 2025 22:16

refactored grpo into own file and added tests to lm local arbor, upda…

fbfd080

…ted GRPOConfig for rl multihop notebook

fixed rl_papillon implementation to use grpoConfig and fixed outdate…

ba0332f

…d parameters

Ziems reviewed Oct 2, 2025

View reviewed changes

docs/docs/tutorials/rl_multihop/index.ipynb Outdated Show resolved Hide resolved

Ziems reviewed Oct 2, 2025

View reviewed changes

docs/docs/tutorials/rl_papillon/index.ipynb Outdated Show resolved Hide resolved

zhassan223 and others added 2 commits October 2, 2025 16:59

reconfigured arguments for GRPO to use config argument and phased out…

50fedce

… backwards compatibility to only use GRPOConfig dataclass

Merge branch 'main' into main

78093d2

Ziems reviewed Oct 6, 2025

View reviewed changes

docs/docs/tutorials/rl_multihop/index.ipynb Outdated Show resolved Hide resolved

Ziems reviewed Oct 6, 2025

View reviewed changes

docs/docs/tutorials/rl_papillon/index.ipynb Outdated Show resolved Hide resolved

Ziems reviewed Oct 6, 2025

View reviewed changes

dspy/teleprompt/grpo/grpo.py Outdated Show resolved Hide resolved

Ziems requested changes Oct 6, 2025

View reviewed changes

Ziems changed the title ~~made new GRPO data class for train_kwargs, integrated to lm_local_arb…~~ Replace GRPO kwargs with dataclass Oct 6, 2025

removed comments, restructured grpo dictionary function, fixed ruff w…

8755d88

…arnings

zhassan223 force-pushed the main branch from 48d3a96 to 8755d88 Compare October 7, 2025 02:51

renamed grpoConfig -> arborGRPOConfig and adjusted files and tests ac…

059c785

…cordingly

zhassan223 changed the title ~~Replace GRPO kwargs with dataclass~~ Add GRPOConfig for Arbor Oct 12, 2025

zhassan223 closed this Oct 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add GRPOConfig for Arbor #8882

Add GRPOConfig for Arbor #8882

Uh oh!

zhassan223 commented Oct 1, 2025

Uh oh!

zhassan223 commented Oct 1, 2025

Uh oh!

Ziems left a comment

Uh oh!

Ziems Oct 1, 2025

Uh oh!

Ziems commented Oct 1, 2025

Uh oh!

Uh oh!

Ziems Oct 2, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ziems left a comment

Uh oh!

Ziems commented Oct 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add GRPOConfig for Arbor #8882

Add GRPOConfig for Arbor #8882

Uh oh!

Conversation

zhassan223 commented Oct 1, 2025

Uh oh!

zhassan223 commented Oct 1, 2025

Uh oh!

Ziems left a comment

Choose a reason for hiding this comment

Uh oh!

Ziems Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Ziems commented Oct 1, 2025

Uh oh!

Uh oh!

Ziems Oct 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Ziems left a comment

Choose a reason for hiding this comment

Uh oh!

Ziems commented Oct 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants