[RLlib] Create resource bundle per learner #59620
Conversation
Signed-off-by: Kamil Kaczmarek <[email protected]>
Code Review
This pull request refactors the creation of resource bundles for learners in RLlib. Instead of creating a single large bundle for all learners, it now creates a separate bundle for each learner. This is a good change that allows for more flexible scheduling of learners across a cluster. The removal of the extension point for _get_learner_bundles also simplifies the code.
My review includes one suggestion to apply the same bundling logic to aggregator actors when no remote learners are used, for consistency.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Kamil Kaczmarek <[email protected]>
simonsays1980
left a comment
Thanks for taking ownership here @kamil-kaczmarek! The solution you propose looks good. There is one remaining question to be answered.
```python
{
    "CPU": num_cpus_per_learner + config.num_aggregator_actors_per_learner,
    "GPU": config.num_gpus_per_learner,
}
for _ in range((config.num_learners))
```
This looks more correct than before, but I am wondering whether this still ensures that AggregatorActors are scheduled on the same node as their learner, since we do not use placement groups. Could an EnvRunner theoretically be scheduled onto CPUs of that node instead of an AggregatorActor?
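The co-location concern above could in principle be addressed with a Ray placement group. A minimal sketch (hypothetical, not part of this PR) that builds the same per-learner bundle shapes as the diff and shows how they would be handed to a placement group; all variable names here are illustrative assumptions:

```python
# Hypothetical sketch: reserving each learner's CPUs/GPUs together with the
# CPUs of its aggregator actors, so other actors (e.g. EnvRunners) cannot
# claim them. Values below are illustrative, not from the PR.
num_learners = 2
num_cpus_per_learner = 1
num_gpus_per_learner = 1
num_aggregator_actors_per_learner = 2

# One bundle per learner, mirroring the diff above.
bundles = [
    {
        "CPU": num_cpus_per_learner + num_aggregator_actors_per_learner,
        "GPU": num_gpus_per_learner,
    }
    for _ in range(num_learners)
]

# With Ray installed, these bundles could be reserved as a group, e.g.:
#   from ray.util.placement_group import placement_group
#   pg = placement_group(bundles, strategy="PACK")
# "PACK" asks Ray to place the bundles on as few nodes as possible; each
# learner and its aggregator actors would then share one reserved bundle.
print(bundles)
```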
Signed-off-by: Mark Towers <[email protected]>
… for spotting) Signed-off-by: Mark Towers <[email protected]>
simonsays1980
left a comment
LGTM. Thanks for this important change @kamil-kaczmarek !
Description
Create a resource bundle for each learner instead of packing all learners into a single bundle.
Related to #51017
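The per-learner bundling described above can be sketched as follows. This is a simplified illustration of the idea, not the actual RLlib implementation; the function name and standalone parameters are assumptions for the example:

```python
def learner_bundles(
    num_learners: int,
    num_cpus_per_learner: int,
    num_gpus_per_learner: int,
    num_aggregator_actors_per_learner: int,
) -> list[dict]:
    """Build one resource bundle per learner (hypothetical helper).

    Previously, a single large bundle covered all learners, which forced
    them onto one node. Emitting one bundle per learner lets the scheduler
    spread learners across the cluster.
    """
    return [
        {
            # Each learner's CPUs plus one CPU per aggregator actor.
            "CPU": num_cpus_per_learner + num_aggregator_actors_per_learner,
            "GPU": num_gpus_per_learner,
        }
        for _ in range(num_learners)
    ]


# Usage example with illustrative values:
bundles = learner_bundles(
    num_learners=4,
    num_cpus_per_learner=2,
    num_gpus_per_learner=1,
    num_aggregator_actors_per_learner=2,
)
print(bundles)  # four identical bundles, each {"CPU": 4, "GPU": 1}
```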