MultithreadedEnvs and Trajectories #920

HenriDeh · 2023-07-06T09:03:44Z

I'm creating this issue to pin down that MultiThreadedEnvs do not currently work.

It steps each inner env in parallel and stores the traces, but there's nothing that can be done with that currently as we would need to store each separate trajectory (in the seminal sense of "trajectory") contiguously in the replay buffer. Right now with the caching implementation, they can only be mixed up in the storage.

Options I see to address that

extend trajectories to have N (number of parallel envs) containers, each will pertain to one env and have a capacity of 1/Nth the total capacity of the trajectory.
partition the container into N partitions.
Create a small Trajectory for each env and append to the main Trajectory at the end of an episode.

I clearly prefer the first option, it is straightfoward and probably the most efficient.

Mytolo · 2023-07-10T08:48:10Z

In my opinion, in the first option, each container should have the full capacity? The MultiThreadEnv is simply to speed up experience gathering, so each container should behave as it would be a single environment. User perspective would be to define how much experience should be stored for one environment. That way, there are also no worries about rounding / ceiling for the capacity, since it has to be a whole number.

jeremiahpslewis mentioned this issue Mar 27, 2024

ReinforcementLearning.jl v0.12 #1061

Open

4 tasks

jeremiahpslewis added the v0.12 label Mar 27, 2024

jeremiahpslewis added this to the v0.12 milestone Mar 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MultithreadedEnvs and Trajectories #920

MultithreadedEnvs and Trajectories #920

HenriDeh commented Jul 6, 2023

Mytolo commented Jul 10, 2023

MultithreadedEnvs and Trajectories #920

MultithreadedEnvs and Trajectories #920

Comments

HenriDeh commented Jul 6, 2023

Mytolo commented Jul 10, 2023