You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I get a reward of 0.0 no matter how long i train or the algorithm i use.So far i have tried mappo and VDN, when i train on mpe environment , there are changes in the reward.
Please i would appreciate any ideas or suggestions.
The text was updated successfully, but these errors were encountered:
promiseve
changed the title
Zero reward in Overcooked environment regardless of algorithm/lenght of training
Zero reward in Overcooked environment regardless of algorithm/length of training
May 31, 2024
Hi,
i added the overcooked env as described here:https://marllib.readthedocs.io/en/latest/handbook/env.html#id64.
However, once i running the script as described in https://marllib.readthedocs.io/en/latest/handbook/quick_start.html#id11 but will modification for the overcooked environment.
script:
`from marllib import marl
prepare env
env = marl.make_env(environment_name="overcooked", map_name="asymmetric_advantages")
initialize algorithm with appointed hyper-parameters
vdn = marl.algos.vdn(hyperparam_source="common")
build agent model based on env + algorithms + user preference
model = marl.build_model(env, vdn, {"core_arch": "mlp", "encode_layer": "128-256"})
start training
vdn.fit(env, model, stop={"timesteps_total": 1000000}, checkpoint_freq=100, share_policy="group", checkpoint_end= True)
#render
#mappo.render(env, model, local_mode =True)`
I get a reward of 0.0 no matter how long i train or the algorithm i use.So far i have tried mappo and VDN, when i train on mpe environment , there are changes in the reward.
Please i would appreciate any ideas or suggestions.
The text was updated successfully, but these errors were encountered: