You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In PPO_Discrete each reward is multiplied by 0.01 and in PPO_Continuous reward is also modified. I don't understand why do these modification, what does these modification do?
The text was updated successfully, but these errors were encountered:
DeepRL-TensorFlow2/PPO/PPO_Discrete.py
Lines 151 to 154 in 876266d
DeepRL-TensorFlow2/PPO/PPO_Continuous.py
Lines 167 to 170 in 876266d
In
PPO_Discrete
each reward is multiplied by0.01
and inPPO_Continuous
reward is also modified. I don't understand why do these modification, what does these modification do?The text was updated successfully, but these errors were encountered: