[Bug]: DDPG seems unable to solve the MountainCarContinuous-v0 problem. #482
Labels
check the checklist
You have checked the required items in the checklist but you didn't do what is written...
🐛 Bug
import gymnasium as gym
from stable_baselines3 import DDPG

# Train DDPG with default settings on the continuous Mountain Car task.
env = gym.make("MountainCarContinuous-v0", render_mode="rgb_array")
model = DDPG("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=100_000)

# Roll out the trained policy; the VecEnv resets itself when an episode ends.
vec_env = model.get_env()
obs = vec_env.reset()
for i in range(1000):
    action, _state = model.predict(obs, deterministic=True)
    obs, reward, done, info = vec_env.step(action)
    vec_env.render("human")
The average episode reward never seems to exceed 0.
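For context on why the return might stall at or below 0, here is a rough, library-free sketch of the reward rule as I understand it from the Gymnasium documentation: each step costs 0.1 * action**2, and reaching the goal adds +100. The helper names (`step_reward`, `episode_return`) and the three episodes are hypothetical and only illustrate the arithmetic; they are not taken from the run above.

```python
# Sketch of the MountainCarContinuous-v0 reward rule (an assumption based on
# the Gymnasium docs): -0.1 * action**2 per step, +100 on reaching the goal.

def step_reward(action: float, reached_goal: bool) -> float:
    """Reward for a single step under the assumed rule."""
    reward = 100.0 if reached_goal else 0.0
    reward -= 0.1 * action ** 2
    return reward


def episode_return(actions, goal_step=None):
    """Sum step rewards; goal_step is the index of the step that reaches the goal."""
    total = 0.0
    for t, a in enumerate(actions):
        reached = goal_step is not None and t == goal_step
        total += step_reward(a, reached)
        if reached:
            break  # the episode ends once the goal is reached
    return total


# Hypothetical episodes; 999 steps is the environment's default time limit.
idle = episode_return([0.0] * 999)                   # never moves, no goal
full_throttle = episode_return([1.0] * 999)          # pushes hard, no goal
solved = episode_return([1.0] * 150, goal_step=149)  # goal on the 150th step

print(idle, full_throttle, solved)
```

If this reading of the reward is right, any policy that never reaches the flag is capped at a return of 0, and the cheapest way to get there is to output zero actions. That would be consistent with the agent settling on doing nothing rather than exploring far enough to find the +100.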
To Reproduce
Relevant log output / Error message
No response
System Info
No response
Checklist