Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
-
Updated
Dec 31, 2020 - Python
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
This program is to solve the FrozenLake8x8 with the MC control method.
Add a description, image, and links to the frozenlake-not-slippery topic page so that developers can more easily learn about it.
To associate your repository with the frozenlake-not-slippery topic, visit your repo's landing page and select "manage topics."