QLearning-and-Sarsa-for-Cliff-Walking

Reinforcement learning project.

Environment

The environment is Cliff Walking, the detailed information can be read in [A3.pdf].

The experiment shows that Sarsa method tends to choose a safer path while Q-learning tends to choose the optimal path.

Just run Qlearning.py or Sarsa.py. And you can get plotted figure if you modify python file a bit.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
code		code
A3.pdf		A3.pdf
README.md		README.md
report.pdf		report.pdf