Developed model-free controller using Q-learning to solve the inverted pendulum problem.
Simulation of simple pendulum
The Q-learning algorithm is implemented with a table, the dimension of the Q-table will be of dimension
Q-table
The results in the form of graphs are visualized below.
The learning curves show the TD-error and instantaneous cost during the learning process.
The policy and value function learnt are vizualized to give a better of the learning process.
To read a detailed report, click HERE