GitHub - zhaleh-rahimi/QLearning-game: Nim Implementation with RL

Game Nim

The game of Nim(N, m, k) involves m players who begin with a pile of N stones and take turns removing stones from the pile. Each player must remove between 1 and k stones on their turn. The objective is to avoid taking the last stone. We will look at the case N = 51, m = 2, and k = 3. The state space is small and finite: N + 1 states, one for each pile size from 0 to N. The action space contains only k actions, {1, 2, 3}, where each action is the number of stones removed from the pile on a turn.
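To make the rules concrete, here is a minimal Python sketch (not the repository's MATLAB code) that works out, for the two-player case only, which pile sizes are lost for the player to move under perfect play. The function name `losing_positions` is hypothetical and introduced only for illustration.

```python
def losing_positions(n_stones: int = 51, k: int = 3) -> list[int]:
    """Pile sizes at which the player to move loses under perfect play (m = 2)."""
    # win[s] is True if the player to move with s stones left can force a win.
    win = [False] * (n_stones + 1)
    # With 0 stones left, the opponent has just taken the last stone and lost.
    win[0] = True
    for s in range(1, n_stones + 1):
        # A move is winning if it leaves the opponent in a losing position.
        win[s] = any(not win[s - a] for a in range(1, min(k, s) + 1))
    return [s for s in range(1, n_stones + 1) if not win[s]]
```

For k = 3 the losing pile sizes are those with a remainder of 1 when divided by 4 (1, 5, 9, ..., 49), so with N = 51 the player who moves first can force a win by taking 2 stones and leaving 49.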

Configuration:

I trained the Q-learner against a random player for 1e4 iterations, with the following parameters:

N = 51; k = 3; m = 2; discount = 0.99; alpha = 1/sqrt(n+2); epsilon = 1 - (1/log(n+2)), where n is the iteration number.
The Q-learner always starts the game.
I then tested the Q-learner against a random player over 1e3 games.

It wins around 99% of the time.
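The setup above can be sketched in Python as tabular Q-learning against a random opponent. This is an illustrative sketch, not the repository's MATLAB implementation, and it rests on several assumptions that the README does not state: the reward is +1 when the opponent takes the last stone and -1 when the learner does; n counts games rather than individual moves; and 1/log(n+2), clipped to [0, 1], is used as the probability of a random exploratory move. All function names are hypothetical.

```python
import math
import random

N, K, GAMMA = 51, 3, 0.99  # values taken from the configuration above


def legal_actions(stones):
    """Actions are the number of stones removed: 1..K, capped by the pile size."""
    return range(1, min(K, stones) + 1)


def greedy_action(q, stones):
    """Legal action with the largest Q-value (ties go to the smaller action)."""
    return max(legal_actions(stones), key=lambda a: q[stones][a])


def play_episode(q, rng, alpha=0.0, explore=0.0):
    """Play one game with the learner moving first; return True if it wins.

    With alpha > 0 the Q-table is updated in place after every learner move.
    """
    stones = N
    while True:
        if rng.random() < explore:
            action = rng.choice(list(legal_actions(stones)))
        else:
            action = greedy_action(q, stones)
        after_learner = stones - action
        if after_learner == 0:
            # ASSUMPTION: the learner took the last stone, reward -1.
            reward, next_stones, done = -1.0, 0, True
        else:
            opp_action = rng.choice(list(legal_actions(after_learner)))
            after_opponent = after_learner - opp_action
            if after_opponent == 0:
                # ASSUMPTION: the opponent took the last stone, reward +1.
                reward, next_stones, done = 1.0, 0, True
            else:
                reward, next_stones, done = 0.0, after_opponent, False
        if alpha > 0.0:
            best_next = 0.0 if done else max(
                q[next_stones][a] for a in legal_actions(next_stones))
            q[stones][action] += alpha * (
                reward + GAMMA * best_next - q[stones][action])
        if done:
            return reward > 0
        stones = next_stones


def train(episodes=10_000, seed=0):
    rng = random.Random(seed)
    # q[s][a] for s in 0..N and a in 1..K (index 0 of each row is unused).
    q = [[0.0] * (K + 1) for _ in range(N + 1)]
    for n in range(episodes):
        alpha = 1.0 / math.sqrt(n + 2)
        # ASSUMPTION: 1/log(n+2) is the exploration probability, clipped to 1.
        explore = min(1.0, 1.0 / math.log(n + 2))
        play_episode(q, rng, alpha=alpha, explore=explore)
    return q


def win_rate(q, games=1_000, seed=1):
    """Fraction of games the greedy learner wins against a random player."""
    rng = random.Random(seed)
    return sum(play_episode(q, rng) for _ in range(games)) / games
```

Calling `win_rate(train())` gives the learner's win fraction against the random player. The value depends on the seed and on the assumptions listed above, so it need not match the figure reported here.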

Repository files (12 commits): LICENSE, README.md, main_Nim.m, mdp_QL.m
