AlphaDDA

AlphaDDA is an AlphaZero-based game AI with dynamic difficulty adjustment. It consists of MCTS and a deep neural network (DNN) like AlphaZero. It changes its skill according to the state's value estimated by the DNN.'a I propose three types of AlphaDDA.

AlphaDDA1: It changes the number of simulations in MCTS according to the value.
AlphaDDA2: It changes the dropout probability according to the value. In AlphaDDA2, the DNN used by MCTS is damaged by the dropout and outputs the inaccurate value.
AlphaDDA3: It applies a new UCT score. The UCT score is made based on the two assumptions: Its opponent continues to make a board state with the same value as the current board state. It makes a board state with the inverse value of the current board state.

In this study, AlphaDDAs play Connect4, 6x6 Othello, and Othello with the AI players. 6x6 Othello is Othello using a 6x6 board. The weights of AlphaDDAs are the same as trained AlphaZero. The codes of AlphaZero used in this study are opened in the "AlphaZero directory". The details of AlphaDDA are denoted in Fujita 2022.

References

Kazuhisa Fujita (2022) AlphaDDA: Strategies for Adjusting the Playing Strength of a Fully Trained AlphaZero System to a Suitable Human Training Partner. PeerJ Computer Science, 8, e1123.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
AlphaDDA1		AlphaDDA1
AlphaDDA2		AlphaDDA2
AlphaDDA3		AlphaDDA3
AlphaZero		AlphaZero
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlphaDDA

References

About

Releases

Packages

Languages

License

KazuhisaFujita/AlphaDDA

Folders and files

Latest commit

History

Repository files navigation

AlphaDDA

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages