An implementation of AlphaZero for the board game Tak. See also https://github.com/ViliamVadocz/tak
The repository contains several libraries and binaries:
- `takzero` is the main library which implements MCTS and the neural networks
- `selfplay` is used during training to generate replays and exploitation targets
- `reanalyze` computes fresh targets from old replays
- `learn` takes targets from `selfplay` and `reanalyze` to train new models
- `evaluation` pits models against each other
- `puzzle` runs the puzzle benchmark
- `analysis` includes interactive game analysis
- `graph` computes the ratio of unique states seen throughout training
- `tei` is a TEI implementation
- `eee` is a collection of binaries to run Epistemic uncertainty Estimation Experiments (EEE)
  - `generalization` trains a hash-based uncertainty estimator
  - `rnd` is the same as `generalization`, but specifically for RND
  - `seen_ratio` analyzes the ratio of seen states according to a filled hash-set
  - `ensemble` trains an ensemble network
  - `utils` contains utility functions for running experiments
  - `visualize_search` creates a visualization of the search tree used by an agent
  - `visualize_replay_buffer` creates a visualization of the overlap of different replay buffers, as well as the number of seen states at different depths
- `python` contains miscellaneous Python scripts
  - `action_space` computes the action space for different board sizes
  - `analyze_search` analyzes search data to figure out which bandit algorithm optimizes best for exploration
  - `elo` computes Bayesian Elo from match results (from `evaluation`) and creates a graph
  - `extract_from_logs` graphs various data from logs
  - `concat_out` concatenates log output
  - `generate_openings` generates random opening positions (for example to use as an opening book for a tournament)
  - `get_match_results` extracts match results from evaluation logs
  - `improved_policy` compares different improved policy formulas
  - `novelty_per_depth` plots the novelty per depth
  - `plot_eee` plots the results of EEE
  - `plot_elo_data` plots the Elo data
  - `replay_buffer_uniqueness` plots the replay buffer uniqueness
You will need the C++ PyTorch library (LibTorch). See `tch-rs` for installation instructions.
The versions listed below may no longer be available for download. In that case, try downloading the newest LibTorch release and update the `tch-rs` version in `Cargo.toml`.
You may also need to set `LIBTORCH_BYPASS_VERSION_CHECK` to `1`.
If you find a version that works, please let me know so I can add it here.
Worked:
- Stable (2.4.0), CUDA 12.4, Release
Did not work:
- TODO
Worked:
- TODO
Did not work:
- TODO
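
As a minimal sketch of the environment setup (assuming a Linux shell and that LibTorch was unzipped to `~/libtorch`; the paths are examples, adjust them for your system):

```bash
# Point tch-rs at the LibTorch installation (example path, not prescribed by this repo).
export LIBTORCH=~/libtorch
export LD_LIBRARY_PATH="$LIBTORCH/lib:$LD_LIBRARY_PATH"
# Only needed when the downloaded LibTorch is newer than what tch-rs expects.
export LIBTORCH_BYPASS_VERSION_CHECK=1
cargo build -r
```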
To generate the local novelty per depth graph, follow these steps:
- Edit `eee/src/seen_ratio.rs` with the path to a trained model, and adjust the imports based on whether it is a SimHash or LCGHash model.
- Run `cargo run -p eee -r --bin seen_ratio` for each agent.
- Take the output and place it into `python/novelty_per_depth.py`.
- Run `python python/novelty_per_depth.py` (a command sketch follows this list).
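
A minimal sketch of that loop, assuming two agents and redirecting each agent's output to a file (the file names are placeholders, not produced by the tool):

```bash
# Edit eee/src/seen_ratio.rs for the first agent (model path and hash imports), then:
cargo run -p eee -r --bin seen_ratio | tee seen_ratio_agent1.txt
# Re-edit eee/src/seen_ratio.rs for the second agent, then:
cargo run -p eee -r --bin seen_ratio | tee seen_ratio_agent2.txt
# Paste the collected output into python/novelty_per_depth.py and plot:
python python/novelty_per_depth.py
```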
To run the hash-based generalization experiment, follow these steps:
- Acquire a replay buffer by running an undirected agent. (See the Elo graph instructions.)
- Edit the import in `eee/src/generalization.rs` for the model that you want to test.
- Run `cargo run -p eee -r --bin generalization` for each agent, renaming the output file `eee_data.csv` for each.
- Edit `plot_eee.py` to plot hashes and run `python python/plot_eee.py` (see the command sketch below).
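
A sketch of the run-and-rename loop, assuming two hash models; the renamed file names are placeholders:

```bash
# Edit the import in eee/src/generalization.rs for the first model, then:
cargo run -p eee -r --bin generalization
mv eee_data.csv eee_data_simhash.csv
# Edit the import for the second model, then run and rename again:
cargo run -p eee -r --bin generalization
mv eee_data.csv eee_data_lcghash.csv
# Edit python/plot_eee.py so it plots the hash results, then:
python python/plot_eee.py
```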
To run the RND experiment, follow these steps:
- Acquire a replay buffer by running an undirected agent. (See the Elo graph instructions.)
- Run `cargo run -p eee -r --bin rnd`.
- Edit `plot_eee.py` to plot RND and run `python python/plot_eee.py` (commands sketched below).
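
The corresponding commands, using only the paths already given above:

```bash
cargo run -p eee -r --bin rnd
# Edit python/plot_eee.py so it plots the RND results, then:
python python/plot_eee.py
```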
To generate the Elo ratings for agents throughout training, follow these steps:
- Edit `selfplay/src/main.rs`, `reanalyze/src/main.rs`, and `learn/src/main.rs` for the agent and the value of beta that is desired.
- Compile using `cargo build -r -p selfplay -p reanalyze -p learn`. If exploration is desired, append `--features exploration` to the command.
- Deploy the agent on a cluster: 1 learn process, 10 selfplay processes, and 10 reanalyze processes.
- Once you have generated checkpoints for all agents, compile the evaluation using `cargo build -r -p evaluation`.
- Evaluate agents against each other by deploying evaluation processes.
- Extract the match results from the logs using `python/get_match_results.py`.
- Place the match results into `match_results/` and run `python python/elo.py` to plot the Elo.
- For an easier-to-edit plot, copy the BayesElo output from `elo.py` into `plot_elo_data.py` in the expected format. (A command sketch follows this list.)
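
A condensed sketch of the build and plotting commands (the cluster deployment itself is not shown, and how `get_match_results.py` locates the evaluation logs is an assumption; check the script):

```bash
# Build the training binaries (add --features exploration for the exploration variants):
cargo build -r -p selfplay -p reanalyze -p learn
# ...deploy 1 learn, 10 selfplay, and 10 reanalyze processes per agent on your cluster...

# Once checkpoints exist for all agents, build and deploy the evaluation:
cargo build -r -p evaluation

# After the evaluation runs finish:
python python/get_match_results.py   # extract match results from the evaluation logs
python python/elo.py                 # reads match_results/ and plots the Elo
```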
To generate the replay uniqueness graphs, follow these steps:
- Train agents using steps 1-3 from the Elo graph instructions.
- Edit `graph/main.rs` with paths to the replay files.
- Run `cargo run -r -p graph` and see the generated graph in `graph.html`.
- For an easier-to-edit plot, copy the output into `replay_buffer_uniqueness.py` and run with `python python/replay_buffer_uniqueness.py` (sketched below).
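
The corresponding commands, assuming `graph/main.rs` has already been edited with the replay-file paths:

```bash
cargo run -r -p graph
# Open graph.html in a browser to view the generated graph.
# For an easier-to-edit plot, paste the printed output into
# python/replay_buffer_uniqueness.py and run:
python python/replay_buffer_uniqueness.py
```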