Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents

Project page | HF Space | Paper

Scripts for interpreting planning in LeelaChessZero networks.

🔴 Not a stable codebase

Install & Run

This work relies on poetry to manage the dependencies. To install run (with additional demo group for running the demo):

poetry install

Then to run a particular script use:

poetry run python -m scripts.sae_training.train_contrastive

To run the demo you can use the following make shortcut:

make demo

Tooling

See the lczerolens library (still under development) for more agnostic tooling to interpret the Leela Networks.

Contribute

Feel free to open a discussion, an issue or a PR for any question or feedback.

Cite

If you find this work useful please consider citing the associated paper:

@misc{poupart2024contrastivesparseautoencodersinterpreting,
      title={Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents},
      author={Yoann Poupart},
      year={2024},
      eprint={2406.04028},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2406.04028},
}

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
assets		assets
lczero-planning-demo @ c1ecb0b		lczero-planning-demo @ c1ecb0b
results		results
scripts		scripts
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents

Install & Run

Tooling

Contribute

Cite

About

Releases

Packages

Languages

License

Xmaster6y/lczero-planning

Folders and files

Latest commit

History

Repository files navigation

Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents

Install & Run

Tooling

Contribute

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages