RL_DaVinciCode

A reinforcement learning model for the Da Vinci Code game

Play with the model!

Structure

flowchart TD
    A[Base Game Logic] --> B[Gymnasium Environment]
    B --> C[Model Trainer]
    C --> D[Trained Model]
    A --> E[Web Game Interface]
    D --> E

Base Game Logic

Base Game Logic implements the abstracted game classes and methods necessary for the game to function.

Gymnasium Environment

Gymnasium Environment wraps the game logic in a Gymnasium environment so that the model can be trained against it.

Model Trainer

Model Trainer is a PPO trainer adapted from rl_adventure2, with the following changes:

- Removed multi-environment parallel training to reduce complexity.
- Adapted the policy to a multi-discrete action space.
- Added a shared network ahead of the actor and critic networks to improve training efficiency and model performance.
- Added graphs (correct count, smoothed curves) for better monitoring.
- Adjusted hyperparameters to improve model performance.
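
As an illustration of the multi-discrete adaptation, one common approach is to give each action component its own categorical distribution, sample the components independently, and sum their log-probabilities to get the joint log-probability that PPO needs. The sketch below uses only the standard library. The two-component action (which tile, which number) and the names `sample_multi_discrete` and `joint_log_prob` are assumptions made for illustration and may not match the trainer in this repository.

```python
import math
import random


def softmax(logits):
    """Convert raw scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_multi_discrete(logits_per_head, rng):
    """Sample one index per head; return the action tuple and its joint log-probability."""
    action, log_prob = [], 0.0
    for logits in logits_per_head:
        probs = softmax(logits)
        choice = rng.choices(range(len(probs)), weights=probs, k=1)[0]
        action.append(choice)
        # Heads are treated as independent, so log-probabilities add up.
        log_prob += math.log(probs[choice])
    return tuple(action), log_prob


def joint_log_prob(logits_per_head, action):
    """Re-evaluate the log-probability of a stored action, as done in the PPO ratio."""
    return sum(math.log(softmax(l)[a]) for l, a in zip(logits_per_head, action))


# Hypothetical layout: head 0 picks one of 4 tile positions, head 1 picks a number 1-12.
rng = random.Random(0)
logits = [[0.1, 0.4, -0.2, 0.0], [0.0] * 12]
action, log_prob = sample_multi_discrete(logits, rng)
```

During a PPO update, `joint_log_prob` would be evaluated with the new policy's logits and compared against the stored `log_prob` to form the probability ratio.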

Trained Model

Trained Model contains the saved model object for further use in the web game interface.

Web Game Interface

Web Game Interface utilizes the game logic and the Streamlit library to create a web interface for human players. The model is loaded to play against human players. You can try it out on the Streamlit app deployment.

Da Vinci Code Game Rules

Objective

Expose all of your opponents' secret codes before your own is fully revealed.

Game Setup

Tile Arrangement: 24 numbered tiles divided into two sets:
- Dark tiles: 12 tiles numbered 1-12
- Light tiles: 12 tiles numbered 1-12
Drawing Tiles:
- Each player draws 4 tiles at random and hides the numbers.
Sorting Tiles:
- Each player sorts their tiles in numerical order from left to right (lowest to highest). For two tiles with the same number, the dark tile is placed to the left of the light tile.
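
The setup steps above can be sketched in a few lines of Python. This is a minimal illustration rather than the code in `game.py`: the `Tile` class and the `build_deck`, `sort_key`, and `deal` helpers are hypothetical names introduced here.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Tile:
    number: int            # 1-12
    color: str             # "dark" or "light"
    revealed: bool = False


def sort_key(tile):
    # Lower numbers first; on a tie, the dark tile sits left of the light tile.
    return (tile.number, 0 if tile.color == "dark" else 1)


def build_deck():
    """Create the 24 tiles: 12 dark and 12 light, each numbered 1-12."""
    return [Tile(n, c) for c in ("dark", "light") for n in range(1, 13)]


def deal(num_players, hand_size=4, seed=None):
    """Shuffle the deck, give each player a sorted hand, and return the remaining pool."""
    rng = random.Random(seed)
    deck = build_deck()
    rng.shuffle(deck)
    hands = []
    for _ in range(num_players):
        hand = [deck.pop() for _ in range(hand_size)]
        hands.append(sorted(hand, key=sort_key))
    return hands, deck


hands, pool = deal(num_players=2, seed=0)
```

With two players, this leaves 16 tiles in the pool to draw from during play.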

Play

Drawing a Tile: On your turn, draw one of the remaining tiles and keep it hidden from other players.
Making a Guess:
- Choose an opponent and guess the number of one of their tiles.
- Correct Guess: If you are correct, the opponent will reveal the tile.
- Incorrect Guess: If you are wrong, the tile you drew will be revealed and placed in its correct position. This gives your opponents clues about your hidden tiles.
Continuing Your Turn: If your first guess is correct, you may either:
- Guess another of your opponents' tiles.
- End your turn, in which case the tile you drew will be placed in its correct position without revealing it. Your secret code is now one tile longer.
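
A turn as described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the repository's implementation: `Tile`, `resolve_guess`, and `end_turn` are hypothetical names, and a guess is identified by a position in the opponent's hand plus a number.

```python
from dataclasses import dataclass


@dataclass
class Tile:
    number: int
    color: str             # "dark" or "light"
    revealed: bool = False


def sort_key(tile):
    # Lower numbers first; on a tie, dark sits left of light.
    return (tile.number, 0 if tile.color == "dark" else 1)


def resolve_guess(target_hand, position, guessed_number):
    """Reveal the targeted tile if the guess is correct; return whether it was correct."""
    tile = target_hand[position]
    if tile.revealed:
        raise ValueError("tile is already revealed")
    correct = tile.number == guessed_number
    if correct:
        tile.revealed = True
    return correct


def end_turn(own_hand, drawn_tile, last_guess_correct):
    """Place the drawn tile in sorted position: hidden after a correct guess, revealed after a wrong one."""
    drawn_tile.revealed = not last_guess_correct
    own_hand.append(drawn_tile)
    own_hand.sort(key=sort_key)


me = [Tile(2, "dark"), Tile(9, "light")]
opponent = [Tile(4, "light"), Tile(7, "dark")]

# Correct guess, then end the turn: the drawn tile joins the hand face down.
first = resolve_guess(opponent, position=1, guessed_number=7)
end_turn(me, Tile(5, "dark"), first)

# Wrong guess on a later turn: the drawn tile joins the hand face up.
second = resolve_guess(opponent, position=0, guessed_number=3)
end_turn(me, Tile(9, "dark"), second)
```

A player who guesses correctly and chooses to continue would simply call `resolve_guess` again before calling `end_turn` with the result of the final guess.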

Next Turn / Winning

Play proceeds in turns until only one player still has unrevealed tiles. That player is the winner.
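
The end-of-game check can be expressed compactly. In this sketch each hand is a list of `(number, revealed)` pairs, and `find_winner` is a hypothetical helper rather than a function from this repository.

```python
def has_hidden_tiles(hand):
    """True if at least one tile in the hand is still unrevealed."""
    return any(not revealed for _, revealed in hand)


def find_winner(hands):
    """Return the index of the only player with unrevealed tiles, or None if the game goes on."""
    alive = [i for i, hand in enumerate(hands) if has_hidden_tiles(hand)]
    return alive[0] if len(alive) == 1 else None


hands = [
    [(3, True), (8, True)],     # fully revealed
    [(1, True), (6, False)],    # still has a hidden tile
    [(2, True), (11, True)],    # fully revealed
]
winner = find_winner(hands)
```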
