# Regret Matching for Nash Equilibrium Computation via Repeated Self-Play
This is a simple implementation of the regret matching algorithm for computing approximate Nash equilibria of two-player zero-sum games via repeated self-play. The game of Rock-Paper-Scissors is used as the running example.
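For intuition, here is a minimal, self-contained sketch of the core update. The `RegretMatcher` class and `PAYOFF` table below are illustrative only, not this package's API: each round, a player accumulates the regret of every action (what it would have earned minus what it actually earned) and then mixes over actions in proportion to their positive cumulative regret.

```python
import numpy as np

# Row player's payoff for (my action, opponent action); order: Rock, Paper, Scissors.
PAYOFF = np.array([[ 0, -1,  1],
                   [ 1,  0, -1],
                   [-1,  1,  0]])

class RegretMatcher:
    """Illustrative regret-matching player (not this package's RPSPlayer API)."""

    def __init__(self, n_actions=3):
        self.regret_sum = np.zeros(n_actions)
        self.strategy_sum = np.zeros(n_actions)

    def strategy(self):
        # Mix actions in proportion to positive cumulative regret;
        # fall back to uniform when no action has positive regret.
        pos = np.maximum(self.regret_sum, 0)
        s = pos / pos.sum() if pos.sum() > 0 else np.full(len(pos), 1 / len(pos))
        self.strategy_sum += s  # accumulate for the average strategy
        return s

    def move(self):
        return np.random.choice(len(self.regret_sum), p=self.strategy())

    def learn_from(self, my_move, opp_move):
        # Regret of each action = its payoff minus the payoff actually received.
        self.regret_sum += PAYOFF[:, opp_move] - PAYOFF[my_move, opp_move]

    def average_strategy(self):
        # The *average* strategy converges to equilibrium, not the last iterate.
        return self.strategy_sum / self.strategy_sum.sum()
```

In self-play the average strategies converge to the unique RPS equilibrium, (1/3, 1/3, 1/3), even though the per-round strategies keep cycling.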
To run the packaged example, first install the dependencies (NumPy):

```sh
pip install -r requirements.txt
```

then run `main.py`:
```python
from regretmatching.rps import RPSPlayer
import numpy as np

a = RPSPlayer()
b = RPSPlayer()

t = 10000
for i in range(t):
    # Each player samples a move from its current regret-matching strategy,
    a_move = a.move()
    b_move = b.move()
    # then updates its cumulative regrets given the opponent's move.
    a.learn_from(b_move)
    b.learn_from(a_move)

# If eps() bounds each player's average regret by ε, the averaged
# strategies below form a 2ε-Nash equilibrium.
_2e = np.round(2 * np.max([a.eps(), b.eps()]), 3)
a_ne = a.current_best_response()
b_ne = b.current_best_response()
print("{0}-Nash equilibrium for RPS: {1}, {2}".format(_2e, a_ne, b_ne))
```
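The printed number is a quality guarantee rather than part of the answer: if each player's average regret is at most ε (which is presumably what `eps()` tracks), a standard result says the pair of averaged strategies is a 2ε-Nash equilibrium. One way to sanity-check the reported strategies independently is to measure their exploitability, i.e. the gain a best-responding opponent could achieve against them. Below is a hypothetical `exploitability` helper, not part of this package, reusing the illustrative `PAYOFF` table from the first sketch:

```python
import numpy as np

def exploitability(avg_strategy, payoff):
    # Gain a best-responding column player achieves against a fixed row
    # mixed strategy. RPS is zero-sum with game value 0, so any positive
    # gain is exactly the ε by which play misses an exact equilibrium.
    opp_payoffs = -(np.asarray(avg_strategy) @ payoff)  # opponent's payoff per pure action
    return max(float(opp_payoffs.max()), 0.0)

# Uniform play is unexploitable: exploitability([1/3, 1/3, 1/3], PAYOFF) == 0.0
# Always playing Rock loses 1 per round to Paper: exploitability([1, 0, 0], PAYOFF) == 1.0
```

An exact equilibrium has exploitability 0; the averaged strategies produced by the self-play loop should approach that as `t` grows.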