Jax-Baseline is a Reinforcement Learning implementation built on JAX and the Flax/Haiku libraries, mirroring the functionality of Stable-Baselines.
- 2-3x faster than comparable PyTorch and TensorFlow implementations
- Optimized with JAX's Just-In-Time (JIT) compilation
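Much of this speedup comes from JIT-compiling the update math into fused XLA kernels. A minimal, repo-independent sketch of the pattern in plain JAX (this is not Jax-Baseline's actual API, just an illustration of the technique):

```python
import jax
import jax.numpy as jnp

@jax.jit  # trace once, then reuse the compiled XLA kernel on every call
def td_targets(rewards, next_q, dones, gamma=0.99):
    # Standard 1-step TD target: r + gamma * max_a Q(s', a) * (1 - done)
    return rewards + gamma * jnp.max(next_q, axis=1) * (1.0 - dones)

rewards = jnp.array([1.0, 0.0])
next_q = jnp.array([[0.5, 1.5], [2.0, 0.1]])
dones = jnp.array([0.0, 1.0])
print(td_targets(rewards, next_q, dones))  # first call compiles; later calls are fast
```

The first call pays a one-time tracing/compilation cost; subsequent calls with the same shapes run the cached kernel, which is where the throughput gains over eager-mode frameworks come from.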
```shell
pip install -r requirement.txt
pip install .
```
- ✔️ : Implemented as an optional feature
- ✅ : Implemented as the default, following the original paper
- ❌ : Not implemented yet, or cannot be implemented
| Name | Q-Net based | Actor-Critic based | DPG based |
| --- | --- | --- | --- |
| Gymnasium | ✔️ | ✔️ | ✔️ |
| EnvPool | ✔️ | ✔️ | ✔️ |
| Name | Double[1] | Dueling[2] | PER[3] | N-step[4][5] | NoisyNet[6] | Munchausen[7] | Ape-X[8] | HL-Gauss[9] |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DQN[10] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ |
| C51[11] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
| QRDQN[12] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ |
| IQN[13] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ |
| FQF[14] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ❌ | ❌ |
| SPR[15] | ✅ | ✅ | ✅ | ✅ | ✅ | ✔️ | ❌ | ✔️ |
| BBF[16] | ✅ | ✅ | ✅ | ✅ | ✔️ | ✔️ | ❌ | ✔️ |
| Name | Box | Discrete | IMPALA[17] |
| --- | --- | --- | --- |
| A2C[18] | ✔️ | ✔️ | ✔️ |
| PPO[19] | ✔️ | ✔️ | ✔️[20] |
| Truly PPO (TPPO)[21] | ✔️ | ✔️ | ❌ |
| SPO[22] | ✔️ | ✔️ | ❌ |
| Name | PER[3] | N-step[4][5] | Ape-X[8] | Simba[23] | Simba-v2[24] |
| --- | --- | --- | --- | --- | --- |
| DDPG[25] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
| TD3[26] | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
| SAC[27] | ✔️ | ✔️ | ❌ | ✔️ | ✔️ |
| DAC[28] ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| TQC[29] | ✔️ | ✔️ | ❌ | ✔️ | ✔️ |
| TD7[30] | ✅ (LAP[31]) | ❌ | ❌ | ✔️ | ✔️ |
| CrossQ[32] | ✔️ | ✔️ | ❌ | ✔️ | ✔️ |
| BRO[33] ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
To test Atari with DQN (or C51, QRDQN, IQN, FQF):

```shell
python test/run_qnet.py --algo DQN --env BreakoutNoFrameskip-v4 --learning_rate 0.0002 \
    --steps 5e5 --batch 32 --train_freq 1 --target_update 1000 --node 512 \
    --hidden_n 1 --final_eps 0.01 --learning_starts 20000 --gamma 0.995 --clip_rewards
```
On Atari Breakout, 500K steps complete in about 15 minutes (~540 steps/sec), measured on an Nvidia RTX 3080 and an AMD Ryzen 9 5950X in a single process.

```
score : 9.600, epsilon : 0.010, loss : 0.181 |: 100%|███████| 500000/500000 [15:24<00:00, 540.88it/s]
```