-
Notifications
You must be signed in to change notification settings - Fork 5
Customization
For scenarios where you would like to customize an environment or an algorithm e.g. PPO but the configurations provided by RL.ts do not suffice (e.g. adding random network distillation to your algorithm), then you can directly copy the original source file from this repository and edit that without having to code from scratch.
The following section will take you through how to customize an algorithm to your own needs in typescript and javascript.
The example used is DQN and the customization added is using a linear decay schedule for the epsilon value. Epsilon is the probability of taking a purely random action instead of one that is based on the deep Q network, which is important for exploration in DQN.
Find the source code for the algorithm. For DQN, this is https://github.com/StoneT2000/rl-ts/blob/main/src/Algos/dqn/index.ts