Agent must collect bananas in a large, square room.
Episodic
+1 for each yellow banana
-1 for each blue banana
Navigate through the room, collecting as many yellow bananas as possible while avoiding the blue bananas
This environment is considered "solved" when the agent recieves a scoreof 13 points, averaged over 100 episodes
-
37 Dimensions
-
Continuous
-
Agent Velocity
Ray traces to objects around agent's forward direction
- 4 Dimensions
- Discrete space
-
0 1 2 3 forward backward left right
In a new virtual environment:
cd {local path to cloned repo}
cd ./python && python setup.py build && python setup.py install && python setup.py clean && cd -
cd {local path to cloned repo}
python navigation.py {options}
-
Option Description --novis Run without visualization --env-dir Set directory where environment is implemented --env-file Set file for environment --n-episodes Set number of episodes to run agent --seed Set random seed for reproducibility --cpu Use CPU instead of GPU --help Show help message --prioritizedNot Yet Implemented Use Prioritized Replay Memory--doubleNot Yet Implemented Use double deep q network--duelingNot Yet Implemented Use dueling deep q networks