GitHub - h2jaafar/pb-value-iteration-networks: A modified implementation of Value Iteration Networks to be implemented with PathBench.

Introduction

This is a modified implementation of Kent Sommer's PyTorch Value Iteration Networks implementation, meant to work with PathBench.

Read about Kent Sommers Implementation: Here

Read about PathBench: Here

Read the original paper:Here

Read about similar implemenations, which have made this modified implemenation possible

What has been modified?

In order to run on PathBench generated maps, I had to modify a few areas of the code. The main difference is within the test.py file and gridworld class. The maps (resources>maps), which are JSON files structured with the goal position, agent position and grid. I have also added a few metrics, such as path deviation from optimal path, and time.

The PathBench maps are structured where 0 is obstacle and 1 is free space. Sample Map:

Installation

Instructions:

Install Packages
Download Training Data
Train on Training Data
Run test.py.

This repository requires following packages:

SciPy >= 0.19.0
Python >= 2.7 (if using Python 3.x: python3-tk should be installed)
Numpy >= 1.12.1
Matplotlib >= 2.0.0
PyTorch >= 0.1.11

Use pip to install the necessary dependencies:

pip install -U -r requirements.txt

Note that PyTorch cannot be installed directly from PyPI; refer to http://pytorch.org/ for custom installation instructions specific to your needs.

Downloading Training Data

WIP

To aquire the training data (NOT for PathBench Maps), either run the training data generator make_training_data.py in the dataset folder (resource intensive), or run the shell script to download them:

cd into main directory
chmod +x download_weights_and_datasets.sh
./download_weights_and_datasets

It should download the training data.

PathBench Training Data

Outdated- Please await updated training files

To download PathBench training data, visit GDrive Currently, only the 90000 map training data is uploaded. I will upload more as I generate them.

To generate your own training data, add the json files with correct structure to a specified path, and pass that path in the make_training_data.py file. It is fairly straight forward.

How to train

8x8 gridworld

python train.py --datafile dataset/gridworld_8x8.npz --imsize 8 --lr 0.005 --epochs 30 --k 10 --batch_size 128

16x16 gridworld

python train.py --datafile dataset/gridworld_16x16.npz --imsize 16 --lr 0.002 --epochs 30 --k 20 --batch_size 128

28x28 gridworld

python train.py --datafile dataset/gridworld_28x28.npz --imsize 28 --lr 0.002 --epochs 30 --k 36 --batch_size 128

Flags:

datafile: The path to the data files.
imsize: The size of input images. One of: [8, 16, 28]
lr: Learning rate with RMSProp optimizer. Recommended: [0.01, 0.005, 0.002, 0.001]
epochs: Number of epochs to train. Default: 30
k: Number of Value Iterations. Recommended: [10 for 8x8, 20 for 16x16, 36 for 28x28]
l_i: Number of channels in input layer. Default: 2, i.e. obstacles image and goal image.
l_h: Number of channels in first convolutional layer. Default: 150, described in paper.
l_q: Number of channels in q layer (~actions) in VI-module. Default: 10, described in paper.
batch_size: Batch size. Default: 128

How to test / visualize paths (requires training first)

8x8 gridworld

python test.py --weights trained/vin_8x8.pth --imsize 8 --k 10

16x16 gridworld

python test.py --weights trained/vin_16x16.pth --imsize 16 --k 20

28x28 gridworld

python test.py --weights trained/vin_28x28.pth --imsize 28 --k 36

64x64 gridworld

python test.py --weights trained/vin_28x28.pth --imsize 28 --k 36

(64x64 still uses 28x28 trained data, we haven't trained VIN on 64x64 maps yet.)

To visualize the optimal and predicted paths simply pass:

--plot

Flags:

weights: Path to trained weights.
imsize: The size of input images. One of: [8, 16, 28]
plot: If supplied, the optimal and predicted paths will be plotted
k: Number of Value Iterations. Recommended: [10 for 8x8, 20 for 16x16, 36 for 28x28]
l_i: Number of channels in input layer. Default: 2, i.e. obstacles image and goal image.
l_h: Number of channels in first convolutional layer. Default: 150, described in paper.
l_q: Number of channels in q layer (~actions) in VI-module. Default: 10, described in paper.

Results

The maps that VIN trains on are NOT the PathBench maps, rather they are maps generated from Kent's implementation. This is still WIP Therefore, when running VIN w/ PathBench maps, it is running on untrained style maps (Block map and Uniform Random Fill are unfamiliar to the algorithm).

Logs are saved in resources>logs. You can change the logging behaviour (debug vs info) in test_pb.py. Ensure logs don't overwrite eachother by changing the name at each run.

Sample Maps:

Block Map:

Uniform Random Fill Map

House Map

Training

WIP

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vscode		.vscode
dataset		dataset
domains		domains
generators		generators
maps		maps
resources/maps		resources/maps
results		results
trained		trained
utility		utility
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
download_weights_and_datasets.sh		download_weights_and_datasets.sh
general_test16.py		general_test16.py
general_test28.py		general_test28.py
general_test8.py		general_test8.py
model.py		model.py
requirements.txt		requirements.txt
scrap.py		scrap.py
test.py		test.py
test16.py		test16.py
test28.py		test28.py
test64.py		test64.py
test8.py		test8.py
test_house_expo.py		test_house_expo.py
test_nonzeros.py		test_nonzeros.py
test_og.py		test_og.py
train.py		train.py
train_og.py		train_og.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

What has been modified?

Installation

Downloading Training Data

PathBench Training Data

How to train

8x8 gridworld

16x16 gridworld

28x28 gridworld

How to test / visualize paths (requires training first)

8x8 gridworld

16x16 gridworld

28x28 gridworld

64x64 gridworld

Results

Sample Maps:

Training

About

Releases

Packages

Languages

License

h2jaafar/pb-value-iteration-networks

Folders and files

Latest commit

History

Repository files navigation

Introduction

What has been modified?

Installation

Downloading Training Data

PathBench Training Data

How to train

8x8 gridworld

16x16 gridworld

28x28 gridworld

How to test / visualize paths (requires training first)

8x8 gridworld

16x16 gridworld

28x28 gridworld

64x64 gridworld

Results

Sample Maps:

Training

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages