This example demonstrates how to run standard Chainer train_mnist.py example using Batch AI.
- batchaitraining/chainer:distributed docker image is used;
- Standard chainer sample script train_mnist.py is used;
- Chainer downloads the standard MNIST Database on its own;
- Standard output of the job and the model will be stored on Azure File Share;
You can find Jupyter Notebook for this recipe in Chainer-GPU-Distributed.ipynb.
You can find Azure CLI 2.0 instructions for this recipe in cli-instructions.md.
The Dockerfile
for the Docker images used in this recipe can be found here. The dockerfile is a modified version of ChainerMN example at chainer/chainermn#71
Under construction...
If you have any problems or questions, you can reach the Batch AI team at [email protected] or you can create an issue on GitHub.
We also welcome your contributions of additional sample notebooks, scripts, or other examples of working with Batch AI.