Deploying GPT-J with DeepSpeed on AWS SageMaker

This repository demonstrates how you can accelerate GPT-J with DeepSpeed Inference and deploy it on AWS SageMaker.

Detailed instructions are provided in the notebook deploy_gptj_with_deepspeed.ipynb.

Repository contents:

container
run_local
.gitignore
LICENSE
README.md
build.sh
deploy_gptj_with_deepspeed.ipynb
push_to_ecr.sh
