Skip to content

Demonstration of accelerating GPT-J with DeepSpeed Inference and deploying on AWS SageMaker

License

Notifications You must be signed in to change notification settings

mantiumai/aws-sagemaker-gptj-deepspeed-blog

Repository files navigation

Deploying GPT-J with DeepSpeed on AWS Sagemaker

This repository demonstrates how you can accelerate GPT-J with DeepSpeed Inference and deploy it on AWS SageMaker.

Detailed instructions are provided in this notebook.

About

Demonstration of accelerating GPT-J with DeepSpeed Inference and deploying on AWS SageMaker

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published