Custom LLM that speaks in a Shakespearean style
There are two different requirements files (the one for Docker does not include torch, since it is already provided by the base image)
pip install -r requirements.txt
python extract_and_process.py
Run the Flask App
python main.py
python train.py --train_file data/enhanced/train.txt --test_file data/enhanced/test.txt --output_dir "experiments" --model_name gpt2 --num_train_epochs 5 --per_device_train_batch_size 16 --save_steps 10000
For additional information, refer to the argument descriptions in train.py or run python train.py --help
or
curl -X POST -H "Content-Type: application/json" -d "{\"test_file\": \"data/enhanced/test.txt\", \"train_file\": \"data/enhanced/train.txt\", \"output_dir\": \"experiments\", \"port\": 6666, \"model_name\": \"gpt2\", \"num_train_epochs\": 5, \"per_device_train_batch_size\": 8, \"save_steps\": 10000}" http://localhost:5000/train
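Equivalently, the same training request can be sent from Python; a minimal sketch using the requests library (the payload mirrors the curl call above):

import requests

# Same parameters as in the curl example; adjust paths and ports to your setup.
payload = {
    "train_file": "data/enhanced/train.txt",
    "test_file": "data/enhanced/test.txt",
    "output_dir": "experiments",
    "port": 6666,
    "model_name": "gpt2",
    "num_train_epochs": 5,
    "per_device_train_batch_size": 8,
    "save_steps": 10000,
}
resp = requests.post("http://localhost:5000/train", json=payload)
print(resp.status_code, resp.text)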
If you do not have a trained model (e.g., when interacting with the Docker container for the first time), you need to train one first. Then, during the first call to the inference endpoint, the model is loaded and cached in memory (as a global). Hence, after that cold start, all subsequent requests will be very fast!
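A minimal sketch of that lazy-loading pattern, assuming a transformers GPT-2 checkpoint saved under experiments (names here are illustrative, not necessarily those used in main.py):

from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Module-level cache: populated on the first /generate_text call,
# then reused by every subsequent request.
_model = None
_tokenizer = None

def get_model(checkpoint_dir="experiments"):
    global _model, _tokenizer
    if _model is None:  # cold start: load once, keep in memory
        _tokenizer = GPT2Tokenizer.from_pretrained(checkpoint_dir)
        _model = GPT2LMHeadModel.from_pretrained(checkpoint_dir)
        _model.eval()
    return _model, _tokenizer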
curl -X POST -H "Content-Type: application/json" -d "{\"input_text\": \"To be or not to be, that is the question:\"}" http://localhost:5000/generate_text
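The same generation request from Python (a sketch; assumes the app is reachable on localhost:5000):

import requests

resp = requests.post(
    "http://localhost:5000/generate_text",
    json={"input_text": "To be or not to be, that is the question:"},
)
print(resp.text)  # the generated continuation, as returned by the app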
Build the Image
The build step:
- Downloads the data
- Downloads the GPT-2 models/tokenizers
docker build -t my-gpt2-inference-image .
Run the container, which serves both the train and inference endpoints
docker run --name test_gpt -p 6666:6666 -p 5000:5000 --gpus all my-gpt2-inference-image
Then run the same curl commands as above for training and inference
You can also use volumes to persist training results. First, create a named volume
docker volume create checkpoint
Then attach it to the container (remove any previous container named test_gpt first)
docker run --name test_gpt -v checkpoint:/app/experiments -p 6666:6666 -p 5000:5000 --gpus all my-gpt2-inference-image
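With the volume attached, checkpoints written to /app/experiments persist in the checkpoint volume across container restarts and removals.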