GPT2 Hasktorch implementation

The goal of this project is to reproduce GPT-2, created by OpenAI, in the Haskell programming language using the Hasktorch library, drawing inspiration from Andrej Karpathy's implementation in PyTorch.

Haskell : https://www.haskell.org/

Hasktorch : http://hasktorch.org/

NanoGPT (Karpathy's implementation) : https://github.com/karpathy/nanoGPT

GPT2 Paper : https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf


GPT2 Parameters

Parameter      Value
nBlock         12
nHead          12
nEmbd          768
vocabSize      50,257
nbParameters   117M
seqLen         1024
activation     GELU
optimizer      Adam
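
As a rough sketch, these hyperparameters can be grouped into a plain Haskell record; the field names below are illustrative and may differ from the ones actually used in this repository.

-- Illustrative configuration record for the GPT-2 small model above.
data GPT2Config = GPT2Config
  { nBlock    :: Int  -- number of transformer blocks
  , nHead     :: Int  -- number of attention heads per block
  , nEmbd     :: Int  -- embedding / hidden dimension
  , vocabSize :: Int  -- size of the BPE vocabulary
  , seqLen    :: Int  -- maximum context length
  } deriving (Show)

-- The configuration from the table above.
gpt2Small :: GPT2Config
gpt2Small = GPT2Config
  { nBlock    = 12
  , nHead     = 12
  , nEmbd     = 768
  , vocabSize = 50257
  , seqLen    = 1024
  }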

Features

  • All modules of GPT2 ✅
  • Forward pass ✅
  • Backward pass ✅
  • LazyDataloader to handle large text files ✅
  • Variable learning rate (see the schedule sketch after this list) ✅
  • Complete training loop ✅
  • Gradient accumulation ✅
  • Save the training state ✅
  • Efficient training tracker ✅
  • Plot metrics in real time ✅
  • Load and use the real GPT2 tokenizer ✅
  • CUDA support ✅
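
A variable learning rate of this kind typically means linear warmup followed by cosine decay, as in Karpathy's nanoGPT. The sketch below is a minimal, framework-free version of such a schedule; all constants are illustrative and not necessarily the ones used in this repository.

-- Linear warmup followed by cosine decay, in the style of nanoGPT.
-- The constants below are placeholders, not the repository's settings.
lrSchedule :: Int -> Double
lrSchedule step
  | step < warmup     = maxLr * fromIntegral (step + 1) / fromIntegral warmup
  | step > decaySteps = minLr
  | otherwise =
      let progress = fromIntegral (step - warmup)
                   / fromIntegral (decaySteps - warmup)
          coeff    = 0.5 * (1 + cos (pi * progress))
      in minLr + coeff * (maxLr - minLr)
  where
    maxLr      = 6e-4
    minLr      = 6e-5
    warmup     = 10
    decaySteps = 1000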

TODOs

  • Variable batch size ❌
  • Weight sharing between the input token embedding layer (wte) and the output language modeling head (lm_head) ❌
  • Weight decay ❌
  • Flash attention ❌
  • Distributed Data Parallel ❌
  • Generation function (see the sketch after this list) ❌
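
A generation function would amount to autoregressive decoding around the model's forward pass. The sketch below shows greedy decoding only; the `forward` argument is a hypothetical stand-in for the model's forward pass (context in, next-token logits out) and is not part of this repository's API.

import Data.List (maximumBy)
import Data.Ord (comparing)

type Token = Int

-- Greedy autoregressive decoding: repeatedly pick the highest-logit token
-- and append it to the context.
greedyGenerate
  :: ([Token] -> [Double])  -- hypothetical forward pass: context -> logits
  -> Int                    -- number of tokens to generate
  -> [Token]                -- prompt
  -> [Token]
greedyGenerate _ 0 ctx = ctx
greedyGenerate forward n ctx =
  let logits = forward ctx
      next   = fst (maximumBy (comparing snd) (zip [0 ..] logits))
  in greedyGenerate forward (n - 1) (ctx ++ [next])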

Launch the program

docker compose up -d  # start the Docker container
stack run             # build and run the main executable
stack test            # run the test suite

Use Jupyter

Once the container is running, JupyterLab is available at:

http://localhost:8890/lab
