- GPT-2 in CUDA
- All the kernels are written from scratch
- Experiments are run on an RTX 2060
Time refers to generating 32 tokens, extending the sequence from 2016 to 2048 tokens (a timing sketch follows below).
Iteration 1: 48.5 s
Iteration 2: ?
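A minimal sketch of how such a measurement can be taken with CUDA events; `generate_tokens` and its arguments are placeholders standing in for this repo's actual generation loop, not its real API:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the repo's forward/sampling loop.
void generate_tokens(int start_len, int end_len) { /* ... run the model ... */ }

int main() {
    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start);
    generate_tokens(2016, 2048);      // generate 32 tokens: 2016 -> 2048
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);       // wait for all queued GPU work to finish

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("generation took %.1f s\n", ms / 1000.0f);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    return 0;
}
```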
- Use a 3D grid for the MultiHeadAttention kernels (see the launch sketch after this list)
- Fuse adjacent kernels to cut launch overhead and global-memory round trips (see the fusion sketch below)
- Add tests
- Use Flash Attention (the online-softmax idea behind it is sketched below)
- Add a KV cache so past keys and values are not recomputed at every decoding step (see the sketch below)
- Use Tensor Cores (a WMMA sketch follows below)
- Support for batch size > 1
- Support for fp16/int8
- Support for multi-GPU
- Free the CPU copy of the weights once they have been uploaded to the GPU
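A minimal sketch of the 3D grid idea for the attention-score kernel: one grid dimension per head, one per query position, and threads along the key dimension. The kernel name and tensor layouts are assumptions for illustration, not this repo's actual code:

```cuda
#include <math.h>

// One thread computes one attention score. Assumed layouts: Q and K are
// [n_head, seq_len, head_dim], scores is [n_head, seq_len, seq_len], row-major.
__global__ void attention_scores(const float* Q, const float* K, float* scores,
                                 int seq_len, int head_dim) {
    int j = blockIdx.x * blockDim.x + threadIdx.x;  // key position
    int i = blockIdx.y;                             // query position
    int h = blockIdx.z;                             // attention head
    if (j >= seq_len) return;

    const float* q = Q + ((size_t)h * seq_len + i) * head_dim;
    const float* k = K + ((size_t)h * seq_len + j) * head_dim;
    float s = 0.0f;
    for (int d = 0; d < head_dim; d++) s += q[d] * k[d];
    scores[((size_t)h * seq_len + i) * seq_len + j] = s * rsqrtf((float)head_dim);
}

// Launch: one block row per (head, query) pair, threads cover key positions.
// dim3 block(256);
// dim3 grid((seq_len + 255) / 256, seq_len, n_head);
// attention_scores<<<grid, block>>>(Q, K, scores, seq_len, head_dim);
```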
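A sketch of the kind of fusion meant above, assuming a bias add followed by GPT-2's tanh-approximated GELU; running them as one kernel saves a full read/write of the activations through global memory. Names and layouts are illustrative:

```cuda
#include <math.h>

// Unfused, this would be two kernels (add_bias, then gelu), each touching x in
// global memory. Assumed layout: x is [rows, cols] row-major, bias has length cols.
__global__ void fused_bias_gelu(float* x, const float* bias, int n, int cols) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float v = x[i] + bias[i % cols];                        // bias add
    float c = 0.7978845608f * (v + 0.044715f * v * v * v);  // sqrt(2/pi) * (...)
    x[i] = 0.5f * v * (1.0f + tanhf(c));                    // GPT-2 GELU (tanh approx)
}
```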
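The part of Flash Attention that matters most here is the online softmax: the full seq_len x seq_len score matrix is never materialized, only a running max and running sum per query. A simplified sketch of that recurrence (one thread per query, causal mask, no shared-memory tiling; names, layouts, and the fixed head dimension are assumptions):

```cuda
#include <math.h>

#define HEAD_DIM 64   // GPT-2 head dimension

// Assumed layouts: Q, K, V, O are [n_head, seq_len, HEAD_DIM], row-major.
__global__ void attention_online_softmax(const float* Q, const float* K,
                                         const float* V, float* O, int seq_len) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;   // query position
    int h = blockIdx.y;                              // head
    if (i >= seq_len) return;

    const float* q = Q + ((size_t)h * seq_len + i) * HEAD_DIM;
    float acc[HEAD_DIM] = {0.0f};
    float m = -INFINITY, l = 0.0f;

    for (int j = 0; j <= i; j++) {                   // causal mask: keys <= i
        const float* k = K + ((size_t)h * seq_len + j) * HEAD_DIM;
        const float* v = V + ((size_t)h * seq_len + j) * HEAD_DIM;
        float s = 0.0f;
        for (int d = 0; d < HEAD_DIM; d++) s += q[d] * k[d];
        s *= rsqrtf((float)HEAD_DIM);

        float m_new = fmaxf(m, s);
        float alpha = expf(m - m_new);               // rescale old accumulator
        float p = expf(s - m_new);
        for (int d = 0; d < HEAD_DIM; d++) acc[d] = acc[d] * alpha + p * v[d];
        l = l * alpha + p;
        m = m_new;
    }
    float* o = O + ((size_t)h * seq_len + i) * HEAD_DIM;
    for (int d = 0; d < HEAD_DIM; d++) o[d] = acc[d] / l;
}
```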
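A sketch of the KV-cache idea: preallocate key/value buffers for the full context and append one position per decoding step, so nothing from earlier tokens is recomputed. Layouts and names are assumptions:

```cuda
// Assumed layouts: k_cache/v_cache are [n_head, max_seq, head_dim]; k_new/v_new
// are [n_head, head_dim] for the single new token written at position pos.
__global__ void append_kv(float* k_cache, float* v_cache,
                          const float* k_new, const float* v_new,
                          int pos, int max_seq, int head_dim) {
    int d = blockIdx.x * blockDim.x + threadIdx.x;   // element within the head
    int h = blockIdx.y;                              // head
    if (d >= head_dim) return;
    size_t dst = ((size_t)h * max_seq + pos) * head_dim + d;
    size_t src = (size_t)h * head_dim + d;
    k_cache[dst] = k_new[src];
    v_cache[dst] = v_new[src];
}
```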
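A sketch of using Tensor Cores through the WMMA API (available on the RTX 2060's Turing SMs, compile with `-arch=sm_75` or newer): one warp computes one 16x16 output tile of an fp16 GEMM with fp32 accumulation. Dimensions are assumed to be multiples of 16; this is illustrative, not this repo's matmul:

```cuda
#include <mma.h>
#include <cuda_fp16.h>
using namespace nvcuda;

// Assumed layouts: A is row-major [M,K], B is row-major [K,N], C is row-major [M,N].
__global__ void wmma_gemm(const half* A, const half* B, float* C,
                          int M, int N, int K) {
    int tile_m = blockIdx.y;   // which 16-row stripe of C
    int tile_n = blockIdx.x;   // which 16-column stripe of C

    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;
    wmma::fill_fragment(c_frag, 0.0f);

    for (int k = 0; k < K; k += 16) {
        wmma::load_matrix_sync(a_frag, A + tile_m * 16 * K + k, K);
        wmma::load_matrix_sync(b_frag, B + k * N + tile_n * 16, N);
        wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);   // 16x16x16 on Tensor Cores
    }
    wmma::store_matrix_sync(C + tile_m * 16 * N + tile_n * 16, c_frag,
                            N, wmma::mem_row_major);
}

// Launch: one warp (32 threads) per block, one block per 16x16 output tile.
// dim3 grid(N / 16, M / 16);
// wmma_gemm<<<grid, 32>>>(A, B, C, M, N, K);
```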