ReaLLMASIC

Overview

ReaLLMASIC aims to bridge the gap between theoretical model design and practical hardware implementation, ensuring efficient, scalable, and robust ML model development.

The project explores a wide range of model configurations and modules, covering a diverse set of use cases.

Key exploration features include:

Module Variation: Experiment with different module types (e.g. Softmax, Softermax, ConSmax, and SigSoftmax) to discover which offers the best power, performance, and area (PPA) trade-off for your application.
Flexible Tokenization: Explore different tokenization schemes: tiktoken, sentencepiece, phonemization, character level, custom tokenization, etc.
Diverse Dataset Performance Testing: Evaluate model efficacy across various languages and datasets, including csv-timeseries, mathematics, music, lyrics, literature, and webtext.
Standard and Custom Hyperparameters: Fine-tune models using conventional hyperparameters and explore the impact of custom settings on model performance and PPA.
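To make the module-variation idea concrete, here is a minimal sketch comparing standard softmax with two alternative normalizers. It is an illustration under stated assumptions, not the repository's modules: `base2_softmax` is a simplified base-2 variant in the spirit of Softermax, and `sigsoftmax` uses the commonly cited form that weights each exponential by a sigmoid. All function names are hypothetical.

```python
import math


def _sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def softmax(xs):
    # Standard softmax with max-subtraction for stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def base2_softmax(xs):
    # Assumption: simplified base-2 variant (2**x instead of e**x),
    # which maps to shifts more naturally in hardware.
    m = max(xs)
    pows = [2.0 ** (x - m) for x in xs]
    total = sum(pows)
    return [p / total for p in pows]


def sigsoftmax(xs):
    # Assumption: exp(x) * sigmoid(x), normalized over the vector.
    m = max(xs)
    weights = [math.exp(x - m) * _sigmoid(x) for x in xs]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    scores = [1.0, 2.0, 3.0]
    for fn in (softmax, base2_softmax, sigsoftmax):
        print(fn.__name__, [round(p, 3) for p in fn(scores)])
```

Each variant returns a probability vector that sums to 1 and preserves the ordering of the inputs; they differ in how sharply they concentrate mass, which is one of the properties an exploration would weigh against hardware cost.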

Key analysis features include:

Exploration Scripts: Explorations are encapsulated in bash scripts that loop over train.py's argparse parameters.
Logging with Automatic Timestamps and Labels: Run a suite of experiments and have the repo automatically organize and label them by timestamp and description.
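The sweep-and-label pattern described above can be sketched as follows. This is a hedged illustration written in Python rather than bash, and it only builds command strings without running them. The flags `--n_layer`, `--softmax_variant`, and `--run_name` are hypothetical placeholders, as is the run-name format; check train.py's argparse definitions for the real parameter names.

```python
import itertools
import shlex
from datetime import datetime


def build_commands(grid, script="train.py", now=None):
    """Return one command line per combination of values in `grid`."""
    now = now or datetime.now()
    stamp = now.strftime("%Y%m%d_%H%M%S")
    keys = sorted(grid)  # fixed order keeps labels reproducible
    commands = []
    for values in itertools.product(*(grid[k] for k in keys)):
        # Label each run by timestamp plus the swept settings.
        label = "_".join(f"{k}-{v}" for k, v in zip(keys, values))
        args = ["python3", script]
        for k, v in zip(keys, values):
            args += [f"--{k}", str(v)]
        # Hypothetical flag used here to carry the run label.
        args += ["--run_name", f"{stamp}_{label}"]
        commands.append(shlex.join(args))
    return commands


if __name__ == "__main__":
    grid = {
        "n_layer": [2, 4],                         # hypothetical flag
        "softmax_variant": ["softmax", "consmax"],  # hypothetical flag
    }
    for cmd in build_commands(grid):
        print(cmd)
```

Printing the commands first makes it easy to review a sweep before launching it; each line could then be passed to a shell or to `subprocess.run`.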

Key hardware-related features include:

Training with Hardware Emulation: Use different operations in the forward and backward passes for hardware-implementation-aware training.
PPA Implications Analysis: Understand the power, performance, and area (PPA) implications of different model designs, guiding efficient hardware-software integration.
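One common way to use different operations in the forward and backward passes is a straight-through estimator: the forward pass emulates low-precision hardware, while the backward pass uses the gradient of the full-precision operation. The sketch below assumes that approach for a single weight in plain Python; it is not taken from the repository, and the function names and the quantization step of 0.25 are illustrative choices.

```python
def quantize(x, step=0.25):
    # Emulate a fixed-point grid by rounding to the nearest multiple of `step`.
    return round(x / step) * step


def forward(w, x, step=0.25):
    # Forward pass: behave like the hardware, using the quantized weight.
    return quantize(w, step) * x


def backward(x, grad_out):
    # Backward pass: ignore the rounding and use d(w * x)/dw = x
    # (straight-through estimator).
    return grad_out * x


def train(w, x, target, lr=0.1, steps=50, step=0.25):
    # Gradient descent on squared error, keeping a full-precision weight.
    for _ in range(steps):
        y = forward(w, x, step)
        grad_out = 2.0 * (y - target)  # derivative of (y - target) ** 2
        w -= lr * backward(x, grad_out)
    return w


if __name__ == "__main__":
    w = train(w=0.0, x=1.0, target=1.0)
    print("latent weight:", w, "quantized weight:", quantize(w))
```

The rounding step has zero gradient almost everywhere, so training through it directly would stall; substituting the full-precision gradient lets the latent weight keep moving until its quantized value matches the target.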


TOC

Installation

Step 1 (Recommended) Adding a Virtual Env

Step 2 Install Dependencies

Testing Your Setup

Prepare Training and Validation Data Sets

Train Model From Scratch

Perform Inference From Custom Model

Explorations

Start Exploration

Inspect and Monitor Best Val Losses

Start Tensorboard Logging

TODO Section:

Contributing

Acknowledgements
