Skip to content

b14ucky/Taco-LLMingway

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Taco LLMingway - Core Library & ML Training

This is the core Python library and heavy-duty training pipeline for the Taco LLMingway project. It features a custom GPT (Generative Pre-trained Transformer) implementation built from scratch using PyTorch, optimized for generating long-form lyrics in the style of Taco Hemingway.

Features

  • Scalable GPT Architecture: Causal Transformer decoder capable of running both "mini" local tests and deep Kaggle training sessions.
  • Sinusoidal Position Encodings: Fixed embeddings for precise sequence positioning.
  • Production-Ready Trainer: Supports DataParallel for multi-GPU training, automatic checkpointing, and training resumption.
  • Tokenization: Custom character-level/word-level tokenizer with JSON export/import.
  • Kaggle Optimized: Seamless integration with Kaggle environments for GPU-accelerated training.

Kaggle Training & Data

The heavy lifting (training the final model) was performed on Kaggle. You can explore the training process and the dataset used via the links below:

Tech Stack

Project Structure

├── main.py              # Sample training entry point
├── pyproject.toml       # Package metadata and dependencies
├── src/
│   └── taco_llmingway/  # Main package logic
│       ├── model.py     # GPT architecture
│       ├── train.py     # Advanced Trainer class
│       └── ...          # Utilities, Tokenizer, Logger
├── data/raw/            # Source lyrics dataset
└── checkpoints/         # Intermediate model states (.pth)

Model Configuration (Production Grade)

The model used for the final inference was trained with these parameters:

Parameter Value
Context Length 128
Embed Dim 256
Attention Heads 8
Decoder Blocks 6
FFN Dim 1024
Batch Size 128
Learning Rate $3 \times 10^{-4}$

Installation

To install the taco_llmingway package in editable mode:

pip install -e .

Project Ecosystem

This repository is the heart of the Taco LLMingway project. To see how this model is served and presented, check out the other components:

License

This project is licensed under the MIT License.

About

Custom GPT Transformer architecture built from scratch in PyTorch. Trained on Taco Hemingway's discography for authentic lyrics generation.

Topics

Resources

License

Stars

Watchers

Forks

Contributors

Languages