Commit dec4594

Octavian

and

committed

Add v7: legal score-first TTT eval (PR #461 recipe)

Our v1 base (1.1232 pre-TTT) + legal TTT should give ~1.1211. PR #473 gets 1.1213 from a 1.1234 base — our base is 0.0002 better. TTT recipe: 32K-token chunks, score-first (inference_mode), then train (SGD lr=0.002, momentum=0.9, 3 epochs, freeze blocks 0-1, cosine LR decay across chunks, grad clip 1.0). Removed TTT burst (replaced by legal TTT eval). 1499 lines (under 1500 limit). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

1 parent cc2ff3a commit dec4594Copy full SHA for dec4594

1 file changed

train_gpt_v7.py

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit dec4594

File tree

0 commit comments