
Commit dec4594

Octavian and claude committed
Add v7: legal score-first TTT eval (PR #461 recipe)
Our v1 base (1.1232 pre-TTT) + legal TTT should give ~1.1211. PR #473 gets 1.1213 from a 1.1234 base, so our base is 0.0002 better.

TTT recipe: 32K-token chunks, score-first (inference_mode), then train (SGD lr=0.002, momentum=0.9, 3 epochs, freeze blocks 0-1, cosine LR decay across chunks, grad clip 1.0).

Removed TTT burst (replaced by legal TTT eval).
1499 lines (under 1500 limit).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
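The score-first ordering in the recipe above can be sketched as follows. This is a minimal, framework-free illustration and not the code in this commit: `score_first_ttt`, `cosine_lr`, `score_fn`, and `train_fn` are hypothetical names, "32K" is assumed to mean 32768 tokens, and the cosine schedule is assumed to decay from the base LR at the first chunk to 0 at the last. In a PyTorch version, `score_fn` would presumably run under `torch.inference_mode()`, and `train_fn` would hold the SGD step (momentum 0.9), the frozen blocks 0-1, and gradient clipping at 1.0.

```python
import math
from typing import Callable, Sequence

CHUNK_TOKENS = 32 * 1024  # assumption: "32K-token chunks" means 32768 tokens
BASE_LR = 0.002           # from the commit message
EPOCHS = 3                # from the commit message


def cosine_lr(chunk_idx: int, num_chunks: int, base_lr: float = BASE_LR) -> float:
    """Cosine decay across chunks: base_lr at the first chunk, 0 at the last.

    The exact schedule shape is an assumption; the commit only says
    "cosine LR decay across chunks".
    """
    if num_chunks <= 1:
        return base_lr
    progress = chunk_idx / (num_chunks - 1)
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * progress))


def score_first_ttt(
    tokens: Sequence[int],
    score_fn: Callable[[Sequence[int]], float],        # hypothetical: summed loss on a chunk
    train_fn: Callable[[Sequence[int], float], None],  # hypothetical: one epoch at a given LR
    chunk_tokens: int = CHUNK_TOKENS,
    epochs: int = EPOCHS,
) -> float:
    """Return the mean per-token loss, scoring each chunk before training on it."""
    chunks = [tokens[i:i + chunk_tokens] for i in range(0, len(tokens), chunk_tokens)]
    total_loss, total_tokens = 0.0, 0
    for idx, chunk in enumerate(chunks):
        # 1) Score with the current weights, before any update has seen this chunk.
        total_loss += score_fn(chunk)
        total_tokens += len(chunk)
        # 2) Only then adapt on the chunk that has already been scored.
        lr = cosine_lr(idx, len(chunks))
        for _ in range(epochs):
            train_fn(chunk, lr)
    return total_loss / total_tokens
```

Because every chunk contributes to the reported loss before the model is updated on it, the evaluation never scores tokens the model has already trained on, which is presumably what makes the eval "legal".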
1 parent cc2ff3a commit dec4594

1 file changed

Lines changed: 1499 additions & 0 deletions
