Record: Causal SLOT + Pre-quant TTT — val_bpb 1.0846 (3-seed) by resouer · Pull Request #3 · resouer/parameter-golf

resouer · 2026-04-03T16:29:01Z

Summary

3-seed mean val_bpb: 1.0846 (std 0.0007)
Beats merged SOTA (1.1147) by 0.030
Artifact: ~15.95 MB (all seeds < 16MB)
Eval: ~551s / 600s budget

Novel Mechanism: Causal SLOT

Provably causal per-chunk delta optimization. Unlike standard SLOT (PR openai#1240 proved 100% causal violation), our delta is optimized using ONLY backward-looking loss from already-scored positions. Passes strict causality tests.

Stack

Coprime-stride multi-shard loader (-0.003)
6-epoch pre-quant AdamW TTT (-0.022)
Causal SLOT (-0.009)
Training-data GPTQ calibration
Full Hessian GPTQ int6 + LZMA

Test plan

3 seeds (1337: 1.0841, 42: 1.0843, 2025: 1.0854)
All artifacts < 16MB
Eval < 600s
Zero env vars needed
Causal SLOT provably passes PR Non-record: Does SLOT violate causal dependence? (empirical test + question) openai/parameter-golf#1240 flip test

Generated with Claude Code

3-seed mean 1.0846 (std 0.0007). Beats merged SOTA (1.1147) by 0.030. Novel: provably causal eval-time delta optimization (causal SLOT). Unlike standard SLOT (PR openai#1240 proved 100% causal violation), delta is optimized using only backward-looking loss from already-scored positions. Combined with 6-epoch pre-quant AdamW TTT and coprime-stride multi-shard data loading. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

resouer force-pushed the submission/causal-slot-1.0846 branch from 8930d5a to d43a0f3 Compare April 3, 2026 16:34

resouer closed this Apr 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Record: Causal SLOT + Pre-quant TTT — val_bpb 1.0846 (3-seed)#3

Record: Causal SLOT + Pre-quant TTT — val_bpb 1.0846 (3-seed)#3
resouer wants to merge 1 commit intomainfrom
submission/causal-slot-1.0846

resouer commented Apr 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

resouer commented Apr 3, 2026

Summary

Novel Mechanism: Causal SLOT

Stack

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant