Record: Round 5 — L-BFGS Causal SLOT (1.0046) + Discriminative TTT (1.0807) by resouer · Pull Request #4 · resouer/parameter-golf

resouer · 2026-04-04T15:02:34Z

Summary

Two 3-seed-validated submissions from Round 5:

Lane A: L-BFGS Causal SLOT in Logit Space — 1.0046 BPP

3-seed: 1.0043 / 1.0048 / 1.0047 (mean 1.0046, std 0.0003)
Novel: L-BFGS optimizer + logit-space delta + causal constraint. Nearest PR: openai/parameter-golf#1318 ("Record: TTT-AdamW + SLOT L-BFGS25 LogitDelta + GPTQ DAMP=0.005", val_bpb 1.00955), which also uses L-BFGS logit SLOT but is non-causal. Our causal constraint means the loss is computed ONLY on already-scored context positions, which provably satisfies the NoesisGenesis conditions.
Compliance: Causal SLOT passes the flip test from openai/parameter-golf#1240 ("Non-record: Does SLOT violate causal dependence? (empirical test + question)"), but it still carries SLOT-category ban risk.
Beats SOTA by: 0.110 BPP (merged SOTA 1.1147 vs. 1.0046, a gap of 0.1101 in the same unit, not nats)
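The causal constraint described above can be illustrated with a toy example. This is a minimal sketch, not the submission's code: it assumes the delta is a single bias vector shared across positions, substitutes plain gradient descent for L-BFGS to stay dependency-free, and uses hypothetical names (`fit_causal_delta`, `n_scored`, `nll_bits`). It shows only that the fitting loss reads targets at already-scored positions, so flipping unscored targets cannot change the fitted delta.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fit_causal_delta(base_logits, targets, n_scored, steps=50, lr=0.5):
    """Fit a shared logit-space delta using ONLY positions < n_scored.

    Assumption: gradient descent stands in for L-BFGS; the causal
    restriction (range(n_scored)) is the part being illustrated.
    """
    vocab = len(base_logits[0])
    delta = [0.0] * vocab
    if n_scored == 0:
        return delta  # nothing scored yet, so nothing to adapt on
    for _ in range(steps):
        grad = [0.0] * vocab
        for t in range(n_scored):  # never touches unscored positions
            probs = softmax([b + d for b, d in zip(base_logits[t], delta)])
            for v in range(vocab):
                onehot = 1.0 if v == targets[t] else 0.0
                grad[v] += (probs[v] - onehot) / n_scored
        delta = [d - lr * g for d, g in zip(delta, grad)]
    return delta


def nll_bits(base_logits, targets, delta, start, stop):
    """Total negative log-likelihood in bits over positions [start, stop)."""
    total = 0.0
    for t in range(start, stop):
        probs = softmax([b + d for b, d in zip(base_logits[t], delta)])
        total -= math.log2(probs[targets[t]])
    return total


# Toy flip test: positions 0..3 are already scored, 4..5 are not.
base = [[0.0, 0.0, 0.0] for _ in range(6)]
targets = [2, 2, 2, 2, 0, 1]
flipped = [2, 2, 2, 2, 1, 0]  # only the unscored targets differ
delta = fit_causal_delta(base, targets, n_scored=4)
delta_flipped = fit_causal_delta(base, flipped, n_scored=4)
print("delta unchanged by flip:", delta == delta_flipped)
```

A non-causal variant would loop over all positions in the window when accumulating the gradient, and the same flip would then change the fitted delta.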

Lane B: Discriminative TTT — 1.0807 BPP

3-seed: 1.0803 / 1.0805 / 1.0812 (mean 1.0807, std 0.0005)
Novel: Per-block adaptive LR during pre-quant TTT (ULMFiT-inspired). Early blocks get 0.3x LR, later blocks 1.0x. Combined with freeze=0 and 10 epochs. No existing PR modulates LR per block in TTT.
Compliance: Track A (fixed predictor). Zero eval-time adaptation. Zero compliance risk.
Beats SOTA by: 0.034 BPP. Also beats openai/parameter-golf#1289 ("Record: PROTEUS v1.6", Scylla + Parallel Residuals + Depth Recurrence + Legal TTT, val_bpb 1.0819 3-seed mean), the strongest pending Track A submission.
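The per-block learning-rate idea can be sketched as follows. The description gives only the endpoints (0.3x for early blocks, 1.0x for later ones), so the linear interpolation between them is an assumption, as are the helper names and the placeholder `base_lr`. The output uses the `{"params": ..., "lr": ...}` group shape that PyTorch optimizers accept, but the sketch itself needs no dependencies.

```python
def block_lr_multipliers(n_blocks, first=0.3, last=1.0):
    """Return one LR multiplier per block, rising from `first` to `last`.

    Assumption: linear spacing; the PR only states the two endpoints.
    """
    if n_blocks < 1:
        raise ValueError("n_blocks must be at least 1")
    if n_blocks == 1:
        return [last]
    step = (last - first) / (n_blocks - 1)
    return [first + i * step for i in range(n_blocks)]


def build_param_groups(block_params, base_lr):
    """Pair each block's parameters with its scaled learning rate.

    `block_params` is a list with one entry per transformer block,
    ordered from the earliest block to the latest (hypothetical layout).
    """
    multipliers = block_lr_multipliers(len(block_params))
    return [
        {"params": params, "lr": base_lr * mult}
        for params, mult in zip(block_params, multipliers)
    ]


# Placeholder parameters and base LR, for illustration only.
groups = build_param_groups([["w0"], ["w1"], ["w2"], ["w3"]], base_lr=1e-4)
for index, group in enumerate(groups):
    print(f"block {index}: lr = {group['lr']:.2e}")
```

Keeping early blocks at a lower rate follows the ULMFiT intuition that lower layers hold more general features and should move less during fine-tuning.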

Files per submission (7 each)

train_gpt.py, README.md, submission.json, requirements.txt, 3 seed logs

Review checklist

Lane A: verify causal SLOT compliance argument holds
Lane A: confirm eval timing fits 600s budget (SLOT takes ~556s)
Lane B: verify discriminative TTT novelty claim
Lane B: confirm Track A compliance (no eval-time adaptation)
Both: verify artifact sizes < 16,000,000 bytes
Both: verify seed logs match claimed BPP values
Decide submission order: Lane A first (higher impact) or Lane B first (safer compliance)
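Several checklist items are simple arithmetic and can be scripted. The sketch below recomputes the 3-seed means and the margins over the merged SOTA (1.1147, as quoted in the commit messages) from the per-seed values listed above, and checks the 600s and 16,000,000-byte limits. It reports both population and sample standard deviation because the description does not say which convention was used. Function names are hypothetical, and real inputs would come from the seed logs and artifact files.

```python
import statistics

MERGED_SOTA = 1.1147             # quoted in the commit messages
EVAL_BUDGET_S = 600              # eval-time budget from the checklist
ARTIFACT_LIMIT_BYTES = 16_000_000

SEEDS = {
    "Lane A": [1.0043, 1.0048, 1.0047],
    "Lane B": [1.0803, 1.0805, 1.0812],
}


def summarize(scores):
    """Mean, both std conventions, and margin vs. merged SOTA (4 decimals)."""
    mean = statistics.mean(scores)
    return {
        "mean": round(mean, 4),
        "pstdev": round(statistics.pstdev(scores), 4),  # population std
        "stdev": round(statistics.stdev(scores), 4),    # sample std
        "margin": round(MERGED_SOTA - mean, 4),
    }


def within_limits(eval_seconds, artifact_bytes):
    """True if eval time fits the budget and the artifact is under the cap."""
    return eval_seconds <= EVAL_BUDGET_S and artifact_bytes < ARTIFACT_LIMIT_BYTES


for lane, scores in SEEDS.items():
    print(lane, summarize(scores))
```

For a real check, `artifact_bytes` could be taken from `os.path.getsize` on the submitted artifact and `eval_seconds` from the timing line in each seed log.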

🤖 Generated with Claude Code

@valerio-oai

…11473 (3-seed mean) AR self-generated calibration (no val/train data during quantization). Recreated from PR openai#728 at @valerio-oai's request for clarity. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ptq-xsa-bigramhash3072 Record: AR Self-Gen GPTQ + XSA-all + BigramHash 3072×112 — val_bpb 1.11473 (3-seed mean)

Lane A: L-BFGS Causal SLOT in Logit Space, 1.0046 BPP (3-seed mean, std 0.0003)
Novel: L-BFGS + logit-space + causal constraint (no existing PR combines these)
Compliance: provably causal; passes flip test, NoesisGenesis conditions satisfied

Lane B: Discriminative TTT, 1.0807 BPP (3-seed mean, std 0.0005)
Novel: per-block adaptive LR during pre-quant TTT (no existing PR does this)
Compliance: Track A (fixed predictor), zero eval-time adaptation

Both beat merged SOTA (1.1147) by large margins. Seed logs pending extraction from Lepton.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Lane A: L-BFGS Causal SLOT in Logit Space, 1.0046 BPP (3-seed mean, std 0.0003)
Novel: L-BFGS + logit-space + causal constraint (no existing PR combines these)
Compliance: provably causal; passes flip test, NoesisGenesis conditions satisfied

Lane B: Discriminative TTT, 1.0807 BPP (3-seed mean, std 0.0005)
Novel: per-block adaptive LR during pre-quant TTT (no existing PR does this)
Compliance: Track A (fixed predictor), zero eval-time adaptation

Both beat merged SOTA (1.1147) by large margins. 7 files per submission: train_gpt.py, README.md, submission.json, requirements.txt, 3 seed logs.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

resouer · 2026-04-04T15:07:18Z

Closing: need separate PRs per submission, squashed commits

abaybektursun and others added 5 commits March 28, 2026 08:32

Merge pull request openai#1019 from abaybektursun/record/ar-selfgen-g…

2443851

…ptq-xsa-bigramhash3072 Record: AR Self-Gen GPTQ + XSA-all + BigramHash 3072×112 — val_bpb 1.11473 (3-seed mean)

Update README.md

9d070df

resouer closed this Apr 4, 2026

resouer wants to merge 5 commits into main from submission/round5
