Skip to content

Record: Round 5 — L-BFGS Causal SLOT (1.0046) + Discriminative TTT (1.0807)#4

Closed
resouer wants to merge 5 commits intomainfrom
submission/round5
Closed

Record: Round 5 — L-BFGS Causal SLOT (1.0046) + Discriminative TTT (1.0807)#4
resouer wants to merge 5 commits intomainfrom
submission/round5

Conversation

@resouer
Copy link
Copy Markdown
Owner

@resouer resouer commented Apr 4, 2026

Summary

Two 3-seed-validated submissions from Round 5:

Lane A: L-BFGS Causal SLOT in Logit Space — 1.0046 BPP

Lane B: Discriminative TTT — 1.0807 BPP

Files per submission (7 each)

  • train_gpt.py, README.md, submission.json, requirements.txt, 3 seed logs

Review checklist

  • Lane A: verify causal SLOT compliance argument holds
  • Lane A: confirm eval timing fits 600s budget (SLOT takes ~556s)
  • Lane B: verify discriminative TTT novelty claim
  • Lane B: confirm Track A compliance (no eval-time adaptation)
  • Both: verify artifact sizes < 16,000,000 bytes
  • Both: verify seed logs match claimed BPP values
  • Decide submission order: Lane A first (higher impact) or Lane B first (safer compliance)

🤖 Generated with Claude Code

abaybektursun and others added 5 commits March 28, 2026 08:32
…11473 (3-seed mean)

AR self-generated calibration (no val/train data during quantization).
Recreated from PR openai#728 at @valerio-oai's request for clarity.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…ptq-xsa-bigramhash3072

Record: AR Self-Gen GPTQ + XSA-all + BigramHash 3072×112 — val_bpb 1.11473 (3-seed mean)
Lane A: L-BFGS Causal SLOT in Logit Space — 1.0046 BPP (3-seed mean, std 0.0003)
  Novel: L-BFGS + logit-space + causal constraint (no existing PR combines these)
  Compliance: provably causal — passes flip test, NoesisGenesis conditions satisfied

Lane B: Discriminative TTT — 1.0807 BPP (3-seed mean, std 0.0005)
  Novel: per-block adaptive LR during pre-quant TTT (no existing PR does this)
  Compliance: Track A (fixed predictor) — zero eval-time adaptation

Both beat merged SOTA (1.1147) by large margins.
Seed logs pending extraction from Lepton.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Lane A: L-BFGS Causal SLOT in Logit Space — 1.0046 BPP (3-seed mean, std 0.0003)
  Novel: L-BFGS + logit-space + causal constraint (no existing PR combines these)
  Compliance: provably causal — passes flip test, NoesisGenesis conditions satisfied

Lane B: Discriminative TTT — 1.0807 BPP (3-seed mean, std 0.0005)
  Novel: per-block adaptive LR during pre-quant TTT (no existing PR does this)
  Compliance: Track A (fixed predictor) — zero eval-time adaptation

Both beat merged SOTA (1.1147) by large margins.
7 files per submission: train_gpt.py, README.md, submission.json, requirements.txt, 3 seed logs.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@resouer
Copy link
Copy Markdown
Owner Author

resouer commented Apr 4, 2026

Closing: need separate PRs per submission, squashed commits

@resouer resouer closed this Apr 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants