Non-record: 11L Int6 + Online Logit Bias (val_bpb=1.1609) by bopmite · Pull Request #330 · openai/parameter-golf

bopmite · 2026-03-21T09:09:45Z

No description provided.


MatoTeziTanka · 2026-04-11T20:10:15Z

Community Review — Non-record: 11L Int6 + Online Logit Bias (val_bpb=1.1609)

BPB: 1.1609 | Compliance: FLAG — Pre-Quant TTT runs multi-epoch on val_tokens with no score-first discipline

What I found in the code (head SHA 1c795387ef9b, file records/track_10min_16mb/2026-03-21_OnlineLogitBias_11L_Int6/train_gpt.py):

At line 1218 the pre-quant TTT function takes val_tokens as an input argument and runs an epoch loop over it with loss.backward()/optimizer.step(), with no prior torch.no_grad() scoring pass over the same tokens:

ttt_adapt(args, base_model, device, val_tokens, rank, world_size, log_fn) — for epoch in range(args.ttt_epochs), loss.backward() without prior no_grad score pass

Per Issue #402 and Issue #677 (@valerio-oai, 2026-03-27), TTT is valid only if each token is scored BEFORE the adapter trains on it; multi-epoch TTT that scores only on the final pass is explicitly called out as invalid. This implementation matches the pattern that closed PR #1376 (stukenov) and was subsequently confirmed in #1485/#1487/#1488/#1489/#1517/#1539 — see Issue #677 meta-comment from 2026-04-11 which lists the 6+ PRs in the cluster.
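To make the score-first rule concrete, here is a minimal sketch that contrasts the two orderings. It deliberately does not use the PR's model: a Laplace-smoothed unigram counter stands in for the adapter, and the function names `score_first_adapt` and `multi_epoch_then_score` are hypothetical, chosen only for this illustration. The point is the ordering of "score" and "update", not the model.

```python
import math
from collections import Counter


def score_first_adapt(tokens, vocab_size, chunk_size):
    """Score each chunk with the pre-update state, then adapt on it.

    Toy stand-in (assumption): the "adapter" is a Laplace-smoothed
    unigram counter. Returns average bits per token.
    """
    counts = Counter()
    seen = 0
    total_bits = 0.0
    for start in range(0, len(tokens), chunk_size):
        chunk = tokens[start:start + chunk_size]
        # 1) Score the chunk using only what was learned from earlier chunks.
        for tok in chunk:
            p = (counts[tok] + 1) / (seen + vocab_size)
            total_bits += -math.log2(p)
        # 2) Only now update the state on the chunk that was just scored.
        counts.update(chunk)
        seen += len(chunk)
    return total_bits / len(tokens)


def multi_epoch_then_score(tokens, vocab_size, epochs):
    """The flagged ordering: adapt on every token first, score afterwards."""
    counts = Counter()
    seen = 0
    for _ in range(epochs):
        counts.update(tokens)
        seen += len(tokens)
    total_bits = sum(
        -math.log2((counts[tok] + 1) / (seen + vocab_size)) for tok in tokens
    )
    return total_bits / len(tokens)


if __name__ == "__main__":
    toy_tokens = [0, 0, 0, 1] * 8  # hypothetical data, not val_tokens
    print("score-first :", score_first_adapt(toy_tokens, 4, 4))
    print("train-first :", multi_epoch_then_score(toy_tokens, 4, 3))
```

On this toy sequence the train-first variant reports fewer bits per token than the score-first variant, because every token it scores has already been seen during adaptation. That is the leak the ruling is aimed at.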

Contrast with the legal Pre-Quant TTT pattern (e.g. PR #1416 / PR #1423 lineage): those train the adapter on a held-out slice of training data (not val_tokens) with score-first-per-chunk discipline. The distinction is on the function signature itself — the argument tensor passed in.

CPU smoke test (CT2038 proteus-engine, 2026-04-11): import OK in 0.05s, dim=512, layers=9, vocab=1024, code=94438 B, SMOKE_TEST_PASS

Verdict: COMPLIANCE FLAG — same pattern as the closed Pre-Quant TTT cluster.

Recommendation to @cocohearts @valerio-oai @0hq @yuzhougu-oai @notapplica: CLOSE under the same ruling as #1376 and the rest of the cluster. A resubmission with the TTT function taking a training-data slice instead of val_tokens (per #1416/#1423 reference implementation) would be welcomed.

Reviewed by @MatoTeziTanka (The Agora). Classification via the deterministic AST-based classify_prs.py (pattern bank derived from ~65 manually reviewed PRs earlier in the 2026-04-11 sweep). This review was auto-drafted from a template and spot-checked before posting. If the template misread your code, please call it out so I can iterate on the classifier.
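For readers wondering what an AST-based check of this kind might look like, the following is a rough sketch using only Python's standard `ast` module. It is not the contents of classify_prs.py, which are not shown in this thread. The function name `flags_unscored_val_training` and the exact heuristic are assumptions made for illustration. It flags functions that accept a `val_tokens` parameter, call `.backward()` inside a loop, and never reference `no_grad` anywhere in the function.

```python
import ast


def flags_unscored_val_training(source, param="val_tokens"):
    """Return names of functions matching the flagged shape (heuristic only).

    A function is flagged when it (a) has a parameter named `param`,
    (b) calls `.backward()` somewhere inside a for/while loop, and
    (c) contains no attribute access named `no_grad`.
    """
    flagged = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            continue
        all_args = node.args.posonlyargs + node.args.args + node.args.kwonlyargs
        if param not in {a.arg for a in all_args}:
            continue
        # (b) any `.backward()` call nested inside a loop in this function
        trains_in_loop = any(
            isinstance(call, ast.Call)
            and isinstance(call.func, ast.Attribute)
            and call.func.attr == "backward"
            for loop in ast.walk(node)
            if isinstance(loop, (ast.For, ast.While))
            for call in ast.walk(loop)
        )
        # (c) any reference such as `torch.no_grad` in the function
        has_no_grad = any(
            isinstance(n, ast.Attribute) and n.attr == "no_grad"
            for n in ast.walk(node)
        )
        if trains_in_loop and not has_no_grad:
            flagged.append(node.name)
    return flagged


if __name__ == "__main__":
    sample = (
        "def ttt_adapt(args, model, val_tokens):\n"
        "    for epoch in range(args.ttt_epochs):\n"
        "        loss = model(val_tokens)\n"
        "        loss.backward()\n"
    )
    print(flags_unscored_val_training(sample))
```

A check like this is purely syntactic. The presence of `no_grad` does not prove that scoring happens before training on the same tokens, and a renamed parameter would slip past it, so a hit is a prompt for manual review, not a verdict.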

non-record: 11L int6 + OLB, val_bpb=1.1609

809785e

notapplica mentioned this pull request Mar 21, 2026

Parameter Golf Formerly Live AI Commentary ⛳ + Analysis / Ideas | every 10 minutes. Now disabled #140

Closed

bopmite added 2 commits March 25, 2026 16:34

add GPTQ, VRL, gated attention, n-gram confidence scaling, 100ep cosine TTT

96cb908

add logistic mixing, APM, bloom filter, kNN-LM, LoRA TTT, learned mixing

1c79538

bopmite wants to merge 3 commits into openai:main from bopmite:user/bopmite
