Skip to content

Add DeepFloor non-record submission candidate#1304

Draft
KenMalloy wants to merge 20 commits intoopenai:mainfrom
KenMalloy:deepfloor-submission
Draft

Add DeepFloor non-record submission candidate#1304
KenMalloy wants to merge 20 commits intoopenai:mainfrom
KenMalloy:deepfloor-submission

Conversation

@KenMalloy
Copy link
Copy Markdown

Summary

This PR adds a new non-record submission candidate: 2026-04-03_DeepFloor.

DeepFloor is a compact recurrent multi-view language model that replaces a large flat stack with:

  • repeated QKV+O recurrent blocks
  • two cross-token modes: periodic floor attention or a fused recurrent accumulator
  • explicit stability controls: contraction target, decay, TBPTT chunking, Jacobian penalty, periodic normalization, and optional HiPPO-style fused state cores
  • a frozen record-folder submission snapshot so the submission entrypoint runs without importing mutable repo-root code

Why this is interesting

This is not a leaderboard-quality run yet. The value of the submission is that it packages a real, unusual architecture direction that is reproducible inside the challenge record format:

  • depth recurrence / universal-transformer-style computation
  • multi-view recurrence instead of standard layer stacking
  • a self-contained submission path built for cold pods and remote validation

Included artifacts

  • README.md
  • RESULTS.md
  • submission.json
  • train_gpt.py
  • deepfloor_snapshot.py
  • freeze_submission_snapshot.py
  • runner scripts for local, smallbox, and fullbox validation
  • train log and candidate result json

Current checked-in evidence

Best real-enwik8 small-box matrix candidate:

  • fused_d32_v2
  • val_bpb = 7.9221
  • artifact_bytes = 8448

Frozen submission preflight:

  • bytes_total = 56221
  • bytes_model_estimated = 8192
  • bytes_code = 48029

Checked-in candidate submission:

  • submission.json built from candidate_result_seed1337.json
  • bytes_total = 56477
  • val_bpb = 7.9221

Validation

  • local submission preflight on the frozen record-folder snapshot
  • remote GPU submission preflight on a reused small pod
  • remote small-box suite on real enwik8
  • remote 8x H100 fullbox suite, with synced smoke, matrix, launch_logs, and evolution artifacts under runs/fullbox/

Fullbox evidence

Best fullbox recipe-search result:

  • frontier seed 2025
  • val_bpb = 4.1101
  • test_bpb = 4.0239
  • artifact_mb = 0.1377
  • mode: floor
  • state core: scalar_decay
  • recurrent dim: 96
  • views: 8

This is stronger evidence that DeepFloor is a real direction worth reviewing, but it is still not presented as a record claim or as three repeated fixed-candidate contest runs.

Submission lane

This should be reviewed as a track_non_record_16mb research submission, not a record claim.

KenMalloy and others added 20 commits April 1, 2026 21:17
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…mark

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
… confirmations

- estimate_artifact_bytes() now only counts parameters for the active
  cross_token_mode (floor or fused) and output path, not both
- run_deepfloor_recipe_evolution recomputes best from confirm_results
  sorted by val BPB, not just short-fitness winner
- Added tests for both: inactive branch exclusion and confirm re-ranking
- Updated design doc budget table to match QKV+O block (16K params)
- Added implementation plan doc

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@KenMalloy KenMalloy marked this pull request as draft April 10, 2026 22:49
@MatoTeziTanka
Copy link
Copy Markdown

MatoTeziTanka commented Apr 11, 2026

[RETRACTED 2026-04-11] — This IMPORT_FAIL was a false positive. Root cause: sibling module exists in same records/ folder; runner sys.path bug. Your code is not broken. See correction below: #1304 (comment)


Community Review — Add DeepFloor non-record submission candidate

Compliance: NEEDS AUTHOR ACTION — train_gpt.py fails to import on CT2038 (Python 3.10 / torch 2.10.0+cpu)

What I found: The CPU smoke test on CT2038 (proteus-engine, 128 GB RAM, Triton 3.6.0, flash_attn stub, cutlass_evt_fusion stub) failed at the import step with:

ModuleNotFoundError: No module named 'spectral_flood_walk_v0'

A few of the common patterns I've seen for this class of error in the 2026-04-11 sweep:

Recommendation: Could you run python3 -c "import py_compile; py_compile.compile('train_gpt.py')" on your records-folder train_gpt.py under Python 3.10 specifically? The eval image is Python 3.10 per Issue #17 / the README, so any parse error on 3.10 blocks the submission at import time before any of the scored-eval logic runs.

Once the parse/import issue is fixed, I'll re-run the compliance audit through the normal pipeline. No other flags identified yet because the audit halts at the import step.


Reviewed by @MatoTeziTankaThe Agora. CPU smoke test (CT2038 proteus-engine, 2026-04-11): IMPORT_FAIL — ModuleNotFoundError: No module named 'spectral_flood_walk_v0'. Classification via classify_prs.py AST-based classifier; full compliance audit deferred until the import issue is resolved. Auto-drafted from a template and spot-checked before posting.

@MatoTeziTanka
Copy link
Copy Markdown

Retraction — this IMPORT_FAIL was a bug in my smoke runner

Sorry @KenMalloy, this one's on me. I re-audited the IMPORT_FAIL I posted above and it was a false positive — the fault is in how my CPU smoke runner set up sys.path, not in your code.

What happened:

The runner imported your ./train_gpt.py with only the script's folder implicitly on sys.path, so when your file did from spectral_flood_walk_v0 import ... it couldn't resolve the sibling spectral_flood_walk_v0.py that lives in the same ./ directory. The error I reported — ModuleNotFoundError: No module named 'spectral_flood_walk_v0' — looked like a missing file, but I re-checked the head SHA b3169d4 and spectral_flood_walk_v0.py is right there, committed to the PR, next to train_gpt.py.

Verified at head b3169d4:

spectral_flood_walk_v0.py   ← sibling module, exists
./train_gpt.py   ← imports it

On the real eval image (Python 3.10, records/*/ as the working dir), this import resolves correctly because the records folder ends up on sys.path via the standard cwd-driven import or via the eval harness's per-record entry point.

Your PR is not broken by this error. I'm retracting the IMPORT_FAIL classification. I'll re-queue the full compliance audit (BPB check, n-gram / TTT / SLOT flags, etc.) on the current head and post findings separately.

Again — sorry for the noise. These community reviews only work if I actually read what I'm reviewing, and I didn't in this case.

@KenMalloy
Copy link
Copy Markdown
Author

This is a draft PR, and not my full bet. Please don't spend too much effort on it.

Keep up the good work.

@MatoTeziTanka
Copy link
Copy Markdown

Understood — noted for the backlog. The retraction of the IMPORT_FAIL still stands either way. When you're ready to promote it from draft, the compliance pass will pick it up in the next sweep.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants