Commit 472c8d1
Takoda Mundy and claude committed
C180 audit 0600Z: all 3 world-novels remain novel, 0 comp PR collisions in last 24h
- Re-audit L05_norm_pct_dropout / L06_asymmetric_skip_init / L07_asym_label_smoothing → STILL world-novel
- Scanned ~30 recent comp PRs (openai#1440–openai#1463), zero direct collisions
- 6 pods alive, ~$14.80 spent, no layers LOCKed yet, 0 demotions

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 85ef789 commit 472c8d1

1 file changed: +38 -0 lines changed

RESEARCH_LOG.md

Lines changed: 38 additions & 0 deletions
@@ -2214,3 +2214,41 @@ Total runtime: 4 × 25 min = 100 min. Will complete before 23:00 UTC end of run.
### Spend

Pod uptime ~10h × $0.30/h = $3.00 raw + ops + H100 burn = **~$6.50 / $36 (18%)**. Plenty of headroom for both the multi-seed validation AND a successful H100 escalation cycle.

---

## AUDIT 20260408T0600Z (C180)

**Pods**: 6 alive (B/C/D/E/F/G), all run_forever + train_gpt PIDs healthy, all at HEAD 85ef789.
**Spend**: ~$8.10 session + $6.70 prior = ~$14.80 grand total / $36 cap (under soft cap $25, so normal mode).
**In-flight**: 6 experiments (1 per pod, 900s wallclock, mostly pod_filter L0x candidates).
**Layers locked**: 0 (no Section D LOCK lines yet; closest is L04_gated_attention with 5 confirmed-pass entries, but C60 has not promoted it).
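
The spend line above is simple arithmetic against two thresholds, and it can be sketched as a small check. This is a minimal illustration, not the project's actual tooling: `Budget`, `spend_status`, and the mode labels other than "normal" are hypothetical names, and the log does not say what happens once the soft cap is crossed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Budget:
    hard_cap: float  # total budget in USD ($36 in the log)
    soft_cap: float  # soft threshold in USD ($25 in the log)


def spend_status(session: float, prior: float, budget: Budget) -> dict:
    """Sum session and prior spend and classify it against the caps."""
    total = round(session + prior, 2)
    if total >= budget.hard_cap:
        mode = "over-cap"   # hypothetical label, not from the log
    elif total >= budget.soft_cap:
        mode = "soft-cap"   # hypothetical label, not from the log
    else:
        mode = "normal"     # the only mode the log names
    return {
        "total": total,
        "fraction_of_cap": total / budget.hard_cap,
        "mode": mode,
    }


# Figures taken from the Spend line of this audit.
status = spend_status(session=8.10, prior=6.70,
                      budget=Budget(hard_cap=36.0, soft_cap=25.0))
print(f"${status['total']:.2f} of $36 cap, mode={status['mode']}")
```

With the audit's figures this gives a total of $14.80, roughly 41% of the $36 cap, which is consistent with the "normal mode" note.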

### World-novel re-audit (3 candidates)
- **L05_norm_pct_dropout**: STILL world-novel. WebSearch returned "Biased Dropout" (magnitude per-unit) but NOT norm-percentile row filtering. GitHub 0 hits. Comp PRs 0 collisions.
- **L06_asymmetric_skip_init**: STILL world-novel. LMSC-UNet 2025 + Additive U-Net Jan 2026 work on gated additive skips, but NOT init=0.5 as info bottleneck. GitHub 0 hits. Comp PRs 0 collisions.
- **L07_asym_label_smoothing**: STILL world-novel. "Frequency-Aware Token Reduction" (Oct 2025) exists but is about token DROPPING, not asymmetric softmax smoothing on rare-vs-frequent classes. GitHub 0 hits. Comp PRs 0 collisions.

### Comp PR scan (last 24h, ~30 PRs)
- Scanned PRs #1440–#1463. All use known techniques (TTT, GPTQ, FlashMuon, MoE+BigramHash, ByteJEPA, TMA megakernel + parallel residuals, depth recurrence variants, EngramLite + Mousse).
- **NO PR collides with our 3 world-novels.** Zero direct hits.

### Demotions this cycle
NONE.

### World-novel WIN count after audit
**5 PROMOTION-READY** (all n=2 mean train_loss), of which 3 are world-novel WINs:
1. L02_coprime_stride (not world-novel: comp port, demoted to comp-novel in earlier audits)
2. L04_gated_attention (not world-novel: comp port)
3. **L05_norm_pct_dropout = 2.22795** ← world-novel WIN
4. **L06_asymmetric_skip_init = 2.2276** ← world-novel WIN
5. **L07_asym_label_smoothing = 2.22885** ← world-novel WIN
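
To make the ordering of the three world-novel candidates explicit, the listed means can be ranked directly. This is only a sketch over the numbers reported in the list above; the dictionary and variable names are illustrative and not taken from the project's code.

```python
# Mean train_loss values copied from the list above (each is an n=2 mean).
world_novel_means = {
    "L05_norm_pct_dropout": 2.22795,
    "L06_asymmetric_skip_init": 2.2276,
    "L07_asym_label_smoothing": 2.22885,
}

# Lower train_loss is better, so sort ascending.
ranked = sorted(world_novel_means.items(), key=lambda kv: kv[1])
best_name, best_loss = ranked[0]

# Gap between the best and worst of the three candidates.
spread = ranked[-1][1] - best_loss

for name, loss in ranked:
    print(f"{name:26s} {loss:.5f}  (+{loss - best_loss:.5f} vs best)")
print(f"spread across candidates: {spread:.5f}")
```

The three means sit within about 0.00125 of each other, so with only two seeds per candidate the ranking should be read as provisional.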

### Best single-run train_loss
**L04_gated_attention seed999 = 2.2148** (5-seed mean 2.22706).
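
The gap between the best single run and the 5-seed mean gives a rough sense of seed-to-seed variation. The sketch below back-solves the mean of the remaining four seeds from the two reported numbers. It assumes that seed999 is one of the five seeds in the mean, which the log implies but does not state; the variable names are illustrative.

```python
n_seeds = 5
mean_all = 2.22706     # reported 5-seed mean for L04_gated_attention
best_single = 2.2148   # reported seed999 result

# Assumption: seed999 is included in the 5-seed mean.
# sum(all) = n * mean, so the other seeds' mean follows by subtraction.
mean_others = (n_seeds * mean_all - best_single) / (n_seeds - 1)

# How far the best run sits below the overall mean.
gap = mean_all - best_single

print(f"mean of the other {n_seeds - 1} seeds: {mean_others:.5f}")
print(f"best run below 5-seed mean by: {gap:.5f}")
```

Under that assumption the other four seeds average about 2.2301, and the best run sits about 0.0123 below the 5-seed mean.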

### Next C180 actions
- No interaction-screen needed (0 layers locked).
- Continue C90 to ship more world-novels for L01 + L03 + L09 + L10.
- Mac/CPU worker pool still NOT running; track in next plan iteration.
