Recursive Memory Kernel v1 — Implementation Checklist

# Recursive Memory Kernel v1 — Implementation Checklist

> Purpose: land the smallest tree-only v1 that is causally correct, observable, ablatable, and narrow enough to debug.
>
> Rule of thumb: do **not** add side-memory bank, learned merge scheduling, routing early-stop, or dense fallback attention until this checklist is complete.
>
> Status note: the implementation checklist below is complete through the current proving surface. This does **not** yet mean the architecture has been validated under learned training; the live ablation run so far is still an untrained baseline.

---

## PR 1 — Scaffold v2 module boundaries

**Title:** `scaffold fractal-v2 module boundaries`

### Deliverables
- [x] Add `LocalTrunk`
- [x] Add `LeafSummarizer`
- [x] Add `TreeMergeCell`
- [x] Add `FractalRouterHead`
- [x] Add `ReadFusion`
- [x] Add `FractalV2Model`

### Requirements
- [x] Do not disturb `recursive-kernel-v1`
- [x] No side-memory bank
- [x] No learned merge scheduling
- [x] No routing early-stop
- [x] No broad tournament integration beyond stub wiring

### Merge criteria
- [x] Workspace compiles
- [x] Module ownership is explicit
- [x] Public type boundaries are clear
- [x] Existing v1 path still runs unchanged

---

## PR 2 — Add typed v2 runtime state

**Title:** `add typed recursive-memory-kernel v2 state surfaces`

### Deliverables
- [x] Add multi-root recurrent state
- [x] Add live leaf state
- [x] Add sealed leaf summary store
- [x] Add tree level summary stores
- [x] Add exact leaf token cache
- [x] Add retrieval policy surface

### Requirements
- [x] Prefer backend-friendly tensor layouts in hot paths
- [x] Avoid pointer-heavy recursive state structures
- [x] Separate conceptual ownership from runtime storage
- [x] Sketch checkpoint/serialization boundary

### Merge criteria
- [x] State transitions are testable in isolation
- [x] Shapes are explicit
- [x] No hidden global state
- [x] Types are stable enough for later model wiring

---

## PR 3 — Multi-root local trunk baseline

**Title:** `implement multi-root local trunk baseline`

### Deliverables
- [x] Implement 2 to 4 root local processing
- [x] Support leaf size 16
- [x] Use a simple local recurrent/selective trunk
- [x] Keep root outputs independently inspectable

### Requirements
- [x] Strict autoregressive behavior
- [x] Same token stream reaches all roots
- [x] No tree retrieval yet
- [x] No exact leaf read yet

### Diagnostics
- [x] Root similarity / collapse metric
- [x] Per-root norm tracking
- [x] Per-root activation statistics

### Merge criteria
- [x] Forward pass works
- [x] Training step works
- [x] Single-root vs multi-root baseline is runnable
- [x] Roots do not immediately collapse in smoke tests

---

## PR 4 — Live leaf and sealed leaf mechanics

**Title:** `add live leaf append and sealed leaf summary path`

### Deliverables
- [x] Implement live leaf append path
- [x] Seal leaves at fixed size 16
- [x] Create sealed leaf summaries
- [x] Populate token-level cache for sealed leaves

### Requirements
- [x] Only sealed leaves enter global memory
- [x] Live leaf remains local-only
- [x] Exact leaf read target is meaningful for later use

### Tests
- [x] Incremental append correctness
- [x] Leaf sealing boundaries
- [x] Correct sealed leaf spans
- [x] No future-token leakage

### Merge criteria
- [x] Live/sealed split is stable
- [x] Leaf summaries are deterministic
- [x] Token cache contents are correct
- [x] Causality is preserved

---

## PR 5 — Regular dyadic summary tree

**Title:** `implement regular causal dyadic summary tree`

### Deliverables
- [x] Insert sealed leaves into level 0
- [x] Add deterministic parent merge
- [x] Store summaries level-major
- [x] Track span metadata at each level

### Requirements
- [x] Regular dyadic tree only
- [x] No learned merge gating
- [x] Every sealed leaf participates
- [x] Only prior sealed leaves are globally visible

### Tests
- [x] Incremental tree build matches reference recompute
- [x] Span metadata is correct at all levels
- [x] Causal visibility holds
- [x] Parent and level counts are correct

### Diagnostics
- [x] Nodes per level
- [x] Tree depth reached
- [x] Dead / unused node detection

### Merge criteria
- [x] Tree construction is correct
- [x] Incremental update path works
- [x] No causal violations
- [x] Summary tree is observable

---

## PR 6 — Sparse routing over the sealed tree

**Title:** `add sparse fractal router over sealed tree`

### Deliverables
- [x] Add 4 routing heads
- [x] Add beam width 2 routing
- [x] Score candidates top-down
- [x] Descend to selected sealed leaves

### Requirements
- [x] Candidate scoring is explicit
- [x] Normalize only over surviving candidates
- [x] No early-stop in first runnable version
- [x] Routing stays inspectable per head

### Diagnostics
- [x] Routing depth histogram
- [x] Candidate entropy per head
- [x] Selected span distance histogram
- [x] Head agreement / disagreement rate

### Tests
- [x] Routing touches sealed nodes only
- [x] Beam width is enforced
- [x] Behavior is deterministic under fixed seed

### Merge criteria
- [x] Routing is sparse
- [x] Routing is query-dependent
- [x] Heads do not all choose the same path by default
- [x] Retrieval path is ablatable

---

## PR 7 — Exact leaf read

**Title:** `implement exact local read for selected sealed leaves`

### Deliverables
- [x] Choose one exact-read mechanism:
  - [x] local attention over token-level K/V inside the selected leaf
  - [ ] pointer-style read over cached token states
  - [ ] copy-distribution read over leaf token positions
- [x] Wire exact read to selected routed leaves only

### Requirements
- [x] Keep the first mechanism simple
- [x] No dense global token cache fallback
- [x] Exact means token-level local access, not summary-only approximation

### Diagnostics
- [x] Fraction of steps using exact leaf read
- [x] Selected token-position distribution
- [x] Read concentration / entropy

### Tests
- [x] Exact reads target sealed leaves only
- [x] Exact read span matches routed leaf
- [x] Local token indices are correct

### Merge criteria
- [x] Exact read is real, not approximate
- [x] Retrieval behavior changes when enabled
- [x] Copy/retrieval probes improve relative to no-exact-read

---

## PR 8 — Read fusion and LM head wiring

**Title:** `wire read fusion into existing language model head`

### Deliverables
- [x] Fuse per-root recurrent outputs
- [x] Fuse routed tree values
- [x] Fuse exact leaf read values
- [x] Project fused output through LM head

### Requirements
- [x] Fusion logic is explicit and typed
- [x] Do not bury routing usefulness inside an opaque mixer
- [x] Keep all major sources easy to zero out for ablations

### Tests
- [x] Zeroing routed values changes behavior predictably
- [x] Zeroing exact leaf read changes behavior predictably
- [x] Zeroing extra roots changes behavior predictably

### Merge criteria
- [x] Full forward path works end-to-end
- [x] Each source path is individually ablatable
- [x] Logits are stable enough for smoke training

---

## PR 8.5 — Causal Memory Auditor

**Title:** `add causal memory auditor for counterfactual memory credit`

### Deliverables
- [x] Add full forward reference path
- [x] Add no-tree-read intervention
- [x] Add no-exact-leaf-read intervention
- [x] Add next-best-span substitution
- [x] Add root-drop intervention
- [x] Add structured reporting of utility deltas

### Requirements
- [x] Sampled, not always-on
- [x] Cheap enough for evaluation runs
- [x] Explicit intervention definitions
- [x] No silent mutation of reference forward behavior

### Diagnostics
- [x] Loss delta
- [x] Target-logit delta
- [x] KL divergence from full forward
- [x] Utility by root
- [x] Utility by routing depth
- [x] Utility by task family

### Tests
- [x] Each intervention preserves shape compatibility
- [x] No-tree-read removes only tree-summary contributions
- [x] No-exact-read removes only exact-read contributions
- [x] Next-best substitution respects routing depth
- [x] Root-drop removes only the selected root contribution

### Merge criteria
- [x] Tree retrieval utility is measurable
- [x] Exact leaf read utility is measurable
- [x] Root utility is measurable
- [x] Dead-weight tree behavior is detectable if present

---

## PR 9 — Synthetic task harness

**Title:** `add synthetic retrieval and copy probes for v2`

### Deliverables
- [x] Add copy task
- [x] Add associative recall task
- [x] Add induction task
- [x] Add noisy retrieval task
- [x] Add far-token comparison task

### Requirements
- [x] Tasks run quickly
- [x] Metrics are comparable across ablations
- [x] Do not wait for large LM training to evaluate architecture value

### Merge criteria
- [x] Each probe has a stable baseline
- [x] Probes can compare:
  - [x] no memory
  - [x] tree only
  - [x] tree + exact read
- [x] Results are logged in a repeatable format

---

## PR 10 — Benchmark and observability pass

**Title:** `add scaling benchmarks and v2 observability suite`

### Deliverables
- [x] Benchmark token append
- [x] Benchmark leaf sealing
- [x] Benchmark tree update
- [x] Benchmark routing
- [x] Benchmark exact leaf read
- [x] Benchmark end-to-end forward pass

### Sequence lengths
- [x] 256
- [x] 512
- [x] 1k
- [x] 2k
- [x] 4k
- [x] 8k

### Metrics
- [x] Tokens/sec
- [x] Wall-clock per forward
- [x] Peak memory
- [x] Routing sparsity
- [x] Root collapse metrics
- [x] Exact-read usage
- [x] Retrieval distance

### Merge criteria
- [x] Measured behavior trends toward intended scaling
- [x] Hot paths are identifiable
- [x] No accidental quadratic fallback is present

---

## Required ablations before calling v1 “real”

Run at equal total state / parameter budget:

- [x] single-root, no memory
- [x] multi-root, no memory
- [x] single-root, summaries only
- [x] single-root, sparse retrieval
- [x] single-root, sparse retrieval + exact leaf read
- [x] multi-root, summaries only
- [x] multi-root, sparse retrieval
- [x] multi-root, sparse retrieval without exact leaf read
- [x] multi-root, sparse retrieval + exact leaf read

**Do not skip these.**

---

# Deferred until after this checklist

Do **not** add yet:

- [ ] side-memory bank
- [ ] learned eviction
- [ ] learned merge scheduling
- [ ] routing early-stop
- [ ] giant-scale training
- [ ] dense fallback attention
- [ ] complexity added only to rescue weak early results

---

# Definition of done for first serious v1

The first serious v1 exists only when all of these are true:

- [x] multi-root local trunk works
- [x] leaf sealing is causal and correct
- [x] dyadic tree is stable and incremental
- [x] routing is sparse and query-dependent
- [x] exact leaf read is implemented and ablatable
- [x] causal memory auditing is implemented and reports useful deltas
- [x] synthetic retrieval/copy tasks run
- [x] scaling benchmarks exist
- [x] diagnostics expose collapse and dead-weight behavior
- [x] the architecture can be falsified cleanly

If these are not true yet, the work is still infrastructure, not a validated architecture.


Recursive Memory Kernel v1 — Implementation Checklist #6

Description

Recursive Memory Kernel v1 — Implementation Checklist

PR 1 — Scaffold v2 module boundaries

Deliverables

Requirements

Merge criteria

PR 2 — Add typed v2 runtime state

Deliverables

Requirements

Merge criteria

PR 3 — Multi-root local trunk baseline

Deliverables

Requirements

Diagnostics

Merge criteria

PR 4 — Live leaf and sealed leaf mechanics

Deliverables

Requirements

Tests

Merge criteria

PR 5 — Regular dyadic summary tree

Deliverables

Requirements

Tests

Diagnostics

Merge criteria

PR 6 — Sparse routing over the sealed tree

Deliverables

Requirements

Diagnostics

Tests

Merge criteria

PR 7 — Exact leaf read

Deliverables

Requirements

Diagnostics

Tests

Merge criteria

PR 8 — Read fusion and LM head wiring

Deliverables

Requirements

Tests

Merge criteria

PR 8.5 — Causal Memory Auditor

Deliverables

Requirements

Diagnostics

Tests

Merge criteria

PR 9 — Synthetic task harness

Deliverables

Requirements

Merge criteria

PR 10 — Benchmark and observability pass

Deliverables

Sequence lengths

Metrics

Merge criteria

Required ablations before calling v1 “real”

Deferred until after this checklist

Definition of done for first serious v1

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions