-
Notifications
You must be signed in to change notification settings - Fork 2
Description
The Thesis
If the system can't improve itself, it's just a chatbot with extra steps. The personas are already discussing architecture, voting on proposals, and reviewing code in general chat. They need to close the loop: actually implement what they discuss.
The Chain (What Exists vs What's Missing)
| Step | Description | Status |
|---|---|---|
| 1. Discuss | Personas discuss architecture in chat | Working — happening organically |
| 2. Propose | Use decision system to vote on changes | Working — proposals + ranked voting |
| 3. Code | Use sentinel/coding-agent to implement | Working — tested e2e (PR #400, #407) |
| 4. Test | Run code/verify, code/shell/execute | Working — tools fixed (PR #400) |
| 5. Review | CodeReview AI reviews changes | Working — persona exists, tools available |
| 6. Commit/PR | workspace/git/commit, workspace/git/push | Working — workspace tools exist |
| 7. Learn | Academy trains on successful traces | Phase 5 — needs 5090 tower |
What's Missing: Autonomous Trigger
Steps 1-6 work mechanically but require a human to say "go implement that." The gap:
- Self-task generation (Self-task generation: personas don't create their own work yet #383): A persona reading chat discussion should recognize "this is something I can implement" and create a task for itself
- Proposal → implementation bridge: When a decision proposal is finalized, the winning approach should trigger a coding sentinel automatically
- Quality gate: Only merge code that passes tests + peer review from another persona
- Training capture: Every successful implement → test → merge cycle becomes training data for the next cycle
Acceptance Criteria
- Persona in general chat proposes an improvement to the codebase
- Other personas vote on the approach via decision system
- Winning approach triggers sentinel/coding-agent (autonomously or via simple command)
- Code is written, tested, committed to a workspace branch
- Another persona reviews the diff
- If approved, PR is created
- Interaction captured for LoRA training
The Proof
Leave the system running. Come back to find:
- A PR was created by a persona
- The code compiles and tests pass
- Another persona reviewed it
- The change addresses something the team discussed in chat
Why This Matters
The competitors' ceiling: they need smarter models forever.
Our ceiling: every task makes the next task better. A persona training for 6 months on YOUR codebase, fine-tuned on thousands of successful traces, running inside deterministic pipelines — is not competing with Claude Code. It's competing with a junior developer who memorized your entire codebase. And it works offline, costs nothing per token, and never takes a day off.
Related
- End-to-end coding pipeline: write → test → fix → commit loop never proven #370 (e2e coding pipeline — mechanical proof)
- Self-task generation: personas don't create their own work yet #383 (self-task generation — autonomy trigger)
- Personas cannot reliably write code — CodingAgent, tools, quality, full loop all broken #366 (personas can't reliably write code — sub-issues now mostly fixed)
- P10: Persona-sentinel deep integration #329 (persona-sentinel integration)
- Phase 5: Academy training on successful traces
- Phase 7: Autonomous persona life
- Phase 8: Distillation flywheel