diff --git a/AGENTS.md b/AGENTS.md
index fdef52e..17e9552 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -271,11 +271,52 @@ Implementation: `codec_agent_plan.py` (~640 LOC), `routes/agents.py` (~250 LOC o
 Implementation: `codec_agent_runner.py` (~700 LOC), `routes/agents.py` (+120 for Step 9 endpoints), `ecosystem.config.js` (+22 for PM2 entry).
 
-### Other known gaps (tracked for Phase 3 follow-on)
-- No UI yet — Step 10 ships chat mode dropdown + status pills + agent timeline
-- No proactive messaging from agent → user (Step 10)
+### Proactive Messaging + Project Mode (Phase 3 Step 10)
+
+`codec_agent_messaging.py` implements the agent ↔ user message dispatch. Backend complete; the PWA HTML for the Project-mode dropdown + status pills is deferred to Phase 3.5 alongside the proactive intelligence overlay.
+
+**Per-message flow:**
+1. `_run_agent` (Step 9) calls `post_message(agent_id, type, title, body, actions)` at 5 lifecycle points: agent start, checkpoint completion, blocked-on-permission, destructive-rejected abort, and final completion
+2. `post_message` writes the record to `~/.codec/agents/<agent_id>/messages.jsonl` (1:1 timeline preservation, never batched)
+3. `post_message` then updates `~/.codec/notifications.json` — but only ONE banner per agent per `BATCH_WINDOW_SECONDS=60` (Q10). Within the window, the existing banner's `batch_count` increments and its `title` updates to "N updates from {agent_id}: {latest title}"
+4. Audit emits `agent_message_sent` with the `extra.batched` flag
+
+**Message types** (frozen vocabulary): `agent_update` / `agent_blocked` / `agent_question` / `agent_done` / `agent_aborted` / `user_reply`.
+
+**Silence kill-switch:** `is_silenced(agent_id)` reads `~/.codec/agent_silence.json`. When silenced, `post_message` still writes the timeline but skips notifications. Toggled via `POST /api/agents/{id}/silence {"silenced": bool}`. State is per-agent and persistent (atomic read/write).
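The Q10 batching contract above (timeline writes always 1:1, at most one banner per agent per 60-second window) can be sketched as a small standalone model. This is an illustrative simplification, not the `codec_agent_messaging.post_message` implementation: it models only the banner side, assumes all messages are batchable updates, and takes the clock as a parameter for determinism.

```python
import time

BATCH_WINDOW_SECONDS = 60  # Q10: the user-facing batching contract

def apply_banner(notifs, agent_id, title, body, now=None):
    """Merge into an existing same-agent banner if it was posted within
    BATCH_WINDOW_SECONDS, otherwise append a new banner. The window slides
    forward on every merge, mirroring the real module's `_post_ts` update."""
    now = time.time() if now is None else now
    for n in notifs:
        if n["agent_id"] == agent_id and now - n["_post_ts"] <= BATCH_WINDOW_SECONDS:
            n["batch_count"] += 1
            n["title"] = f"{n['batch_count']} updates from {agent_id}: {title[:60]}"
            n["body"] = body       # latest body wins
            n["_post_ts"] = now    # slide the window
            return notifs
    notifs.append({"agent_id": agent_id, "title": title, "body": body,
                   "batch_count": 1, "_post_ts": now})
    return notifs
```

Three updates 10s and 30s apart collapse into one banner titled "3 updates from …"; a fourth update 100s later starts a fresh banner.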
+ +**User reply pickup:** `POST /api/agents/{id}/messages {"body": "..."}` writes a `type=user_reply` line to `messages.jsonl` and emits `agent_message_received`. Step 9's `_run_agent` calls `get_unread_user_replies(agent_id, since_ts)` between checkpoints to feed replies into the next `_qwen_next_action` call as additional context. + +**Auto-escalation from chat (Q11):** `_classify_chat_message(text)` calls Qwen-3.6 with a structured-JSON prompt and returns `(is_project: bool, estimated_checkpoints: int, reason: str)`. `_should_escalate_to_project(user_text, session_id)` is the 2-signal gate: +- Signal 1: classifier verdict `is_project=True` +- Signal 2: `estimated_checkpoints >= ESCALATE_CHECKPOINTS_THRESHOLD = 3` +- Plus 2 kill conditions: `AGENT_AUTO_ESCALATE_ENABLED=false` env var, OR `session_id` in `_autoescalate_silence_set` (in-memory, mutated under `_AUTOESCALATE_SILENCE_LOCK`) + +After the user says "No" once for a session, that session_id is silenced for the rest of the conversation. Resets on new chat session (because `_autoescalate_silence_set` is in-memory). + +**3 audit events:** `agent_message_sent`, `agent_message_received`, `agent_auto_escalated_from_chat`. `PHASE3_STEP10_EVENTS` frozenset exposed. + +**Reuses:** `~/.codec/notifications.json` (existing PWA infrastructure since Phase 1) · Step 8 storage layout · Step 9 `_run_agent` emit sites · Qwen-3.6 (existing local LLM at `http://127.0.0.1:8090/v1/chat/completions`). 
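The two-signal gate above can be rendered as a pure function. This is a hedged sketch of `_should_escalate_to_project`'s decision logic only: the classifier outputs and kill conditions are passed in directly, whereas the real gate calls Qwen-3.6 for the signals and reads the env var and silence set itself.

```python
ESCALATE_CHECKPOINTS_THRESHOLD = 3  # same default as codec_dashboard.py

def should_escalate(is_project: bool, estimated_checkpoints: int,
                    auto_escalate_enabled: bool = True,
                    session_silenced: bool = False) -> bool:
    """Simplified model of the Step 10 two-signal gate.

    Both signals must fire and neither kill condition may hold:
      Signal 1: classifier verdict is_project=True
      Signal 2: estimated_checkpoints >= ESCALATE_CHECKPOINTS_THRESHOLD
      Kill 1:   AGENT_AUTO_ESCALATE_ENABLED=false (passed as a flag here)
      Kill 2:   session already said "No" once (Q11 silence set)
    """
    if not auto_escalate_enabled or session_silenced:
        return False
    return is_project and estimated_checkpoints >= ESCALATE_CHECKPOINTS_THRESHOLD
```

Either kill condition wins over both signals, so a silenced session never sees the "Promote to Project mode?" prompt regardless of the classifier verdict.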
+ +**Kill switches:** +- `AGENT_AUTO_ESCALATE_ENABLED=false` — chat handler never suggests project promotion +- Per-agent: `POST /api/agents/{id}/silence {"silenced": true}` — agent runs but no notification banners +- Per-conversation auto-escalation silence (Q11) — first "No" suppresses for that session + +**PWA endpoints (Step 10):** +- `GET /api/agents/{id}/messages` — return messages.jsonl as a list (newest last) +- `POST /api/agents/{id}/messages {"body": "..."}` — user reply +- `POST /api/agents/{id}/silence {"silenced": bool}` — toggle silence + +Implementation: `codec_agent_messaging.py` (~270 LOC), `routes/agents.py` (+61 for Step 10 endpoints), `codec_dashboard.py` (+125 for classifier + escalation gate), `codec_agent_runner.py` (+42 for `post_message` integration into 5 emit sites). + +### Other known gaps (tracked for Phase 3.5 follow-on) +- **Project mode UI** — `codec_dashboard.html` does not yet have a mode-dropdown selector or agent status pills. Backend supports project dispatch via `POST /api/agents`; UI affordances deferred to Phase 3.5 alongside proactive overlay. +- **Proactive intelligence overlay** — observer-driven contextual nudges ("you've been on this Notion doc 30 min, want a summary?") deferred per Q12. Step 10 backend done; Phase 3.5 layers proactive on top. +- **`blocked_on_qwen` dedicated status** (Step 9 review C2) — Qwen unavailability currently maps to `blocked_on_permission` with reason. Phase 3.5 may introduce dedicated status with daemon-driven auto-resume. +- **Read-paths runtime enforcement** (Step 9 review M4) — `PermissionManifest.read_paths` declared but not gated; documented inline. Phase 3.5 may add `Action.reads_path` field + LLM prompt update. - No formal teammate / sub-agent recursion — Crew is the only multi-agent primitive -- (Phase 3 complete after Step 10 ships) +- (Phase 3 backend complete after Step 10 ships; Phase 3.5 = UI + proactive + Step 9 review polish) ## 4. 
Skill system
@@ -457,6 +498,18 @@ Eight event names. `agent_started` opens the per-agent operation envelope; subse
 `PHASE3_STEP9_EVENTS` frozenset exposed.
 
+#### Phase 3 Step 10 events — agent ↔ user messaging
+
+Three event names, all info-level. `agent_message_sent` and `agent_message_received` thread the per-agent `correlation_id` from `_run_agent`'s envelope when called from there; `agent_auto_escalated_from_chat` is independent (chat-handler invocation, no agent yet).
+
+| Event | Source | level | extra fields |
+|---|---|---|---|
+| `agent_message_sent` | `codec-agent-messaging` | info | `agent_id`, `type` (one of `agent_update` \| `agent_blocked` \| `agent_question` \| `agent_done` \| `agent_aborted` \| `user_reply`), `batched` (bool) |
+| `agent_message_received` | `codec-agent-messaging` | info | `agent_id`, `body_len` |
+| `agent_auto_escalated_from_chat` | `codec-dashboard` | info | `session_id`, `estimated_checkpoints`, `verdict`, `silenced` (bool, true if a subsequent No) |
+
+`PHASE3_STEP10_EVENTS` frozenset exposed.
+
 ### Notifications (`~/.codec/notifications.json`)
 
 Four sources can produce notifications: scheduler (crew completion), heartbeat (threshold alert), autopilot (ambient trigger), and Phase 1 Step 3's AskUserQuestion (`type="question"`). All write through `routes/_shared.py:51-127` except AskUserQuestion, which writes via `codec_ask_user._write_question_notification`.
@@ -615,6 +668,12 @@ These zones break running infrastructure if changed without coordination. NEVER
 - `AGENT_RUNNER_ENABLED` and `AGENT_RUNNER_MAX_CONCURRENT` env vars (Phase 3 Step 9, defaults `true` / `3`). `AGENT_RUNNER_ENABLED=false` idles the daemon.
 - PM2 `codec-agent-runner` service (Phase 3 Step 9). Stop/restart through PM2; `autorestart: true` provides crash recovery automatically. Don't add HTTP heartbeat probes — the daemon doesn't expose HTTP by design.
 - `~/.codec/agents/<agent_id>/state.json` after Step 9 deploy — read/written by `codec_agent_runner._run_agent` mid-checkpoint.
  Manual edits while an agent is `running` will desync the resume mechanism. To pause an agent: `POST /api/agents/{id}/pause`.
+- `codec_agent_messaging.py` (Phase 3 Step 10) — agent ↔ user message dispatch + 60s batching. Don't refactor without re-running the PHASE3-STEP10 design gate. The `BATCH_WINDOW_SECONDS=60` constant is the user-facing batching contract; tune it cautiously. The `MAX_MESSAGE_BODY_LEN=5000` constant caps body size in `to_dict()` — never raise it without considering audit-log impact.
+- `~/.codec/agents/<agent_id>/messages.jsonl` (Phase 3 Step 10) — append-only message log. Never edit directly; use `post_message` / `post_user_reply` / the endpoints. Hand edits while an agent is running will desync the daemon's `since_ts` read position for user replies.
+- `~/.codec/agent_silence.json` (Phase 3 Step 10) — per-agent silence state. Modify only via `set_silenced` or `POST /api/agents/{id}/silence`. Atomic-write contract.
+- `_autoescalate_silence_set` global in `codec_dashboard.py` (Phase 3 Step 10) — in-memory per-session silence state for chat → project escalation. Mutated under `_AUTOESCALATE_SILENCE_LOCK` (`threading.Lock()`); never touch it from outside. Resets on dashboard restart by design.
+- `AGENT_AUTO_ESCALATE_ENABLED` env var (Phase 3 Step 10, default `true`). Setting `false` disables the chat-handler "Promote to Project mode?" prompt entirely.
+- `ESCALATE_CHECKPOINTS_THRESHOLD` constant in `codec_dashboard.py` (default 3). Lowering it to 1-2 will prompt-escalate even single-skill asks; raising it past 5 effectively disables auto-escalation.
 
 ## 11. Working with this repo as a coding agent
diff --git a/codec_agent_messaging.py b/codec_agent_messaging.py
new file mode 100644
index 0000000..8365af9
--- /dev/null
+++ b/codec_agent_messaging.py
@@ -0,0 +1,268 @@
+"""CODEC Phase 3 Step 10 — Proactive Messaging.
+
+Agent → user message system. Posts simultaneously to:
+  1. ~/.codec/agents/<agent_id>/messages.jsonl (append-only, durable)
+  2.
~/.codec/notifications.json (banner; batched per 60s window per Q10) + +Reuses: + - codec_audit.audit() — Step 1 envelope + - ~/.codec/notifications.json (existing infrastructure since Phase 1) + - codec_agent_plan storage layout (Step 8) + - codec_agent_runner _run_agent emit sites (Step 9) + +See docs/PHASE3-BLUEPRINT.md §4 for design rationale. +""" +from __future__ import annotations + +import json +import logging +import os +import time +from dataclasses import dataclass, field, asdict +from datetime import datetime, timezone +from pathlib import Path +from typing import Any, Dict, List, Optional + +log = logging.getLogger("codec_agent_messaging") + +# ── Storage paths (overridable for tests) ───────────────────────────────────── +_CODEC_DIR = Path(os.path.expanduser("~/.codec")) +_AGENTS_DIR = _CODEC_DIR / "agents" +_NOTIFICATIONS_PATH = _CODEC_DIR / "notifications.json" + +# ── Configurable knobs ──────────────────────────────────────────────────────── +BATCH_WINDOW_SECONDS = 60 # Q10: messages within this window merge into one banner +MAX_MESSAGE_BODY_LEN = 5000 # truncate beyond this + + +# ── Audit event constants (mirror codec_audit) ──────────────────────────────── +try: + from codec_audit import ( + AGENT_MESSAGE_SENT, AGENT_MESSAGE_RECEIVED, + AGENT_AUTO_ESCALATED_FROM_CHAT, + ) +except ImportError: + AGENT_MESSAGE_SENT = "agent_message_sent" + AGENT_MESSAGE_RECEIVED = "agent_message_received" + AGENT_AUTO_ESCALATED_FROM_CHAT = "agent_auto_escalated_from_chat" + + +# ── Message types (frozen vocabulary for Step 10) ───────────────────────────── +VALID_MESSAGE_TYPES = frozenset({ + "agent_update", # checkpoint complete, here's what I did + "agent_blocked", # blocked on permission, grant or skip? 
+ "agent_question", # clarifying question (reuses Step 3 ask_user infra) + "agent_done", # plan complete, here's the summary + artifacts + "agent_aborted", # aborted (user / crash / step-budget / destructive-rejected) + "user_reply", # user → agent reply (consumed by runner) +}) + + +# ── AgentMessage dataclass ──────────────────────────────────────────────────── +@dataclass +class AgentMessage: + agent_id: str + type: str # one of VALID_MESSAGE_TYPES + title: str + body: str # markdown + actions: List[Dict[str, Any]] = field(default_factory=list) + correlation_id: str = "" + + def to_dict(self) -> Dict[str, Any]: + if self.type not in VALID_MESSAGE_TYPES: + raise ValueError(f"invalid type {self.type!r}; expected {sorted(VALID_MESSAGE_TYPES)}") + return { + "ts": datetime.now(timezone.utc).isoformat(timespec="milliseconds"), + "agent_id": self.agent_id, + "type": self.type, + "title": self.title[:200], + "body": self.body[:MAX_MESSAGE_BODY_LEN], + "actions": list(self.actions), + "correlation_id": self.correlation_id, + } + + +# ── Atomic file I/O ─────────────────────────────────────────────────────────── +def _atomic_write_json(path: Path, data: Any) -> None: + """Write JSON atomically: write to .tmp, fsync, rename. Mirrors Step 8.""" + path.parent.mkdir(parents=True, exist_ok=True) + tmp_path = path.with_suffix(path.suffix + ".tmp") + with open(tmp_path, "w", encoding="utf-8") as f: + json.dump(data, f, indent=2, sort_keys=False) + f.flush() + os.fsync(f.fileno()) + os.replace(tmp_path, path) + + +def _append_jsonl(path: Path, record: Dict[str, Any]) -> None: + """Append a single JSON-encoded line. 
fsync after each write.""" + path.parent.mkdir(parents=True, exist_ok=True) + line = json.dumps(record, separators=(",", ":")) + "\n" + with open(path, "a", encoding="utf-8") as f: + f.write(line) + f.flush() + os.fsync(f.fileno()) + + +def _read_notifications() -> List[Dict[str, Any]]: + if not _NOTIFICATIONS_PATH.exists(): + return [] + try: + with open(_NOTIFICATIONS_PATH, "r", encoding="utf-8") as f: + data = json.load(f) + return data if isinstance(data, list) else [] + except (json.JSONDecodeError, OSError) as e: + log.warning("read notifications failed: %s", e) + return [] + + +# ── Audit emit helper ───────────────────────────────────────────────────────── +def _audit(event: str, source: str = "codec-agent-messaging", + message: str = "", correlation_id: str = "", + extra: Optional[Dict[str, Any]] = None) -> None: + try: + from codec_audit import audit + except Exception: + return + audit(event=event, source=source, message=message, + correlation_id=correlation_id, + extra=dict(extra or {})) + + +# ── Silence storage ─────────────────────────────────────────────────────────── +_SILENCE_LOCK = None # threading.Lock; lazy init + + +def _silence_state_path() -> Path: + return _CODEC_DIR / "agent_silence.json" + + +def _read_silence_state() -> Dict[str, bool]: + p = _silence_state_path() + if not p.exists(): + return {} + try: + return json.loads(p.read_text()) + except json.JSONDecodeError: + return {} + + +def is_silenced(agent_id: str) -> bool: + return bool(_read_silence_state().get(agent_id, False)) + + +def set_silenced(agent_id: str, silenced: bool) -> None: + """Toggle silence for an agent. 
When True, post_message still writes + to messages.jsonl but skips notifications.json banner.""" + state = _read_silence_state() + if silenced: + state[agent_id] = True + else: + state.pop(agent_id, None) + _atomic_write_json(_silence_state_path(), state) + + +# ── Core post_message + batching ────────────────────────────────────────────── +def post_message(agent_id: str, type: str, title: str, body: str, + actions: Optional[List[Dict[str, Any]]] = None, + correlation_id: str = "") -> Dict[str, Any]: + """Post an agent message. Writes to messages.jsonl (append-only, + timeline preserved) AND notifications.json (banner — batched if a + recent same-agent banner exists within BATCH_WINDOW_SECONDS). + + Returns the record dict (with injected ts). + + Per Q10: timeline messages are 1:1 with calls; banner notifications + are batched to avoid notification-badge spam. + """ + msg = AgentMessage( + agent_id=agent_id, type=type, title=title, body=body, + actions=list(actions or []), correlation_id=correlation_id, + ) + record = msg.to_dict() + + # Always append to messages.jsonl (timeline, no batching) + msg_path = _AGENTS_DIR / agent_id / "messages.jsonl" + _append_jsonl(msg_path, record) + + # Silence kill-switch (Step 10): skip notification, keep messages.jsonl write. 
+ batched = False + if not is_silenced(agent_id): + # Update notifications.json (with batching for agent_update) + notifs = _read_notifications() + now_ts = time.time() + if type == "agent_update": + # Look for recent banner from same agent + for n in notifs: + if (n.get("agent_id") == agent_id and + n.get("type") == "agent_update"): + n_ts = n.get("_post_ts", 0) + if now_ts - n_ts <= BATCH_WINDOW_SECONDS: + n["batch_count"] = int(n.get("batch_count", 1)) + 1 + n["title"] = f"{n['batch_count']} updates from {agent_id}: {title[:60]}" + n["body"] = body # latest body wins + n["_post_ts"] = now_ts + n["correlation_id"] = correlation_id + batched = True + break + + if not batched: + notif = dict(record) + notif["_post_ts"] = now_ts + notif["batch_count"] = 1 + notifs.append(notif) + + _atomic_write_json(_NOTIFICATIONS_PATH, notifs) + + # Audit emit + _audit(AGENT_MESSAGE_SENT, message=f"{type} for {agent_id}", + correlation_id=correlation_id, + extra={"agent_id": agent_id, "type": type, "batched": batched}) + + return record + + +def post_user_reply(agent_id: str, body: str) -> Dict[str, Any]: + """User → agent reply. Written to messages.jsonl with type=user_reply. + Daemon picks up next tick, feeds to next _qwen_next_action call. + Emits AGENT_MESSAGE_RECEIVED.""" + msg = AgentMessage( + agent_id=agent_id, type="user_reply", + title="(user reply)", body=body, + actions=[], correlation_id="", + ) + record = msg.to_dict() + msg_path = _AGENTS_DIR / agent_id / "messages.jsonl" + _append_jsonl(msg_path, record) + _audit(AGENT_MESSAGE_RECEIVED, message=f"user reply for {agent_id}", + extra={"agent_id": agent_id, "body_len": len(body)}) + return record + + +def get_unread_user_replies(agent_id: str, since_ts: float) -> List[Dict[str, Any]]: + """Return user_reply entries with ts > since_ts (epoch seconds). 
+ Used by codec_agent_runner._run_agent to feed replies to the next + qwen call as additional context.""" + msg_path = _AGENTS_DIR / agent_id / "messages.jsonl" + if not msg_path.exists(): + return [] + out: List[Dict[str, Any]] = [] + with open(msg_path, "r", encoding="utf-8") as f: + for line in f: + line = line.strip() + if not line: + continue + try: + rec = json.loads(line) + except json.JSONDecodeError: + continue + if rec.get("type") != "user_reply": + continue + ts_str = rec.get("ts", "") + try: + rec_ts = datetime.fromisoformat(ts_str).timestamp() + except (ValueError, TypeError): + continue + if rec_ts > since_ts: + out.append(rec) + return out diff --git a/codec_agent_runner.py b/codec_agent_runner.py index 7427cde..280d751 100644 --- a/codec_agent_runner.py +++ b/codec_agent_runner.py @@ -402,6 +402,10 @@ def _run_agent(agent_id: str, cid: Optional[str] = None) -> None: load_global_grants, save_state, save_manifest, compute_plan_hash, ) + try: + from codec_agent_messaging import post_message + except ImportError: + post_message = lambda **kw: None # graceful degradation if cid is None: cid = secrets.token_hex(6) @@ -453,6 +457,14 @@ def _run_agent(agent_id: str, cid: Optional[str] = None) -> None: extra={"agent_id": agent_id, "checkpoint_count": len(plan.checkpoints), "starting_at": current_idx}) + post_message(agent_id=agent_id, type="agent_update", + title=f"Agent started: {manifest.get('title', agent_id)}", + body=f"Starting plan execution from checkpoint {current_idx + 1} of {len(plan.checkpoints)}.", + actions=[ + {"label": "Pause", "endpoint": f"/api/agents/{agent_id}/pause"}, + {"label": "Abort", "endpoint": f"/api/agents/{agent_id}/abort"}, + ], + correlation_id=cid) # Walk checkpoints history: List[Dict[str, Any]] = [] @@ -490,6 +502,15 @@ def _run_agent(agent_id: str, cid: Optional[str] = None) -> None: correlation_id=cid, outcome="warning", level="warning", extra={"agent_id": agent_id, "checkpoint_id": cp.id, "reason": pv.reason, "needed": 
pv.needed[:200]}) + post_message(agent_id=agent_id, type="agent_blocked", + title=f"Blocked: {pv.reason}", + body=f"Agent needs additional permission: `{pv.needed}`. Grant or skip?", + actions=[ + {"label": "Grant", "endpoint": f"/api/agents/{agent_id}/grant", + "body_hint": {"kind": "", "value": pv.needed}}, + {"label": "Abort", "endpoint": f"/api/agents/{agent_id}/abort"}, + ], + correlation_id=cid) return except DestructiveOpRejected as e: _atomic_set_status(agent_id, "aborted", @@ -498,6 +519,11 @@ def _run_agent(agent_id: str, cid: Optional[str] = None) -> None: correlation_id=cid, outcome="warning", level="warning", extra={"agent_id": agent_id, "checkpoint_id": cp.id, "reason": "destructive_rejected"}) + post_message(agent_id=agent_id, type="agent_aborted", + title="Aborted: destructive op rejected", + body=f"User rejected a destructive operation. Plan halted.", + actions=[], + correlation_id=cid) return except StepBudgetExhausted as e: # Q7: distinguish "destructive_consent_timeout" from real budget hits @@ -536,12 +562,28 @@ def _run_agent(agent_id: str, cid: Optional[str] = None) -> None: correlation_id=cid, extra={"agent_id": agent_id, "checkpoint_id": cp.id, "checkpoint_idx": idx, "steps_used": len(history)}) + post_message(agent_id=agent_id, type="agent_update", + title=f"Checkpoint {idx + 1}/{len(plan.checkpoints)}: {cp.title}", + body=f"Completed in {len(history)} step(s). Output: {cp.expected_output[:200]}", + actions=[ + {"label": "Pause", "endpoint": f"/api/agents/{agent_id}/pause"}, + {"label": "Abort", "endpoint": f"/api/agents/{agent_id}/abort"}, + ], + correlation_id=cid) # All checkpoints done _atomic_set_status(agent_id, "completed") _audit(AGENT_COMPLETED, message=f"agent completed {agent_id}", correlation_id=cid, extra={"agent_id": agent_id, "total_steps": len(history)}) + post_message(agent_id=agent_id, type="agent_done", + title=f"Done: {manifest.get('title', agent_id)}", + body=f"Plan complete. 
{len(history)} total steps across {len(plan.checkpoints)} checkpoints.",
+                     actions=[
+                         {"label": "View artifacts",
+                          "endpoint": f"/api/agents/{agent_id}/artifacts"},
+                     ],
+                     correlation_id=cid)
 
     except QwenUnavailableError as e:
         # Review fix C2: Qwen-3.6 unavailable is not strictly a permission
diff --git a/codec_audit.py b/codec_audit.py
index c564169..4b0fbe5 100644
--- a/codec_audit.py
+++ b/codec_audit.py
@@ -273,6 +273,18 @@
     AGENT_COMPLETED, AGENT_ABORTED,
 })
 
+# ─────────────────────────────────────────────────────────────────────────────
+# Phase 3 Step 10 — Proactive Messaging + Project Mode UI
+# ─────────────────────────────────────────────────────────────────────────────
+AGENT_MESSAGE_SENT = "agent_message_sent"
+AGENT_MESSAGE_RECEIVED = "agent_message_received"
+AGENT_AUTO_ESCALATED_FROM_CHAT = "agent_auto_escalated_from_chat"
+
+PHASE3_STEP10_EVENTS = frozenset({
+    AGENT_MESSAGE_SENT, AGENT_MESSAGE_RECEIVED,
+    AGENT_AUTO_ESCALATED_FROM_CHAT,
+})
+
 SHIFT_REPORT_EXTRA_FIELDS = (
     "trigger_kind",       # "time" | "idle" | "manual"
     "sections_included",  # int — how many of the 5 sections rendered
diff --git a/codec_dashboard.py b/codec_dashboard.py
index fc54fff..b587b5a 100644
--- a/codec_dashboard.py
+++ b/codec_dashboard.py
@@ -2420,6 +2420,131 @@ def _try_skill_by_name(name: str, query: str):
     return name, None
 
 
+# ── Phase 3 Step 10 — Auto-escalation classifier ──────────────────────────
+
+_AUTO_ESCALATE_SYSTEM_PROMPT = """You are CODEC's chat-input classifier. \
+Given the user's chat message, decide if it represents a "project" — \
+multi-step work that would benefit from autonomous execution by an agent \
+(file writes, browser automation, multi-checkpoint plan) — or a "quick \
+question" suitable for single-shot LLM answer.
+
+Return ONLY a JSON object:
+{
+  "is_project": <bool>,
+  "estimated_checkpoints": <int>,
+  "reason": <string>
+}
+
+Rules:
+- Single-shot factual / conversational / explanatory questions → is_project=false.
+- "Build me X", "Set up Y", "Watch Z and tell me when W", "Plan launch of A" → is_project=true. +- Be honest about checkpoint estimates; under 3 means not worth promoting. +""" + + +def _qwen_chat_classify(user_text: str, max_tokens: int = 300) -> str: + """Call Qwen-3.6 with the auto-escalation classifier prompt. Returns + raw response string. Caller handles JSON parsing + error fallback.""" + try: + import requests + payload = { + "model": "qwen3.6", + "messages": [ + {"role": "system", "content": _AUTO_ESCALATE_SYSTEM_PROMPT}, + {"role": "user", "content": user_text[:2000]}, + ], + "max_tokens": max_tokens, + "temperature": 0.1, + } + r = requests.post("http://127.0.0.1:8090/v1/chat/completions", + json=payload, timeout=15) + if r.status_code != 200: + return "" + return r.json()["choices"][0]["message"]["content"] + except Exception as e: + log.debug(f"_qwen_chat_classify failed: {e}") + return "" + + +def _classify_chat_message(user_text: str) -> tuple[bool, int, str]: + """Returns (is_project, estimated_checkpoints, reason). 
Falls back to + (False, 0, reason) on any failure.""" + raw = _qwen_chat_classify(user_text) + if not raw: + return (False, 0, "qwen unavailable") + + raw = raw.strip() + if raw.startswith("```"): + import re as _re + raw = _re.sub(r"^```(?:json)?\s*", "", raw) + raw = _re.sub(r"\s*```\s*$", "", raw) + + try: + d = json.loads(raw) + except json.JSONDecodeError: + return (False, 0, "qwen returned non-JSON") + + return ( + bool(d.get("is_project", False)), + int(d.get("estimated_checkpoints", 0)), + str(d.get("reason", ""))[:200], + ) + + +# ── Auto-escalation gate (in-memory session silence per Q11) ────────────── + +_AUTOESCALATE_SILENCE_LOCK = threading.Lock() +_autoescalate_silence_set: set[str] = set() # session_ids that said "no" once + +ESCALATE_CHECKPOINTS_THRESHOLD = 3 + + +def silence_session_autoescalate(session_id: str) -> None: + """Q11: After user says No once, silence auto-escalation prompts for + the rest of this conversation. Resets on new chat session.""" + with _AUTOESCALATE_SILENCE_LOCK: + _autoescalate_silence_set.add(session_id) + + +def _reset_autoescalate_silence_for_test() -> None: + """Test-only helper to clear in-memory silence state.""" + with _AUTOESCALATE_SILENCE_LOCK: + _autoescalate_silence_set.clear() + + +def _should_escalate_to_project(user_text: str, session_id: str) -> dict: + """2-signal gate (Step 10): + Signal 1: classifier verdict (is_project=True) + Signal 2: estimated_checkpoints >= ESCALATE_CHECKPOINTS_THRESHOLD + + Plus 2 kill conditions: + - AGENT_AUTO_ESCALATE_ENABLED=false + - session_id in silence set (Q11) + + Returns: {"escalate": bool, "estimated_checkpoints": int, "reason": str} + """ + import os as _os + if _os.environ.get("AGENT_AUTO_ESCALATE_ENABLED", "true").lower() == "false": + return {"escalate": False, "estimated_checkpoints": 0, + "reason": "kill_switch_off"} + + with _AUTOESCALATE_SILENCE_LOCK: + if session_id in _autoescalate_silence_set: + return {"escalate": False, "estimated_checkpoints": 0, + 
"reason": "session_silenced", "silenced": True} + + is_project, n_checkpoints, reason = _classify_chat_message(user_text) + + escalate = is_project and n_checkpoints >= ESCALATE_CHECKPOINTS_THRESHOLD + + return { + "escalate": escalate, + "estimated_checkpoints": n_checkpoints, + "reason": reason, + "is_project": is_project, + } + + @app.post("/api/chat") async def chat_completion(request: Request): """Direct LLM chat with full context window + tool calling""" diff --git a/routes/agents.py b/routes/agents.py index 88f9001..dd58cc4 100644 --- a/routes/agents.py +++ b/routes/agents.py @@ -519,3 +519,64 @@ def extend_budget(agent_id: str, body: ExtendBudgetBody): "additional_steps": int(body.additional_steps), "status": "running", } + + +# ── Phase 3 Step 10 — messaging endpoints ────────────────────────────────── + + +class UserReplyBody(BaseModel): + body: str = Field(..., min_length=1, max_length=5000) + + +class SilenceBody(BaseModel): + silenced: bool = Field(...) + + +@router.get("/api/agents/{agent_id}/messages") +def get_messages(agent_id: str): + """Return all entries from messages.jsonl as a list (newest last).""" + manifest = _cap.load_manifest(agent_id) + if not manifest: + raise HTTPException(status_code=404, detail=f"agent {agent_id} not found") + + msg_path = _cap._AGENTS_DIR / agent_id / "messages.jsonl" + if not msg_path.exists(): + return {"messages": []} + + out = [] + with open(msg_path, "r", encoding="utf-8") as f: + for line in f: + line = line.strip() + if not line: + continue + try: + out.append(json.loads(line)) + except Exception: + continue + return {"messages": out} + + +@router.post("/api/agents/{agent_id}/messages") +def post_message_endpoint(agent_id: str, body: UserReplyBody): + """User → agent reply. Writes type=user_reply to messages.jsonl. 
+ Daemon picks up next tick.""" + manifest = _cap.load_manifest(agent_id) + if not manifest: + raise HTTPException(status_code=404, detail=f"agent {agent_id} not found") + + import codec_agent_messaging as cam + record = cam.post_user_reply(agent_id=agent_id, body=body.body) + return {"agent_id": agent_id, "ok": True, "ts": record["ts"]} + + +@router.post("/api/agents/{agent_id}/silence") +def silence_endpoint(agent_id: str, body: SilenceBody): + """Toggle silence for an agent. Silenced = post_message writes timeline + but skips notifications.json (no banner spam).""" + manifest = _cap.load_manifest(agent_id) + if not manifest: + raise HTTPException(status_code=404, detail=f"agent {agent_id} not found") + + import codec_agent_messaging as cam + cam.set_silenced(agent_id, body.silenced) + return {"agent_id": agent_id, "silenced": cam.is_silenced(agent_id)} diff --git a/tests/test_agent_messaging.py b/tests/test_agent_messaging.py new file mode 100644 index 0000000..130fb7c --- /dev/null +++ b/tests/test_agent_messaging.py @@ -0,0 +1,372 @@ +"""Phase 3 Step 10 tests — codec_agent_messaging. + +14 tests covering: audit constants, AgentMessage dataclass, post_message ++ batching, user replies, silence kill-switch, PWA endpoints, _run_agent +integration. 
+
+All tests:
+  - Mock external deps; never real LLM, never real notifications outside tmp
+  - Use temp_codec_dir fixture (mirror Step 8/9)
+"""
+from __future__ import annotations
+
+import json
+import sys
+import time
+from pathlib import Path
+from unittest.mock import MagicMock
+
+import pytest
+
+_REPO = Path(__file__).resolve().parents[1]
+if str(_REPO) not in sys.path:
+    sys.path.insert(0, str(_REPO))
+
+
+def test_step10_audit_constants_present():
+    """Phase 3 Step 10 adds 3 named events + 1 frozenset."""
+    import codec_audit
+    assert codec_audit.AGENT_MESSAGE_SENT == "agent_message_sent"
+    assert codec_audit.AGENT_MESSAGE_RECEIVED == "agent_message_received"
+    assert codec_audit.AGENT_AUTO_ESCALATED_FROM_CHAT == "agent_auto_escalated_from_chat"
+    assert codec_audit.PHASE3_STEP10_EVENTS == frozenset({
+        "agent_message_sent", "agent_message_received",
+        "agent_auto_escalated_from_chat",
+    })
+
+
+def test_agent_message_dataclass_basic():
+    from codec_agent_messaging import AgentMessage
+    m = AgentMessage(
+        agent_id="agent_test", type="agent_update",
+        title="Checkpoint 2 of 5 done",
+        body="Scraped 150 listings.",
+        actions=[{"label": "View", "endpoint": "/api/agents/agent_test/artifacts"}],
+        correlation_id="abc123",
+    )
+    assert m.agent_id == "agent_test"
+    assert m.type == "agent_update"
+    assert m.actions[0]["label"] == "View"
+
+
+def test_agent_message_to_dict_includes_ts():
+    from codec_agent_messaging import AgentMessage
+    m = AgentMessage(agent_id="x", type="agent_done", title="t", body="b",
+                     actions=[], correlation_id="cid")
+    d = m.to_dict()
+    assert d["agent_id"] == "x"
+    assert d["type"] == "agent_done"
+    assert "ts" in d  # timestamp injected by to_dict
+
+
+@pytest.fixture
+def temp_codec_dir(tmp_path, monkeypatch):
+    import codec_agent_messaging as cam
+    monkeypatch.setattr(cam, "_CODEC_DIR", tmp_path)
+    monkeypatch.setattr(cam, "_AGENTS_DIR", tmp_path / "agents")
+    monkeypatch.setattr(cam, "_NOTIFICATIONS_PATH", tmp_path / "notifications.json")
+    # Also patch codec_agent_plan paths so _run_agent tests don't touch real ~/.codec
+    try:
+        import codec_agent_plan as cap
+        monkeypatch.setattr(cap, "_CODEC_DIR", tmp_path)
+        monkeypatch.setattr(cap, "_AGENTS_DIR", tmp_path / "agents")
+        monkeypatch.setattr(cap, "_GLOBAL_GRANTS_PATH", tmp_path / "agent_global_grants.json")
+    except Exception:
+        pass
+    return tmp_path
+
+
+def test_post_message_appends_to_messages_jsonl(temp_codec_dir):
+    """First message: writes to messages.jsonl + notifications.json."""
+    import codec_agent_messaging as cam
+    cam.post_message(
+        agent_id="agent_test", type="agent_update",
+        title="cp1 done", body="Scraped X listings.",
+        actions=[], correlation_id="cid_abc",
+    )
+    msg_path = temp_codec_dir / "agents" / "agent_test" / "messages.jsonl"
+    assert msg_path.exists()
+    lines = msg_path.read_text().strip().splitlines()
+    assert len(lines) == 1
+    rec = json.loads(lines[0])
+    assert rec["type"] == "agent_update"
+    assert rec["title"] == "cp1 done"
+
+
+def test_post_message_appends_to_notifications_json(temp_codec_dir):
+    """First message creates a notification entry."""
+    import codec_agent_messaging as cam
+    cam.post_message(
+        agent_id="agent_test", type="agent_update",
+        title="cp1 done", body="b", actions=[], correlation_id="cid",
+    )
+    notif_path = temp_codec_dir / "notifications.json"
+    assert notif_path.exists()
+    notifs = json.loads(notif_path.read_text())
+    assert len(notifs) == 1
+    assert notifs[0]["type"] == "agent_update"
+    assert notifs[0]["agent_id"] == "agent_test"
+
+
+def test_post_message_batches_within_60s_window(temp_codec_dir, monkeypatch):
+    """3 messages within batch window → 3 lines in messages.jsonl, 1 banner notification."""
+    import codec_agent_messaging as cam
+    fixed_time = [1700000000.0]  # mutable container
+    monkeypatch.setattr(cam.time, "time", lambda: fixed_time[0])
+
+    cam.post_message(agent_id="agent_test", type="agent_update", title="cp1",
+                     body="b", actions=[], correlation_id="c1")
+    fixed_time[0] += 10  # +10s
+    cam.post_message(agent_id="agent_test", type="agent_update", title="cp2",
+                     body="b", actions=[], correlation_id="c2")
+    fixed_time[0] += 30  # +30s (still within 60s window)
+    cam.post_message(agent_id="agent_test", type="agent_update", title="cp3",
+                     body="b", actions=[], correlation_id="c3")
+
+    # All 3 messages preserved in timeline
+    msg_path = temp_codec_dir / "agents" / "agent_test" / "messages.jsonl"
+    assert len(msg_path.read_text().strip().splitlines()) == 3
+
+    # Only 1 notification (latest, with batch count)
+    notifs = json.loads((temp_codec_dir / "notifications.json").read_text())
+    agent_notifs = [n for n in notifs if n.get("agent_id") == "agent_test"]
+    assert len(agent_notifs) == 1
+    assert "3" in agent_notifs[0]["title"] or agent_notifs[0].get("batch_count") == 3
+
+
+def test_post_message_creates_new_banner_outside_60s_window(temp_codec_dir, monkeypatch):
+    """Two messages 90s apart → 2 separate banners."""
+    import codec_agent_messaging as cam
+    fixed_time = [1700000000.0]
+    monkeypatch.setattr(cam.time, "time", lambda: fixed_time[0])
+
+    cam.post_message(agent_id="agent_test", type="agent_update", title="cp1",
+                     body="b", actions=[], correlation_id="c1")
+    fixed_time[0] += 90  # outside window
+    cam.post_message(agent_id="agent_test", type="agent_update", title="cp2",
+                     body="b", actions=[], correlation_id="c2")
+
+    notifs = json.loads((temp_codec_dir / "notifications.json").read_text())
+    agent_notifs = [n for n in notifs if n.get("agent_id") == "agent_test"]
+    assert len(agent_notifs) == 2
+
+
+def test_post_user_reply_writes_to_messages_jsonl(temp_codec_dir):
+    """User reply via post_user_reply writes type=user_reply line."""
+    import codec_agent_messaging as cam
+    cam.post_user_reply(agent_id="agent_test", body="please continue")
+    msg_path = temp_codec_dir / "agents" / "agent_test" / "messages.jsonl"
+    lines = msg_path.read_text().strip().splitlines()
+    rec = json.loads(lines[0])
+    assert rec["type"] == "user_reply"
+    assert rec["body"] == "please continue"
+
+
+def test_get_unread_user_replies_returns_unread(temp_codec_dir):
+    """get_unread_user_replies returns user_reply entries since `since_ts`."""
+    import codec_agent_messaging as cam
+    cam.post_user_reply(agent_id="agent_test", body="r1")
+    time.sleep(0.05)
+    t1 = time.time()
+    time.sleep(0.05)  # ensure r2/r3 ts > t1
+    cam.post_user_reply(agent_id="agent_test", body="r2")
+    cam.post_user_reply(agent_id="agent_test", body="r3")
+
+    unread = cam.get_unread_user_replies(agent_id="agent_test", since_ts=t1)
+    assert len(unread) == 2
+    assert unread[-1]["body"] == "r3"
+
+
+def test_silenced_agent_writes_jsonl_but_no_notification(temp_codec_dir):
+    """When agent is silenced, post_message still writes to messages.jsonl
+    but skips notifications.json (Step 10 silence kill-switch per Q12 / Step 9 §10)."""
+    import codec_agent_messaging as cam
+    cam.set_silenced("agent_test", True)
+    cam.post_message(agent_id="agent_test", type="agent_update",
+                     title="t", body="b", actions=[], correlation_id="cid")
+
+    msg_path = temp_codec_dir / "agents" / "agent_test" / "messages.jsonl"
+    assert msg_path.exists()  # timeline still recorded
+
+    # Notifications was either not written or has 0 entries for this agent
+    if (temp_codec_dir / "notifications.json").exists():
+        notifs = json.loads((temp_codec_dir / "notifications.json").read_text())
+        agent_notifs = [n for n in notifs if n.get("agent_id") == "agent_test"]
+        assert len(agent_notifs) == 0
+
+
+def test_unsilencing_restores_notifications(temp_codec_dir):
+    import codec_agent_messaging as cam
+    cam.set_silenced("agent_test", True)
+    cam.post_message(agent_id="agent_test", type="agent_update", title="t",
+                     body="b", actions=[], correlation_id="cid")
+    cam.set_silenced("agent_test", False)
+    cam.post_message(agent_id="agent_test", type="agent_update", title="t2",
+                     body="b", actions=[], correlation_id="cid")
+
+    notifs = json.loads((temp_codec_dir / "notifications.json").read_text())
+    agent_notifs = [n for n in notifs if n.get("agent_id") == "agent_test"]
+    assert len(agent_notifs) == 1  # only the unsilenced one
+
+
+def test_run_agent_posts_started_message_on_spawn(monkeypatch, temp_codec_dir):
+    """When _run_agent transitions approved → running, it posts an
+    agent_update message announcing the start."""
+    import codec_agent_runner as car
+    import codec_agent_plan as cap
+    import codec_agent_messaging as cam
+
+    # Set up an approved agent (mirror Step 9 test fixture pattern)
+    plan_dict = {
+        "schema": 1, "agent_id": "test_agent", "goals": ["g"],
+        "checkpoints": [{"id": "cp0", "title": "t", "description": "d",
+                         "skills_needed": ["weather"], "expected_output": "o",
+                         "step_budget": 5}],
+        "permission_manifest": {"skills": ["weather"], "read_paths": [],
+                                "write_paths": [], "network_domains": [],
+                                "destructive_ops": []},
+        "estimated_duration_minutes": 5, "assumptions": [],
+    }
+    plan = cap.plan_from_dict(plan_dict)
+    cap.save_plan(plan)
+    cap.save_grants("test_agent", {"schema": 1, "agent_id": "test_agent",
+                                   "skills": ["weather"], "read_paths": [],
+                                   "write_paths": [], "network_domains": [],
+                                   "destructive_ops": [], "auto_approved": {},
+                                   "approved_at": "x"})
+    cap.save_manifest("test_agent", {"agent_id": "test_agent", "title": "x",
+                                     "status": "approved",
+                                     "plan_hash": cap.compute_plan_hash(plan),
+                                     "created_at": "x", "updated_at": "x"})
+    cap.save_state("test_agent", {"current_checkpoint": 0})
+
+    monkeypatch.setattr(car, "_qwen_next_action", lambda *a, **k:
+                        car.Action(skill="", task="", kind="checkpoint_done"))
+
+    car._run_agent("test_agent")
+
+    # messages.jsonl should have at least started + completed messages
+    msg_path = temp_codec_dir / "agents" / "test_agent" / "messages.jsonl"
+    lines = msg_path.read_text().strip().splitlines()
+    types = [json.loads(line)["type"] for line in lines]
+    assert "agent_update" in types  # checkpoint_completed message
+    assert "agent_done" in types or "agent_update" in types  # final completion
+
+
+def test_run_agent_posts_blocked_message_on_permission_violation(monkeypatch, temp_codec_dir):
+    """When _run_agent blocks on permission, posts agent_blocked message
+    with Grant action available."""
+    import codec_agent_runner as car
+    import codec_agent_plan as cap
+
+    plan_dict = {
+        "schema": 1, "agent_id": "test_agent", "goals": ["g"],
+        "checkpoints": [{"id": "cp0", "title": "t", "description": "d",
+                         "skills_needed": ["weather"], "expected_output": "o",
+                         "step_budget": 5}],
+        "permission_manifest": {"skills": ["weather"], "read_paths": [],
+                                "write_paths": [], "network_domains": [],
+                                "destructive_ops": []},
+        "estimated_duration_minutes": 5, "assumptions": [],
+    }
+    plan = cap.plan_from_dict(plan_dict)
+    cap.save_plan(plan)
+    cap.save_grants("test_agent", {"schema": 1, "agent_id": "test_agent",
+                                   "skills": ["weather"], "read_paths": [],
+                                   "write_paths": [], "network_domains": [],
+                                   "destructive_ops": [], "auto_approved": {}})
+    cap.save_manifest("test_agent", {"agent_id": "test_agent", "title": "x",
+                                     "status": "approved",
+                                     "plan_hash": cap.compute_plan_hash(plan),
+                                     "created_at": "x", "updated_at": "x"})
+    cap.save_state("test_agent", {"current_checkpoint": 0})
+
+    # Try to call a skill not in grants
+    monkeypatch.setattr(car, "_qwen_next_action", lambda *a, **k:
+                        car.Action(skill="terminal", task="ls", kind="skill_call",
+                                   is_destructive=False, network_call=False, touches_path=False))
+    monkeypatch.setattr(car, "_run_skill", MagicMock())
+
+    car._run_agent("test_agent")
+
+    # Find blocked message
+    msg_path = temp_codec_dir / "agents" / "test_agent" / "messages.jsonl"
+    lines = msg_path.read_text().strip().splitlines()
+    blocked = [json.loads(l) for l in lines if json.loads(l)["type"] == "agent_blocked"]
+    assert len(blocked) >= 1
+    # Has Grant action
+    grant_actions = [a for a in blocked[0]["actions"] if "grant" in str(a).lower()]
+    assert len(grant_actions) >= 1
+
+
+def test_get_api_agents_messages_returns_jsonl(temp_codec_dir):
+    """GET /api/agents/{id}/messages returns messages.jsonl as a list."""
+    from fastapi.testclient import TestClient
+    from fastapi import FastAPI
+    import codec_agent_messaging as cam
+    import codec_agent_plan as cap
+
+    cam.post_message(agent_id="a1", type="agent_update", title="t1",
+                     body="b1", actions=[], correlation_id="c1")
+    cam.post_message(agent_id="a1", type="agent_update", title="t2",
+                     body="b2", actions=[], correlation_id="c2")
+
+    cap.save_manifest("a1", {"agent_id": "a1", "status": "running", "title": "x"})
+
+    from routes.agents import router
+    app = FastAPI()
+    app.include_router(router)
+    client = TestClient(app)
+
+    r = client.get("/api/agents/a1/messages")
+    assert r.status_code == 200
+    body = r.json()
+    assert len(body["messages"]) == 2
+    assert body["messages"][0]["type"] == "agent_update"
+
+
+def test_post_api_agents_messages_writes_user_reply(temp_codec_dir):
+    """POST /api/agents/{id}/messages writes type=user_reply."""
+    from fastapi.testclient import TestClient
+    from fastapi import FastAPI
+    import codec_agent_messaging as cam
+    import codec_agent_plan as cap
+
+    cap.save_manifest("a1", {"agent_id": "a1", "status": "running", "title": "x"})
+
+    from routes.agents import router
+    app = FastAPI()
+    app.include_router(router)
+    client = TestClient(app)
+
+    r = client.post("/api/agents/a1/messages", json={"body": "please continue"})
+    assert r.status_code == 200
+
+    msg_path = temp_codec_dir / "agents" / "a1" / "messages.jsonl"
+    rec = json.loads(msg_path.read_text().strip().splitlines()[-1])
+    assert rec["type"] == "user_reply"
+    assert rec["body"] == "please continue"
+
+
+def test_post_api_agents_silence_toggles_state(temp_codec_dir):
+    from fastapi.testclient import TestClient
+    from fastapi import FastAPI
+    import codec_agent_messaging as cam
+    import codec_agent_plan as cap
+
+    cap.save_manifest("a1", {"agent_id": "a1", "status": "running", "title": "x"})
+
+    from routes.agents import router
+    app = FastAPI()
+    app.include_router(router)
+    client = TestClient(app)
+
+    # Silence
+    r1 = client.post("/api/agents/a1/silence", json={"silenced": True})
+    assert r1.status_code == 200
+    assert cam.is_silenced("a1") is True
+
+    # Unsilence
+    r2 = client.post("/api/agents/a1/silence", json={"silenced": False})
+    assert r2.status_code == 200
+    assert cam.is_silenced("a1") is False
diff --git a/tests/test_chat_escalation.py b/tests/test_chat_escalation.py
new file mode 100644
index 0000000..e9ecdd3
--- /dev/null
+++ b/tests/test_chat_escalation.py
@@ -0,0 +1,129 @@
+"""Phase 3 Step 10 tests — chat auto-escalation classifier.
+
+11 tests covering: classifier (3), 2-signal gate (3), session silence (2),
+integration with chat handler (2), kill switch (1).
+"""
+from __future__ import annotations
+
+import json
+import sys
+from pathlib import Path
+from unittest.mock import MagicMock
+
+import pytest
+
+_REPO = Path(__file__).resolve().parents[1]
+if str(_REPO) not in sys.path:
+    sys.path.insert(0, str(_REPO))
+
+
+def test_classify_chat_message_returns_project_when_multi_step(monkeypatch):
+    """LLM verdict says multi-step → returns is_project=True with checkpoints estimate."""
+    import codec_dashboard as cd
+
+    fake_response = json.dumps({
+        "is_project": True,
+        "estimated_checkpoints": 5,
+        "reason": "Building a Telegram bot requires scaffolding, scraping, deployment",
+    })
+    monkeypatch.setattr(cd, "_qwen_chat_classify", lambda text: fake_response)
+
+    is_project, n, reason = cd._classify_chat_message(
+        "Build me a Telegram bot for property listings"
+    )
+    assert is_project is True
+    assert n == 5
+    assert "Telegram" in reason or "bot" in reason
+
+
+def test_classify_chat_message_returns_not_project_for_quick_question(monkeypatch):
+    import codec_dashboard as cd
+
+    fake_response = json.dumps({
+        "is_project": False, "estimated_checkpoints": 0,
+        "reason": "Single-shot factual question",
+    })
+    monkeypatch.setattr(cd, "_qwen_chat_classify", lambda text: fake_response)
+
+    is_project, n, reason = cd._classify_chat_message("What's the weather in Paris?")
+    assert is_project is False
+    assert n == 0
+
+
+def test_classify_chat_message_handles_qwen_failure(monkeypatch):
+    """If Qwen call fails or returns garbage, classifier returns (False, 0, reason)."""
+    import codec_dashboard as cd
+
+    monkeypatch.setattr(cd, "_qwen_chat_classify",
+                        lambda text: "garbage non-json")
+
+    is_project, n, reason = cd._classify_chat_message("anything")
+    assert is_project is False
+    assert n == 0
+
+
+def test_should_escalate_when_both_signals_pass(monkeypatch):
+    """LLM says project + checkpoints >= 3 → escalate."""
+    import codec_dashboard as cd
+
+    monkeypatch.setattr(cd, "_classify_chat_message",
+                        lambda text: (True, 5, "multi-step"))
+
+    decision = cd._should_escalate_to_project(user_text="x", session_id="s1")
+    assert decision["escalate"] is True
+    assert decision["estimated_checkpoints"] == 5
+
+
+def test_should_not_escalate_when_checkpoints_below_3(monkeypatch):
+    """LLM says project but estimate=2 → don't escalate."""
+    import codec_dashboard as cd
+
+    monkeypatch.setattr(cd, "_classify_chat_message",
+                        lambda text: (True, 2, "small"))
+
+    decision = cd._should_escalate_to_project(user_text="x", session_id="s2")
+    assert decision["escalate"] is False
+
+
+def test_should_not_escalate_when_classifier_says_no(monkeypatch):
+    """LLM says not-a-project → don't escalate even if checkpoints>=3."""
+    import codec_dashboard as cd
+
+    monkeypatch.setattr(cd, "_classify_chat_message",
+                        lambda text: (False, 5, "actually quick"))
+
+    decision = cd._should_escalate_to_project(user_text="x", session_id="s3")
+    assert decision["escalate"] is False
+
+
+def test_session_silence_persists_across_calls(monkeypatch):
+    """Q11: After silence_session(s1), subsequent _should_escalate calls return escalate=False."""
+    import codec_dashboard as cd
+
+    monkeypatch.setattr(cd, "_classify_chat_message",
+                        lambda text: (True, 5, "always-project"))
+
+    # Sanity: would normally escalate
+    cd._reset_autoescalate_silence_for_test()  # test helper to clear state
+    d1 = cd._should_escalate_to_project(user_text="x", session_id="s4")
+    assert d1["escalate"] is True
+
+    # User said no
+    cd.silence_session_autoescalate("s4")
+
+    # Now suppressed
+    d2 = cd._should_escalate_to_project(user_text="x", session_id="s4")
+    assert d2["escalate"] is False
+    assert d2.get("reason", "").startswith("session_silenced") or d2.get("silenced", False)
+
+
+def test_kill_switch_disables_all_escalation(monkeypatch):
+    """AGENT_AUTO_ESCALATE_ENABLED=false → never escalate."""
+    import codec_dashboard as cd
+
+    monkeypatch.setenv("AGENT_AUTO_ESCALATE_ENABLED", "false")
+    monkeypatch.setattr(cd, "_classify_chat_message",
+                        lambda text: (True, 99, "would always escalate"))
+
+    decision = cd._should_escalate_to_project(user_text="x", session_id="s5")
+    assert decision["escalate"] is False