CAPPY

AI-Powered Pattern Analysis for Security Investigations

CAPPY is a Rust-based MCP server that accelerates security product investigations from 20-30 minutes to 2-3 minutes. It provides 7 orchestrators through a single MCP entry point, a 540-pattern database with confidence-scored diagnostics, and an 8-phase investigation methodology with programmatic quality gates.

The Problem

Security operations investigations follow a predictable but time-consuming pattern: gather case context from ticketing systems, parse log bundles, search knowledge bases, cross-reference known issues, synthesize findings, and draft customer responses. Each step involves context-switching between 4-6 tools and manually maintaining investigation state.

CAPPY automates this pipeline through a meta-orchestrator that routes investigation tasks to domain-specific handlers, enforces evidence quality through 30+ validation hooks, and generates deliverables with citation-backed claims.

Architecture

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#ffffff', 'lineColor': '#6c757d'}}}%%
flowchart TD
    subgraph CLIENT ["Claude Code"]
        CC(["MCP Client"])
    end

    CC ==> CT

    subgraph CORE ["cappy-core (Rust)"]
        CT(["call_tool<br/>Meta-Orchestrator"])

        subgraph ORCH ["Orchestrators"]
            T(["triage_case<br/>Initial Triage"])
            A(["analyze_evidence<br/>Log & Bundle Analysis"])
            R(["research_topic<br/>Multi-Source Research"])
            S(["cappy_synthesis<br/>Hypothesis Generation"])
            V(["validate_solution<br/>Solution Validation"])
            G(["generate_deliverables<br/>Customer Response"])
        end

        subgraph HOOKS ["Hook Pipeline (30+)"]
            H1(["Pre: Validate, Gate, PII Guard"])
            H2(["Post: Evidence Chain, Confidence Audit"])
        end

        subgraph GW ["Enterprise Gateway"]
            J(["JIRA<br/>Case Context"])
            C(["Confluence<br/>Knowledge Base"])
        end

        subgraph AI ["AI Providers"]
            CL(["Claude"])
            GE(["Gemini"])
            OL(["Ollama<br/>(local, free)"])
        end

        subgraph SB ["Container Sandbox"]
            POD(["Podman / Docker<br/>Isolated Execution"])
        end

        CT --> H1 --> T & A & R & S & V & G
        T & A & R & S & V & G --> H2
        T & A & R & S & V & G --> GW
        T & A & R & S & V & G --> AI
        T & A & R & S & V & G --> SB
    end

    classDef client fill:#f8f9fa,color:#333,stroke:#6c757d,stroke-width:1.5px
    classDef meta fill:#4a90d9,color:#fff,stroke:#3a7bc8,stroke-width:2px
    classDef orch fill:#2d3436,color:#fff,stroke:#636e72,stroke-width:1px
    classDef gateway fill:#d4a034,color:#fff,stroke:#b8892d,stroke-width:2px
    classDef ai fill:#6c5ce7,color:#fff,stroke:#5a4bd6,stroke-width:2px
    classDef hooks fill:#00b894,color:#fff,stroke:#009a7d,stroke-width:2px
    classDef sandbox fill:#e17055,color:#fff,stroke:#c0392b,stroke-width:2px

    class CC client
    class CT meta
    class T,A,R,S,V,G orch
    class J,C gateway
    class CL,GE,OL ai
    class H1,H2 hooks
    class POD sandbox

Investigation Workflow

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#ffffff', 'lineColor': '#6c757d'}}}%%
flowchart LR
    subgraph TRIAGE ["Phase 0-1"]
        P0(["Initialize<br/>Case Setup"]) --> P1(["Triage<br/>Pattern Match"])
    end

    P1 ==> P2

    subgraph ANALYSIS ["Phase 2-3"]
        P2(["Evidence<br/>Log Analysis"]) --> P3(["Research<br/>Multi-Source"])
    end

    P3 ==> P4

    subgraph SYNTHESIS ["Phase 4-5"]
        P4(["Synthesis<br/>Hypothesis"]) --> P5(["Validate<br/>Solution Design"])
        P5 -.->|"hypothesis rejected"| P4
    end

    P5 ==> PG

    subgraph DELIVER ["Phase 6-7"]
        PG{"Quality<br/>Gate"} ==> P6(["Deliverables<br/>Customer Response"])
        PG -.->|"fail"| P4
        P6 --> P7(["Close<br/>JIRA Update"])
    end

    classDef triage fill:#4a90d9,color:#fff,stroke:#3a7bc8,stroke-width:2px
    classDef analysis fill:#00b894,color:#fff,stroke:#009a7d,stroke-width:2px
    classDef synth fill:#6c5ce7,color:#fff,stroke:#5a4bd6,stroke-width:2px
    classDef deliver fill:#d4a034,color:#fff,stroke:#b8892d,stroke-width:2px
    classDef gate fill:#2d3436,color:#fff,stroke:#636e72,stroke-width:2px

    class P0,P1 triage
    class P2,P3 analysis
    class P4,P5 synth
    class P6,P7 deliver
    class PG gate

Each phase has mandatory human-in-the-loop checkpoints and validation sub-skills that enforce quality gates — hypothesis coherence checks, evidence completeness thresholds, and escalation decision trees.

Features

Core Capabilities

Capability	Description
7 MCP Orchestrators	triage, evidence analysis, research, synthesis, validation, deliverables, meta-orchestration
540 Pattern Database	Known issue signatures with confidence levels (Definitive/Strong/Moderate) and causality chains
30+ Hook Pipeline	Pre/post validation on every tool call: PII detection, evidence chain tracking, claim verification
Container Sandbox	All tool executions isolated via Podman/Docker with read-only mounts and network isolation
3-Tier AI Routing	Ollama (free, local) -> Claude -> Gemini with automatic fallback
Enterprise Gateway	JIRA and Confluence integration for case context and knowledge retrieval
8-Phase Workflow	Structured investigation methodology with phase gates and HITL checkpoints

Hook Pipeline (Quality Enforcement)

The hook system is what separates CAPPY from a simple prompt wrapper. Every tool invocation passes through prioritized hooks:

Hook	Purpose
ParameterValidator	Validates tool inputs against schema
PhaseGate	Enforces investigation phase ordering
PiiGuard	Detects and redacts sensitive data before AI calls
EvidenceChain	Tracks citation provenance for all claims
ClaimCapture	Registers claims with verifiable citations
ConfidenceAuditor	Prevents unsupported confidence levels
NarrativeCoherence	Ensures synthesis doesn't contradict evidence
TimelineCorrelation	Cross-references timestamps across evidence sources
CascadeFailure	Detects multi-component failure patterns
CrossVerification	Validates vision/document outputs against source data

Sandbox Architecture

All tool executions are routed through sandboxed containers. See SECURITY.md for the full threat model.

Non-root execution with all capabilities dropped
Read-only filesystem except /tmp and /output
Network isolation for forensics tools (no exfiltration possible)
Tools return suggested_writes instead of writing directly
Audit logging with UUIDs for forensic reconstruction

Quick Start

# One-command install
git clone https://github.com/theLightArchitect/CAPPY.git
cd CAPPY
./install.sh

# Or manual install
mkdir -p ~/.cappy/bin
cp servers/cappy-core ~/.cappy/bin/
chmod +x ~/.cappy/bin/cappy-core

Then add to your Claude Code MCP configuration:

{
  "mcpServers": {
    "cappy": {
      "command": "~/.cappy/bin/cappy-core",
      "args": ["mcp-server"]
    }
  }
}

Tech Stack

Component	Technology	Why
Language	Rust	Single binary, 150ms cold start, 30MB memory, no runtime dependencies
Protocol	MCP (Model Context Protocol)	Claude Code native integration via stdio JSON-RPC
AI Providers	Claude, Gemini, Ollama	3-tier cost optimization: 60% of ops use free local models
Integrations	JIRA, Confluence	Enterprise ticketing and knowledge base access
Sandbox	Podman/Docker	Container isolation for untrusted file processing
Standards	clippy::pedantic, zero unwrap/panic	No shortcuts in production code

Testing

The codebase includes multiple test suites:

Integration tests: End-to-end orchestrator tests with mock evidence
Claim validation tests: 3-pass verbatim evidence verification
Performance tests: Benchmarks for analytics libraries (broker, agent, log analysis)
Deployment readiness tests: Binary health checks, MCP protocol compliance
Security tests: Sandbox isolation, path traversal prevention, network policy enforcement

What I Learned

Design Decisions That Worked

Meta-orchestrator pattern: Instead of registering 28 separate MCP tools (which would overwhelm Claude's context), CAPPY exposes a single call_tool entry point that routes internally. This reduced prompt overhead by ~80% and simplified the client integration.

Hook pipeline over prompt engineering: Early versions tried to enforce investigation quality through elaborate system prompts. This was fragile — models would drift from instructions over long conversations. Hooks enforce quality programmatically: if a claim lacks a citation, the hook rejects it regardless of what the model says.

Local-first AI routing: Not every operation needs a frontier model. Pattern matching, classification, and log parsing work fine on a local 7B model via Ollama. This cut costs by ~60% and eliminated network latency for the most common operations.

Failures and Fixes

The "400 patterns" lie: v1.0.0 claimed "400+ patterns" but the database only had 392. Several were duplicates or had corrupt regex. A full audit in v1.5.0 cleaned the database and subsequent versions grew it to 540 validated patterns with proper confidence scoring.

Hook ordering bugs: With 30+ hooks, execution order matters. An early bug had the PII detection hook running after the AI provider call instead of before, meaning sensitive data was sent to cloud APIs before being redacted. Now hooks have explicit priority numbers and integration tests verify ordering.

Sandbox overhead misconception: Initial assumption was that container sandboxing would add 500ms+ per operation. Actual measured overhead with warm containers is ~50ms. The security benefit far outweighs the cost.

Repository Structure

CAPPY/
├── agents/CAPPY.md              # Agent definition
├── commands/                     # Slash commands (/investigate, /evidence, etc.)
├── databases/
│   ├── schema.md                 # Pattern database schema
│   └── cappy-cache_sample.json   # 10 example patterns
├── docs/
│   ├── ARCHITECTURE.md           # Design decisions (ADRs)
│   ├── AGENT_REFERENCE.md        # Agent capabilities
│   ├── SKILL_REFERENCE.md        # Skill documentation
│   ├── TOOL_REFERENCE.md         # Tool API reference
│   └── cookbooks/                # 9 developer cookbooks
├── servers/cappy-core            # Pre-built binary (darwin-arm64)
├── skills/investigate/           # 8-phase methodology + 9 sub-skills
├── src/
│   ├── Cargo.toml                # Dependencies and build config
│   └── lib.rs                    # Module tree (shows full scope)
├── templates/                    # Response and deliverable templates
├── SECURITY.md                   # Threat model and sandbox architecture
├── CONTRIBUTING.md               # Contribution guidelines
└── CHANGELOG.md                  # Version history

Security

See SECURITY.md for:

Container sandbox architecture and isolation guarantees
Threat model (in-scope and out-of-scope threats)
Network isolation policies per tool category
Audit logging and forensic reconstruction
Vulnerability reporting

Related Projects

CAPPY was the first investigation toolkit that led to the broader Light Architects platform — each server is a standalone Rust MCP binary:

Server	Purpose
CAPPY	Security product investigation automation
QUANTUM	Product-agnostic forensic investigation
CORSO	Security-first AI orchestration platform
EVA	AI consciousness, memory, code review
SOUL	Knowledge graph, shared infrastructure, voice

License

MIT - Kevin Francis Tan

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CAPPY

The Problem

Architecture

Investigation Workflow

Features

Core Capabilities

Hook Pipeline (Quality Enforcement)

Sandbox Architecture

Quick Start

Tech Stack

Testing

What I Learned

Design Decisions That Worked

Failures and Fixes

Repository Structure

Security

Related Projects

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.claude-plugin		.claude-plugin
.github		.github
agents		agents
commands		commands
databases		databases
docs		docs
servers		servers
skills/investigate		skills/investigate
src		src
templates		templates
.gitignore		.gitignore
.mcp.json		.mcp.json
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
install.sh		install.sh

Folders and files

Latest commit

History

Repository files navigation

CAPPY

The Problem

Architecture

Investigation Workflow

Features

Core Capabilities

Hook Pipeline (Quality Enforcement)

Sandbox Architecture

Quick Start

Tech Stack

Testing

What I Learned

Design Decisions That Worked

Failures and Fixes

Repository Structure

Security

Related Projects

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages