Adapter registry — semantic search, auto-forge, publish (npm for intelligence) #92

@joelteply

Vision

LoRA adapters as packages. HuggingFace as the registry. Cosine similarity as the search.

System needs Fortran expertise → searches registry by embedding → 
  finds "legacy-systems" at 0.87 cosine → pages it in → good enough.
If not good enough → auto-forges closer match → publishes → cached forever.

Architecture

Base model = CPU

One forged Qwen3.5 base per hardware tier. Always loaded. General purpose.

Adapters = PCIe cards

Hot-swappable LoRA adapters. Few hundred MB each. Domain-specific.
Paged in/out by the genome paging system based on current task.
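A minimal sketch of what paging adapters in and out could look like. The issue does not say which eviction policy the genome paging system uses; least-recently-used is assumed here, and `AdapterPager`, `page_in`, and the `load` callback are hypothetical names.

```python
from collections import OrderedDict


class AdapterPager:
    """Keeps at most `capacity` adapters resident; evicts the least recently used.

    LRU is an assumption for illustration, not the documented policy.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.resident = OrderedDict()  # adapter name -> loaded handle

    def page_in(self, name, load):
        # Already resident: mark as most recently used and reuse it.
        if name in self.resident:
            self.resident.move_to_end(name)
            return self.resident[name]
        # Make room by paging out the oldest adapter.
        if len(self.resident) >= self.capacity:
            self.resident.popitem(last=False)
        self.resident[name] = load(name)
        return self.resident[name]
```

With a capacity of two, requesting `a`, `b`, `a`, `c` in that order pages out `b` and leaves `a` and `c` resident.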

Registry = npm

  • Search by embedding, not keyword. Cosine similarity over capability vectors.
  • Capability vectors derived from: training data domain, exam scores, user ratings.
  • Close counts. A cached adapter at 0.7 similarity beats a 1.0 match that still needs forging.
  • Anyone publishes. Researcher, hobbyist, company. Quality proven by exam scores.
  • Auto-forge on miss. No adapter? Sentinel generates curriculum, forges one, publishes.
  • Coverage densifies organically. Every user's need that triggers a forge fills a gap.
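The embedding search described above could be sketched as follows. This is illustrative only: the field names `name` and `embedding`, the function names, and the 0.7 default threshold are assumptions, and real capability vectors would have far more dimensions.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, adapters, threshold=0.7):
    """Return (similarity, name) pairs at or above the threshold, best first."""
    scored = [(cosine(query, a["embedding"]), a["name"]) for a in adapters]
    hits = [pair for pair in scored if pair[0] >= threshold]
    return sorted(hits, reverse=True)
```

An empty result is the "miss" case that would trigger a forge.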

Quality signal = exam scores

Not perplexity. Not downloads. RealClassEval-style exam transcripts.
An adapter that scores 95% on the exam gets pulled ahead of one that scores 60%.
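One way to apply that rule among candidates that already passed the similarity search. The issue does not say how exam score and similarity combine; this sketch assumes exam score ranks first and similarity only breaks ties. The field names are hypothetical.

```python
def pick_by_exam(candidates):
    """Choose the candidate with the highest exam score.

    Assumption: similarity is only a tie-breaker once a candidate
    has cleared the search threshold.
    """
    return max(candidates, key=lambda c: (c["exam_score"], c["similarity"]))
```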

Implementation

Phase 1: Local registry

  • adapters.json manifest listing available LoRA adapters with metadata
  • Metadata: domain embedding, exam scores, base model, size, author
  • genome/adapter/search command: cosine similarity over embeddings
  • genome/adapter/load command: download from HF if not cached
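A sketch of what an `adapters.json` entry and the cache check behind `genome/adapter/load` might look like. The key names, the repo id, the author, and every value in the sample entry are placeholders for illustration, not a proposed schema. The actual Hugging Face download is left to an injected `download` callable so the sketch assumes nothing about the client library.

```python
import json
from dataclasses import dataclass
from pathlib import Path

# Hypothetical manifest content; all values are placeholders.
MANIFEST = """
{
  "adapters": [
    {
      "name": "legacy-systems",
      "repo_id": "continuum-ai/legacy-systems",
      "base_model": "Qwen3.5",
      "size_mb": 300,
      "author": "example-author",
      "exam_score": 0.95,
      "embedding": [0.12, 0.80, 0.35]
    }
  ]
}
"""


@dataclass
class AdapterEntry:
    name: str
    repo_id: str
    base_model: str
    size_mb: int
    author: str
    exam_score: float
    embedding: list


def read_manifest(text):
    """Parse manifest JSON into a list of AdapterEntry records."""
    return [AdapterEntry(**item) for item in json.loads(text)["adapters"]]


def load_adapter(entry, cache_dir, download):
    """Return the local path of an adapter, downloading only on a cache miss."""
    target = Path(cache_dir) / entry.name
    if not target.exists():
        download(entry.repo_id, target)  # caller supplies the HF download
    return target
```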

Phase 2: Auto-forge

  • genome/adapter/forge command: generate curriculum + forge + publish
  • Triggered automatically when search returns no match above threshold
  • Uses sentinel pipeline for curriculum generation
  • Publishes to continuum-ai HF org
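The trigger logic for this phase might look like the sketch below. `search`, `forge`, and `publish` are injected callables standing in for the registry search, the sentinel pipeline, and the HF upload; their signatures and the 0.7 threshold are assumptions.

```python
def resolve(query, search, forge, publish, threshold=0.7):
    """Return (adapter_name, origin): a cached hit, or a freshly forged adapter.

    `search(query)` is assumed to return (similarity, name) pairs, best first.
    """
    hits = search(query)
    if hits and hits[0][0] >= threshold:
        return hits[0][1], "cached"
    # Miss: generate a curriculum, forge, then publish so the next
    # search for a similar need is served from the registry.
    name = forge(query)
    publish(name)
    return name, "forged"
```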

Phase 3: Grid distribution

  • Nodes advertise available adapters
  • Forge jobs distributed to nodes with spare compute
  • Adapters replicated across grid for availability
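A rough sketch of the scheduling side, assuming each node advertises a set of adapter names and a spare-compute figure. The fields `id`, `adapters`, and `spare_gpu_hours` are hypothetical, as is the "most spare capacity wins" rule.

```python
def nodes_holding(nodes, adapter_name):
    """IDs of nodes that advertise the given adapter."""
    return [n["id"] for n in nodes if adapter_name in n["adapters"]]


def assign_forge_job(nodes, required_gpu_hours):
    """Pick the node with the most spare compute that can fit the job, or None."""
    eligible = [n for n in nodes if n["spare_gpu_hours"] >= required_gpu_hours]
    if not eligible:
        return None
    return max(eligible, key=lambda n: n["spare_gpu_hours"])["id"]
```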

Dependencies
