feat: fuzzy/embedding-based prefix similarity via FAISS #3

Description

@manishklach

Research Question

What changes when approximate prefix reuse goes beyond structural token similarity and uses embedding-based neighborhood lookup to find reuse candidates?

Scope

  • keep the current MinHash path as the structural baseline
  • prototype an optional FAISS-backed similarity experiment
  • compare match quality, reuse rate, and false-positive risk
  • clearly separate this from the current metadata-aware exact/structural matching model
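To make the comparison in this scope concrete, here is a minimal, stdlib-only sketch of how the two paths could be scored side by side. Everything in it is an assumption for illustration rather than the project's actual code: the shingle size, `NUM_HASHES`, the `toy_embedding` function (a hashed bag of tokens standing in for a real embedding model), the `threshold` value, and the definition of a false positive as "accepted match with zero shared prefix tokens". The brute-force inner-product scan stands in for FAISS; in a real prototype an exact inner-product index such as `faiss.IndexFlatIP` could take its place.

```python
import hashlib
import math
from collections import Counter

NUM_HASHES = 64  # MinHash signature length (hypothetical setting)
SHINGLE = 3      # tokens per shingle (hypothetical setting)


def _hash(seed, text):
    """Deterministic 64-bit hash of text under a given seed."""
    digest = hashlib.blake2b(f"{seed}:{text}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def minhash(tokens):
    """Structural baseline: MinHash signature over ordered token shingles."""
    last = max(1, len(tokens) - SHINGLE + 1)
    shingles = {" ".join(tokens[i:i + SHINGLE]) for i in range(last)}
    return [min(_hash(seed, s) for s in shingles) for seed in range(NUM_HASHES)]


def minhash_similarity(sig_a, sig_b):
    """Fraction of agreeing slots, an estimate of shingle-set Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def toy_embedding(tokens, dim=32):
    """Placeholder embedding (assumption): L2-normalized hashed bag of tokens."""
    vec = [0.0] * dim
    for tok in tokens:
        vec[_hash(0, tok) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def inner_products(query_vec, cached_vecs):
    """Brute-force inner-product scan; a stand-in for an exact FAISS index."""
    return [sum(q * c for q, c in zip(query_vec, vec)) for vec in cached_vecs]


def best_match(scores):
    """Index and score of the highest-scoring cached entry."""
    idx = max(range(len(scores)), key=scores.__getitem__)
    return idx, scores[idx]


def shared_prefix_len(a, b):
    """Number of leading tokens that are actually identical (reusable)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def compare(queries, cached, threshold=0.8):
    """Count matches, reusable tokens, and false positives for both paths."""
    cached_sigs = [minhash(c) for c in cached]
    cached_vecs = [toy_embedding(c) for c in cached]
    stats = {"minhash": Counter(), "embedding": Counter()}
    for q in queries:
        sig = minhash(q)
        picks = {
            "minhash": best_match([minhash_similarity(sig, s) for s in cached_sigs]),
            "embedding": best_match(inner_products(toy_embedding(q), cached_vecs)),
        }
        for name, (idx, score) in picks.items():
            if score < threshold:
                continue  # below threshold: this path reports no match
            reused = shared_prefix_len(q, cached[idx])
            stats[name]["matches"] += 1
            stats[name]["reused_tokens"] += reused
            # Assumed definition: a match that shares no prefix is a false positive.
            stats[name]["false_positives"] += int(reused == 0)
    return stats


cached = ["you are a helpful assistant answer the question briefly".split()]
queries = [list(cached[0]), list(reversed(cached[0]))]
stats = compare(queries, cached)
print(stats)
```

The second query contains the same tokens in reverse order. An order-insensitive embedding scores it as a near-perfect neighbor even though no prefix can be reused, while the shingle-based MinHash path rejects it. That is the kind of false-positive risk the experiment is meant to measure, and it is why the embedding path should be reported separately from the structural baseline.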

Note

This should remain an exploratory simulation feature, not a claim of production retrieval quality.

Metadata

    Labels

    research: Research exploration or experiment proposal
