Automated system for transforming transcripts into structured and queryable documentation.
This project provides a complete suite of tools and agents to transform raw transcripts (meeting recordings, knowledge transfer interviews, etc.) into structured, navigable and queryable documentation.
Generated documentation can be searched and queried via the integrated search agent (search-doc), allowing quick and precise access to documented information.
- π Knowledge Capture: Transforms verbal transcripts into written documentation
- π Accessibility: Makes information easily accessible via intelligent search
- π Structuring: Organizes information in a coherent and logical manner
- π Reusability: Generic documentation usable on any project
- π€ Automation: Uses GitHub Copilot to accelerate the process
- GitHub Copilot Chat - AI assistant for executing agents
- Visual Studio Code - Code editor with Copilot Chat support
- Project Folder - Structure prepared with necessary folders
- VS Code: Recent version (2024+)
- GitHub Copilot: Access enabled
- Understand Git/GitHub basics
- Familiarity with VS Code
- Ability to follow step-by-step instructions
# Option 1: Clone existing repository
git clone <repository-url>
cd <repository-name>
# Option 2: Create structure from scratch
mkdir -p .github/{agents,prompts,instructions}
mkdir -p transcripts/{raw,clean}
mkdir -p docs
mkdir -p tempEnsure following files exist in your project:
Required Files:
.github/
βββ agents/
β βββ clean-transcript.agent.md β Cleaning agent
β βββ search-doc.agent.md β Search agent
βββ prompts/
β βββ generic-doc-transformation-agent.prompt.md β Agent generator
β βββ generate-doc-plan.prompt.md β Plan generator
β βββ execute-doc-plan.prompt.md β Plan executor
βββ instructions/
β βββ agents.instructions.md β Agent rules
β βββ markdown.instructions.md β Markdown standards
β βββ process.instructions.md β Global process
β βββ prompt.instructions.md β Prompt standards
βββ prompts.config β Central configuration
transcripts/
βββ raw/ β Raw transcripts
βββ clean/ β Cleaned transcripts
docs/ β Generated documentation
temp/ β Temporary files
Edit .github/prompts.config file with your project parameters:
PROJECT_NAME: Your Project Name
AGENT_NAME: create-docs
SOURCE_PATHS:
- /transcripts/clean/1_Domain_1
- /transcripts/clean/2_Domain_2
OUTPUT_PATH: /docs
ENTRYPOINT: SUMMARY.md
CREATE_OVERVIEW_FILES: true
OVERVIEW_FILE_NAME: overview.md
LANGUAGE: English
DOMAINS:
- name: Domain 1
path: 1_Domain_1
description: My domain 1
- name: Domain 2
path: 2_Domain_2
description: My domain 2
BATCH_SIZE: 2-4Role: Transcript cleaning and structuring agent
- Reads raw transcripts (
.transcript) - Corrects errors and omissions
- Structures content into markdown
- Applies formatting standards
- Output: Structured
.mdfiles in/transcripts/clean - Status: Required agent (necessary at startup)
Role: Main documentation creation agent
- DYNAMICALLY GENERATED by
generic-doc-transformation-agent.prompt.md - Reads cleaned transcripts
- Transforms them into structured documentation
- Organizes by domains and topics
- Generates with metadata (Topics, Related, Source)
- Output: Structured markdown documents in
OUTPUT_PATH/(from.github/prompts.config) - Status: Created during process (not a prerequisite)
Role: Search and response agent
- Queries generated documentation
- Responds only based on documentation
- No hallucination or invention
- Provides sources and citations
- Restriction: No code execution
- Status: Generic agent (copy-paste ready)
Role: Execution plan generator
- Analyzes source files and folder structure
- Creates comprehensive execution plan
- Groups files into logical batches (2-4 files per batch)
- Defines all phases (init, batches, cross-refs, summary, validation)
- Output: Complete plan in
temp/plan.json+temp/plan.md - Features: Progress tracking format, success criteria, timing estimates
- Phase: 4 (runs after transcripts are cleaned and validated)
Role: Documentation agent generator
- Produces custom
create-docsagent - Adapted to your structure and domains
- Utilizes the execution plan generated by
generate-doc-plan.prompt.md - Input: Source files + parameters + execution plan
- Output:
.github/agents/create-docs.agent.md - Phase: 5 (runs after plan is generated)
Role: Plan executor agent
- Reads and executes plan from
temp/plan.json(withtemp/plan.mdas human-readable reference) - Manages all phases automatically without pauses
- Tracks progress continuously
- Performs detailed validation at each phase
- Output: Complete documentation in
OUTPUT_PATH/(from.github/prompts.config) - Features: Automatic resumption, comprehensive error handling, and phase-by-phase validation
Global process and workflow:
- Complete end-to-end system description
- Pipeline phases and steps
- Data flow between components
- Dependencies and sequencing
- Completion checklists
Creation rules for .agent.md files:
- YAML frontmatter structure
- Required sections
- Format and conventions
- Best practices
Documentation standards:
- Consistent markdown formatting
- Document structure
- Naming conventions
- Required metadata
Standards for .prompt.md files:
- Structure and format
- Instruction sections
- Best practices
- Validation
Centralized YAML file containing:
PROJECT_NAME: Project nameSOURCE_PATHS: Transcript locationsOUTPUT_PATH: Where to generate documentationLANGUAGE: Language (e.g., English)DOMAINS: Main domains/topicsBATCH_SIZE: Files per batch (2-4)TARGET: Execution environment (e.g.,vscode)TOOLS: Available tools (e.g.,read,edit,search)
- Contains: Raw
.transcriptfiles - Source: Original recordings/transcriptions
- Format: Plain text or formatted
- Role: Process starting point
- Contains: Cleaned
.mdfiles - Source: Transformed from raw files
- Format: Structured markdown
- Role: Source for documentation generation
- Contains: Final generated documentation
- Structure: Organized by domains and hierarchical sub-folders
- Format: Markdown with metadata
- Role: Documentation destination (defaults to
/docs/whenOUTPUT_PATH: /docs) - Flexibility: First folder level under
OUTPUT_PATH/is always one folder per configured domain (DOMAINS[].path). Inside each domain, the agent can create topic/subtopic sub-folders (up to ~4 nesting levels total).
- Contains: Progress temporary files
- Usage: Batch tracking during processing
- Format: Progress files (
agent-progress.md) - Role: Long operation management
graph LR
A["Raw Transcripts<br/>/transcripts/raw"] -->|"@clean-transcript"| B["Cleaned Transcripts<br/>/transcripts/clean"]
B -->|"/generate-doc-plan"| D["Plan Generated<br/>temp/plan.json + temp/plan.md"]
D -->|"@generic-doc-transformation-agent"| C["Agent Created<br/>create-docs.agent.md"]
C -->|"executes"| E["@create-docs<br/>/execute-doc-plan"]
D -->|"guides"| E
E -->|"produces"| F["Documentation<br/>OUTPUT_PATH"]
F -->|"@search-doc"| G["Answers"]
Action: Add transcripts to /transcripts/raw/
transcripts/raw/
βββ KT_1.transcript
βββ KT_2.transcript
βββ KT_3.transcript
Accepted Format:
.transcriptfiles (plain text)- Content: Text transcriptions of meetings/interviews
Tool: Cleaning agent clean-transcript.agent.md
VS Code Command:
@clean-transcript
Process the transcript "/transcripts/raw/KT_1.transcript"
Note: Select @clean-transcript agent in Copilot Chat interface
Output: Cleaned .md files in /transcripts/clean/
Note: Multiple iterations may be necessary
- Check quality
- Correct omissions
- Refine structure
Action: Examine files in /transcripts/clean/
transcripts/clean/
βββ domain1/
β βββ KT_1.md
β βββ KT_2.md
βββ domain2/
βββ KT_1.md
βββ KT_2.md
Checks:
- β Correct and complete content
- β Logical structure
- β Metadata present
- β No file corruption
Tool: generate-doc-plan.prompt.md
Steps:
- Use prompt directly in chat (parameters read from
prompts.config):/generate-doc-plan - Prompt analyzes source files and creates complete plan
Output: Complete execution plan in temp/plan.json + temp/plan.md
- Deterministic batch structure with file groupings
- All execution phases (init + batches + cross-refs + summary + validation)
- Strict execution order
- Progress tracking format
Tool: generic-doc-transformation-agent.prompt.md
Steps:
- Use prompt directly in chat (parameters read from
prompts.config):/generic-doc-transformation-agent Create the agent for me - Prompt generates:
.github/agents/create-docs.agent.md
Output:
- Custom agent based on your structure
- Adapted to your domains and execution plan
- Ready for execution
Tool: execute-doc-plan.prompt.md executed as @create-docs agent
Steps:
-
Use the generated agent to execute the plan:
@create-docs /execute-doc-plan -
The plan executes automatically through all phases:
- Phase 0: Initialization (creates folder structure)
- Phases 1-N: Batch Processing (transforms transcripts by batch)
- Phase N+1: Cross-Reference Resolution (links documents)
- Phase N+2: Summary Generation (creates index and overview)
- Phase N+3: Final Validation (verifies completeness)
-
Progress is tracked continuously with updates
Output: Complete documentation in OUTPUT_PATH/
- Structured markdown files with metadata
OUTPUT_PATH/ENTRYPOINT(default:OUTPUT_PATH/SUMMARY.md) as the single documentation entrypoint- Entry point includes pages (course order) with topics + description, plus AβZ indexes (pages + topics) and source mapping
- Folder-level
overview.mdfiles (one per domain/topic/subtopic) when enabled byCREATE_OVERVIEW_FILES - All cross-references resolved
- Validation report confirming completion
- Hierarchical structure: Within each domain folder, the agent creates topic/subtopic sub-folders as needed (up to ~4 nesting levels total)
Tool: Search agent search-doc.agent.md
Usage:
@search-doc
"What is [Concept]?"
@search-doc
"How to [Action]?"
@search-doc
"What is the difference between [A] and [B]?"
Responses:
- β Based ONLY on documentation
- β With citations and sources
- β Indicating limitations
- β Suggestions for related documents
Centralized YAML file containing all project parameters.
# ========================================
# PROJECT INFORMATION
# ========================================
PROJECT_NAME: My Documentation
AGENT_NAME: create-docs
AGENT_DESCRIPTION: Agent for transforming transcripts to documentation
# ========================================
# PATHS AND SOURCES
# ========================================
SOURCE_PATHS:
- /transcripts/clean/1_Domain_1
- /transcripts/clean/2_Domain_2
OUTPUT_PATH: /docs
# ========================================
# STRUCTURE AND DOMAINS
# ========================================
DOMAINS:
- Domain_1
path: 1_Domain_1
file_count: X
description: My awesome domain 1
- Domain_2
path: 2_Domain_2
file_count: X
description: My awesome domain 2
LANGUAGE: English
TONE: Professional
AUDIENCE: Technical teams and documentation users
# ========================================
# BATCH PROCESSING
# ========================================
BATCH_SIZE: 2-4 # Files per batch
PROGRESS_FILE: /temp/[agent-name]-progress.md
# ========================================
# AGENT AND TOOLS
# ========================================
TOOLS: [read, edit, search]
TARGET: vscodeTo change output path:
OUTPUT_PATH: /documentation # Instead of /docsTo add new domains:
DOMAINS:
- Domain1
- Domain2
- NewDomainTo change language:
LANGUAGE: English # Instead of FrenchTo modify batch size:
BATCH_SIZE: 3-5 # Process 3-5 files per batchAgents and prompts reading from prompts.config automatically adapt:
- β
generate-doc-plan.prompt.mdanalyzes files and creates plan based on new structure - β
generic-doc-transformation-agent.prompt.mduses new parameters to generate adapted agent - β
execute-doc-plan.prompt.mdexecutes plan with new settings - β
create-docs.agent.mdgenerates according to new structure - β
search-doc.agent.mdqueries newOUTPUT_PATH
No code modification required!
| Step | Tool | Action | Output |
|---|---|---|---|
| 1 | Manual | Add transcripts | /transcripts/raw/ |
| 2 | @clean-transcript | Clean transcripts | /transcripts/clean/ |
| 3 | Manual | Verify quality | β Validation |
| 4 | /generate-doc-plan | Generate plan | temp/plan.json + temp/plan.md |
| 5 | @generic-doc-transformation-agent | Generate agent | create-docs.agent.md |
| 6 | @create-docs /execute-doc-plan | Execute plan | OUTPUT_PATH/ |
| 7 | @search-doc | Query docs | Answers |
Q: My agent doesn't generate documentation
A: Verify that /transcripts/clean/ contains files and that OUTPUT_PATH exists in prompts.config
Q: Search returns no results
A: Ensure documentation was generated in OUTPUT_PATH/ and that the entrypoint exists at OUTPUT_PATH/ENTRYPOINT (default: OUTPUT_PATH/SUMMARY.md)
Q: How do I add new domains?
A: Edit DOMAINS in .github/prompts.config and re-run agents
Q: Can I use the system for another project?
A: Yes! Configure paths in prompts.config and execute agents
- Project: Transcript to Documentation System
- Generic Agent: Designed for reusability
- Based on: GitHub Copilot Chat
Version: 1.0 (Generic Release)
Date: 2026
Status: π Ready