Skip to content

docs: Add comprehensive YAML configuration reference for SFT and distillation#1

Open
scm-aiml wants to merge 1 commit intomainfrom
claude/document-config-datasets-bR1dC
Open

docs: Add comprehensive YAML configuration reference for SFT and distillation#1
scm-aiml wants to merge 1 commit intomainfrom
claude/document-config-datasets-bR1dC

Conversation

@scm-aiml
Copy link
Owner

Add detailed CONFIG_REFERENCE.md documenting all configuration options for:

  • Supervised Fine-Tuning (SFT)
  • On-Policy Distillation
  • All available dataset names and their mappings
  • Policy, training backend (DTensor/Megatron), and generation configs
  • Multi-node configuration examples

This addresses the gap in documentation where dataset names and config
keys were scattered across code and not clearly documented. Users can
now find a single comprehensive reference for all YAML config options.

Includes:

  • Complete DataConfig schema with all keys
  • Full list of response, preference, and eval datasets
  • DTensor and Megatron backend configuration
  • Generation, logging, and checkpointing settings
  • Real multi-node examples from the recipes directory

…illation

Add detailed CONFIG_REFERENCE.md documenting all configuration options for:
- Supervised Fine-Tuning (SFT)
- On-Policy Distillation
- All available dataset names and their mappings
- Policy, training backend (DTensor/Megatron), and generation configs
- Multi-node configuration examples

This addresses the gap in documentation where dataset names and config
keys were scattered across code and not clearly documented. Users can
now find a single comprehensive reference for all YAML config options.

Includes:
- Complete DataConfig schema with all keys
- Full list of response, preference, and eval datasets
- DTensor and Megatron backend configuration
- Generation, logging, and checkpointing settings
- Real multi-node examples from the recipes directory
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants