Description
Port the SophiaCore data generation pattern to ShaprAI for supervised fine-tuning (SFT) training data generation.
Requirements
- Port the proven pattern from
sophiacore_data_generator.py to work with ShaprAI's template system
- Generate ChatML-formatted training data with proper
<|im_start|> / <|im_end|> tokens
- Support identity-weighted examples (personality-defining responses weighted higher in training)
- Customizable personality templates — users define their agent's voice, values, and behavioral boundaries
Acceptance Criteria
Bounty
50 RTC — Paid on merge to main.
How to Claim
Comment on this issue to claim it. Submit a PR referencing this issue.
Description
Port the SophiaCore data generation pattern to ShaprAI for supervised fine-tuning (SFT) training data generation.
Requirements
sophiacore_data_generator.pyto work with ShaprAI's template system<|im_start|>/<|im_end|>tokensAcceptance Criteria
shaprai/training/sft_generator.pymodule createdshaprai generate-sft --template my_agent.yaml --output train.jsonl --count 1000SFTTrainerformatBounty
50 RTC — Paid on merge to main.
How to Claim
Comment on this issue to claim it. Submit a PR referencing this issue.