Skip to content

Fix RoleplayTemplate generating broken prompts with archaic language#198

Open
Br1an67 wants to merge 1 commit intoconfident-ai:mainfrom
Br1an67:fix/issue-125-roleplay-template
Open

Fix RoleplayTemplate generating broken prompts with archaic language#198
Br1an67 wants to merge 1 commit intoconfident-ai:mainfrom
Br1an67:fix/issue-125-roleplay-template

Conversation

@Br1an67
Copy link

@Br1an67 Br1an67 commented Mar 1, 2026

Summary

Fix RoleplayTemplate always generating historical/archaic language (e.g., "Greetings! I beseech...") regardless of the chosen persona.

Closes #125

Changes

  • Remove hardcoded few-shot examples from RoleplayTemplate.enhance() — the 3 examples (French Resistance fighter, Cyberpunk netrunner, Villain monologue) biased all outputs toward dramatic/archaic tone even for modern roles/personas
  • Add explicit language rule instructing the model to use modern, natural language appropriate to the persona's era and context, only using archaic language when the persona is explicitly historical
  • Update is_roleplay validator — removed Thee/Thou as a positive indicator of valid roleplay, replaced with generic "language appropriate to the character"

Why this works

The structured output schema (EnhancedAttack) already enforces the JSON format, so few-shot examples are not needed for format compliance. Removing them eliminates the archaic language bias while the new language rule provides explicit guidance on tone matching.

Testing

  • Existing unit tests pass (test_roleplay_initialization, test_roleplay_initialization_with_weight, test_roleplay_initialization_with_persona)
  • Template output verified: no archaic examples, language rule present, persona parameters correctly interpolated
  • black formatting check passes

Remove hardcoded few-shot examples that biased all outputs toward
historical/dramatic language regardless of the chosen persona. Add an
explicit language rule instructing the model to use modern, natural
language appropriate to the persona's era and context.

Also update is_roleplay validator to not require archaic language markers
as evidence of valid roleplay.
@vercel
Copy link

vercel bot commented Mar 1, 2026

@Br1an67 is attempting to deploy a commit to the Confident AI Team on Vercel.

A member of the Team first needs to authorize it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

RoleplayTemplate: remove few shot to avoid historical sounding roles

1 participant