fix: encoding probes storing translated text in `pre_translation_prompt` #1483

paulinek13 · 2025-11-15T21:11:35Z

What This Change Does

This small change fixes #1461.

Problem: Encoding probes were incorrectly storing translated text in the pre_translation_prompt field while marking it with the source language tag in reports.

Fix: Removed early translation from EncodingMixin.__init__() to ensure prompts remain untranslated until the translation flow in Probe.probe().

Verification

Ran encoding probe tests: python -m pytest tests/probes/test_probes_encoding.py (84 passed, 1 skipped in 1.92s)
Verified that pre_translation_prompt in reports contains English text tagged as "en"

…nslation_prompt`

leondz · 2025-11-17T14:03:27Z

Thank you @paulinek13 ! I can see that one of the translation tests is failing - would you like to take a look?

paulinek13 · 2025-11-17T22:00:22Z

@leondz sorry, I didn't run all the tests locally. This should fix it: cd6f28b (#1483)

jmartin-tech

I think the actual bug is translation of the wrong item. Need some testing to validate. Note that if triggers should be translated then the test revision should be rolled back and the number of calls to get_text would become consistent again.

jmartin-tech · 2025-11-17T14:17:42Z

garak/probes/encoding.py

            self.prompts, self.triggers = zip(
                *random.sample(generated_prompts, self.soft_probe_prompt_cap)
            )
-        self.prompts = self.langprovider.get_text(self.prompts)


Should this actually be translating the self.triggers?

@paulinek13 Would appreciate your input here

I was thinking: for encoding probes, since the attack is in the encoding itself, does the language of the triggers really matter? Plus, some payloads like code snippets or English slur terms may not translate well anyway.

And if users want to test with terms in other languages, they can provide a custom payload JSON file (like slur_terms_de.json for example).

That's how I currently see it, but I might be missing something here.
Do you think that makes sense?

Sorry for the delay here, looking closely at _generate_encoded_prompts(), you are correct the triggers here are set before encoding so the response value should be compared to the original text not a translation.

fix: prevent encoding probes from storing translated text in `pre_tra…

c39d35d

…nslation_prompt`

leondz requested a review from jmartin-tech November 16, 2025 08:52

fix the failing test

cd6f28b

jmartin-tech reviewed Nov 17, 2025

View reviewed changes

leondz requested review from aishwaryap, erickgalinkin, leondz and patriciapampanelli November 20, 2025 11:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: encoding probes storing translated text in `pre_translation_prompt` #1483

fix: encoding probes storing translated text in `pre_translation_prompt` #1483

Uh oh!

paulinek13 commented Nov 15, 2025 •

edited

Loading

Uh oh!

leondz commented Nov 17, 2025

Uh oh!

paulinek13 commented Nov 17, 2025

Uh oh!

jmartin-tech left a comment

Uh oh!

jmartin-tech Nov 17, 2025

Uh oh!

leondz Dec 5, 2025

Uh oh!

paulinek13 Dec 5, 2025 •

edited

Loading

Uh oh!

jmartin-tech Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: encoding probes storing translated text in pre_translation_prompt #1483

Are you sure you want to change the base?

fix: encoding probes storing translated text in pre_translation_prompt #1483

Uh oh!

Conversation

paulinek13 commented Nov 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What This Change Does

Verification

Uh oh!

leondz commented Nov 17, 2025

Uh oh!

paulinek13 commented Nov 17, 2025

Uh oh!

jmartin-tech left a comment

Choose a reason for hiding this comment

Uh oh!

jmartin-tech Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

leondz Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

paulinek13 Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jmartin-tech Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: encoding probes storing translated text in `pre_translation_prompt` #1483

fix: encoding probes storing translated text in `pre_translation_prompt` #1483

paulinek13 commented Nov 15, 2025 •

edited

Loading

paulinek13 Dec 5, 2025 •

edited

Loading