Account for additional processor outputs #4191

KarelKenens · 2025-10-01T23:46:05Z

What does this PR do?

In DataCollatorForVisionLanguageModeling, attempt to dynamically determine additional processor outputs, on top of input_ids and attention_mask. For example in the Gemma 3 model family also token_type_ids are part of the output (and required futher downstream).

Fixes #4189

@qgallouedec , similar to #4190

KarelKenens added 2 commits October 2, 2025 01:35

fix: account for additional processor outputs

603a29f

test: add would fail before fix

5ce3223

KarelKenens mentioned this pull request Oct 1, 2025

sft_trainer.DataCollatorForVisionLanguageModelling does not account for "non-standard" processor outputs (e.g. Gemma 3) #4189

Open

5 tasks

refactor: remove unnecessary copy

1b77170

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Account for additional processor outputs #4191

Account for additional processor outputs #4191

Uh oh!

KarelKenens commented Oct 1, 2025

Uh oh!

Uh oh!

Account for additional processor outputs #4191

Are you sure you want to change the base?

Account for additional processor outputs #4191

Uh oh!

Conversation

KarelKenens commented Oct 1, 2025

What does this PR do?

Uh oh!

Uh oh!