Add trust_remote_code to GRPOConfig #4186
Conversation
```python
trust_remote_code: bool = field(
    default=False,
    metadata={"help": "Whether to trust remote code when loading custom models e.g. from the Hugging Face Hub."},
)
```
metadata={"help": "Whether to trust remote code when loading custom models e.g. from the Hugging Face Hub."}, | |
metadata={"help": "Whether to trust remote code when loading custom models from the Hugging Face Hub."}, |
`trust_remote_code` matters when loading things from local files too. The docs of vLLM describe it similarly:

> Trust remote code (e.g., from HuggingFace) when downloading the model and tokenizer.

https://docs.vllm.ai/en/latest/api/vllm/index.html#vllm.LLM

That said, the docs of transformers only mention the Hub:

> Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.

Do you think `e.g.` should be deleted? I don't have a strong opinion, so I will follow your preference.
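For reference, a minimal sketch of the local-file case under discussion (the path is hypothetical): `trust_remote_code` also gates custom code shipped inside a local directory, not only code downloaded from the Hub.

```python
from transformers import AutoModelForCausalLM

# Hypothetical local checkout that ships its own modeling_*.py files.
# trust_remote_code must be True here too, even though nothing is downloaded.
model = AutoModelForCausalLM.from_pretrained(
    "./path/to/custom-model",
    trust_remote_code=True,
)
```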
trl/trainer/grpo_trainer.py (Outdated)
```diff
  # Disable caching if gradient checkpointing is enabled (not supported)
- config = AutoConfig.from_pretrained(model_id)
+ config = AutoConfig.from_pretrained(model_id, trust_remote_code=self.args.trust_remote_code)
  architecture = getattr(transformers, config.architectures[0])
```
When using remote code, the idea is that the model is not included in transformers, right? So maybe you need something like this instead:
```python
if hasattr(transformers, config.architectures[0]):
    architecture = getattr(transformers, config.architectures[0])
    model = architecture.from_pretrained(model_id, **model_init_kwargs)
else:
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
```
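To illustrate why the `hasattr` guard is needed, here is a small sketch (the repo name is hypothetical): for a remote-code model, the architecture class lives in the repo's own modeling file rather than in the `transformers` package, so the plain `getattr` lookup would raise `AttributeError`.

```python
import transformers
from transformers import AutoConfig

# Hypothetical Hub repo whose architecture is defined in its own modeling file.
config = AutoConfig.from_pretrained("some-org/custom-model", trust_remote_code=True)

# config.architectures[0] (e.g. "MyCustomForCausalLM") is not an attribute of
# the transformers package, so the AutoModelForCausalLM fallback is required.
print(hasattr(transformers, config.architectures[0]))  # expected: False
```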
Thanks, you are right. I hadn't tested the `AutoConfig` code path, as I was passing an already loaded model to `GRPOTrainer`. I will try rewriting it along the lines of your code.
I fixed it. It now works when the `model` passed to `GRPOTrainer` is a `str`.
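A rough end-to-end sketch of that path (the repo name, reward function, and dataset are placeholders, not taken from the PR):

```python
from datasets import Dataset
from trl import GRPOConfig, GRPOTrainer

def dummy_reward(completions, **kwargs):
    # Placeholder reward: longer completions score higher.
    return [float(len(c)) for c in completions]

dataset = Dataset.from_dict({"prompt": ["Hello", "Write a haiku."]})

training_args = GRPOConfig(output_dir="out", trust_remote_code=True)
trainer = GRPOTrainer(
    model="some-org/custom-model",  # hypothetical repo with custom modeling code
    reward_funcs=dummy_reward,
    args=training_args,
    train_dataset=dataset,
)
```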
Force-pushed from 8376f61 to 90c91b0.
What does this PR do?
This PR adds `trust_remote_code` to `GRPOConfig` and makes `GRPOTrainer` use it when creating the model and its related objects, so that custom models are supported.

Fixes #4129
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.