Conversation

@qgallouedec qgallouedec changed the base branch from main to multi-image-support September 20, 2025 05:11
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec
Member Author

cc @Peter-Chou, customization is made easier with this one

Member

@lewtun lewtun left a comment

LGTM once the tests pass. One question: should we split _generate from scoring entirely?

@Peter-Chou
Contributor

@qgallouedec Yes. The original _generate_and_score_completions method was way too lengthy.
Breaking it down into finer-grained sub-methods and chaining them together like this is an excellent approach!

Base automatically changed from multi-image-support to main September 23, 2025 00:17
@qgallouedec qgallouedec changed the title Refactor GRPO to isolate _generate 😷 Refactor GRPO to isolate _generate Sep 23, 2025
@qgallouedec qgallouedec changed the title 😷 Refactor GRPO to isolate _generate 😷 Refactor GRPO/RLOO to isolate _generate Sep 24, 2025
@qgallouedec qgallouedec changed the title 😷 Refactor GRPO/RLOO to isolate _generate [WIP] 😷 Refactor GRPO/RLOO to isolate _generate Sep 25, 2025

4 participants