Skip to content

Conversation

@XinyuYe-Intel
Copy link
Collaborator

Description

Add DPO support in finetuning microservice.

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

@XinyuYe-Intel XinyuYe-Intel linked an issue Nov 7, 2024 that may be closed by this pull request
@XinyuYe-Intel XinyuYe-Intel added this to the v1.1 milestone Nov 7, 2024
@lkk12014402
Copy link
Collaborator

support dpo training on Gaudi?

@XinyuYe-Intel
Copy link
Collaborator Author

support dpo training on Gaudi?

yes, it does.

@joshuayao joshuayao added the r1.1 label Nov 12, 2024
@ftian1 ftian1 merged commit 37f3514 into main Nov 12, 2024
@ftian1 ftian1 deleted the dpo branch November 12, 2024 03:35
madison-evans pushed a commit to SAPD-Intel/GenAIComps that referenced this pull request May 12, 2025
* added dpo support.

Signed-off-by: Ye, Xinyu <[email protected]>

* make dpo trainer compatible with newest transformers.

Signed-off-by: Ye, Xinyu <[email protected]>

* added ut for dpo.

Signed-off-by: Ye, Xinyu <[email protected]>

* added training successfulness check in finetuning ut.

Signed-off-by: Ye, Xinyu <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* updated broken link.

Signed-off-by: Ye, Xinyu <[email protected]>

---------

Signed-off-by: Ye, Xinyu <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DPO fine-tuning

7 participants