Skip to content

Conversation

@pan-x-c
Copy link
Collaborator

@pan-x-c pan-x-c commented Jan 5, 2026

AgentScope Version

1.0.11

Description

Enhance the current tune related modules.

flowchart TD
    Model[Model] --> WorkflowFunction[Workflow Function]
    WorkflowFunction --> JudgeFunction[Judge Function]
    Task[Task] --> WorkflowFunction
    Task[Task] --> JudgeFunction
    JudgeFunction --> Reward[Reward]

    classDef wfcolor fill:#e67e22,stroke:#333,color:#111;
    classDef judgecolor fill:#1abc9c,stroke:#333,color:#111,stroke-dasharray: 5 5;
    classDef taskcolor fill:#3498db,stroke:#333,color:#111;
    class WorkflowFunction wfcolor;
    class JudgeFunction judgecolor;
    class Task taskcolor;
Loading

Enhancements include:

  1. Pass auxiliary_models (Dict[str, BaseChatModel]) to workflow_function to support the use of different models in multi-agent applications.
  2. workflow_function can return raw response with any type.
  3. Add judge_function to calculate rewards based on the raw response returned by workflow_function, and the judge_function can use auxiliary_models (Dict[BaseChatModel]) to implement LLM-as-a-judge.
  4. Simplify configuration file, use Dataset, TunerChatModel and Algorithm to construct the tuning configuration.

Checklist

Please check the following items before code is ready to be reviewed.

  • Code has been formatted with pre-commit run --all-files command
  • All tests are passing
  • Docstrings are in Google style
  • Related documentation has been updated (e.g. links, examples, etc.)
  • Code is ready for review

@pan-x-c pan-x-c added the Enhancement Enhance the existing functionalities label Jan 5, 2026
@cla-assistant
Copy link

cla-assistant bot commented Jan 5, 2026

CLA assistant check
All committers have signed the CLA.

@cla-assistant
Copy link

cla-assistant bot commented Jan 5, 2026

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

@pan-x-c pan-x-c changed the title Enhance Agent Tune Interface feat(tuner): Enhance Agent Tune Interface Jan 5, 2026
Copy link

@yanxi-chen yanxi-chen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some minor comments

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 29 out of 30 changed files in this pull request and generated 19 comments.

Copy link
Member

@DavdGao DavdGao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please see my inline comments. I have some concerns about whether abstractions like Algorithm actually simplify RL training configuration. In my view, consolidating all tuning parameters in a single JSON file would be more straightforward and easier to manage.

Copy link

@lingzhq lingzhq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To support the data-augmentation example in AgentScope-Tuner, we need to pass the Task Selector parameters to Trinity and fix an issue with loading eval tasksets via YAML. The corresponding implementation can be checked in the samples PR. Thanks for looking into this!

Copy link
Member

@DavdGao DavdGao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

plz see inline comments

Copy link
Member

@DavdGao DavdGao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@DavdGao DavdGao merged commit 0ca24a2 into agentscope-ai:main Jan 14, 2026
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Enhancement Enhance the existing functionalities

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants