Skip to content

[feat]add skills graders#152

Merged
XiaoBoAI merged 6 commits intoagentscope-ai:mainfrom
helloml0326:zhuohua/skills_grader
Mar 27, 2026
Merged

[feat]add skills graders#152
XiaoBoAI merged 6 commits intoagentscope-ai:mainfrom
helloml0326:zhuohua/skills_grader

Conversation

@helloml0326
Copy link
Copy Markdown
Collaborator

OpenJudge Version

[The version of OpenJudge you are working on, e.g. import openjudge; print(openjudge.__version__)]

Description

[Please describe the background, purpose, changes made, and how to test this PR]

Checklist

Please check the following items before code is ready to be reviewed.

  • Code has been formatted with pre-commit run --all-files command
  • All tests are passing
  • Docstrings are in Google style
  • Related documentation has been updated (e.g. links, examples, etc.)
  • Code is ready for review

@helloml0326 helloml0326 changed the title Zhuohua/skills grader [feat]add skills graders Mar 20, 2026
@helloml0326 helloml0326 changed the title [feat]add skills graders [WIP][feat]add skills graders Mar 20, 2026
- Refine skill grader modules (comprehensive, pairwise, relevance, safety) and exports
- Simplify skill grader test suites with less boilerplate
- Ignore tests/graders/skills/*.json artifacts in .gitignore
- Run pytest in pre-commit via uv so the project environment provides pytest
- Fix flake8 E226 spacing and long-line issues in cookbooks, experiments, UI batch panels

Made-with: Cursor
Align skill graders with tool_call_accuracy: each default template now uses
a system message (LLMGrader.SYSTEM_PROMPT_EN/ZH) plus the existing user rubric.

Made-with: Cursor
Use python -m pytest for the local pytest hook (no uv in CI). Match main
hooks: drop gitleaks and detect-aws-credentials extras from the feature branch.

Made-with: Cursor
@helloml0326 helloml0326 changed the title [WIP][feat]add skills graders [feat]add skills graders Mar 27, 2026
@XiaoBoAI XiaoBoAI merged commit 749c499 into agentscope-ai:main Mar 27, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants