feat(runner): add optional timing collection for grading workflows#160

Open
Alexxigang wants to merge 1 commit into agentscope-ai:main from Alexxigang:feat/grading-runner-timing

Conversation

@Alexxigang

Summary

  • add a reusable TimingCollector utility under openjudge/utils/timer.py with context-manager and decorator support
  • instrument GradingRunner to optionally collect latency records for single evaluations, whole datasets, and multi-dataset runs
  • expose timing data through get_timing_records() and get_timing_summary(), with clear_timing_records() to reset collected records
  • add unit tests for the timing utility and runner timing integration
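The timer module itself is not shown in this view of the PR. As a rough sketch of what a timing utility with context-manager and decorator support might look like (the class shape, method names, and record format below are illustrative assumptions, not the actual `openjudge/utils/timer.py` API):

```python
import logging
import time
from contextlib import contextmanager
from functools import wraps

logger = logging.getLogger(__name__)


class TimingCollector:
    """Collect named latency records in memory; log each at DEBUG level."""

    def __init__(self):
        # Each record is a (name, elapsed_seconds) tuple.
        self.records = []

    @contextmanager
    def measure(self, name):
        """Context manager: time the enclosed block and store a record."""
        start = time.perf_counter()
        try:
            yield
        finally:
            elapsed = time.perf_counter() - start
            self.records.append((name, elapsed))
            logger.debug("%s took %.6fs", name, elapsed)

    def timed(self, name):
        """Decorator form: time every call to the wrapped function."""
        def decorator(func):
            @wraps(func)
            def wrapper(*args, **kwargs):
                with self.measure(name):
                    return func(*args, **kwargs)
            return wrapper
        return decorator
```

A caller could then wrap a single evaluation with `with collector.measure("grade_sample"): ...`, or decorate a helper with `@collector.timed("grade_sample")`, and inspect `collector.records` afterwards.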

Why this fix

OpenJudge did not have a built-in way to measure the latency of core evaluation workflow steps, which made it harder to identify bottlenecks or track regressions over time.

This PR introduces a lightweight timing utility that logs at DEBUG level by default and stores in-memory timing records for programmatic access. As a pilot integration, the grading workflow now supports optional timing collection without changing existing behavior when timing is disabled.
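The accessor names come from the summary above, but the runner internals are not shown here. A minimal sketch of how opt-in timing with these accessors could work (the constructor flag, record layout, and summary fields are assumptions for illustration; the real GradingRunner has a richer API):

```python
import statistics


class GradingRunner:
    """Sketch: grading runner with optional, off-by-default timing."""

    def __init__(self, enable_timing=False):
        self._timing_enabled = enable_timing
        self._timing_records = []

    def _record_timing(self, step, elapsed):
        """Store a timing record only when collection is enabled."""
        if self._timing_enabled:
            self._timing_records.append({"step": step, "elapsed": elapsed})

    def get_timing_records(self):
        """Return a copy of the raw per-step latency records."""
        return list(self._timing_records)

    def get_timing_summary(self):
        """Aggregate records per step: count, total, and mean latency."""
        by_step = {}
        for rec in self._timing_records:
            by_step.setdefault(rec["step"], []).append(rec["elapsed"])
        return {
            step: {
                "count": len(vals),
                "total": sum(vals),
                "mean": statistics.mean(vals),
            }
            for step, vals in by_step.items()
        }

    def clear_timing_records(self):
        """Drop all collected records."""
        self._timing_records.clear()
```

Because `_record_timing` is a no-op when the flag is off, existing callers that never enable timing see no behavioral change, which matches the "pilot integration" framing above.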

Closes #81.

Validation

  • PYTHONPATH=. pytest tests/utils/test_timer.py tests/runner/test_grading_runner.py -q



Development

Successfully merging this pull request may close these issues.

[Feature]: Add Time Consumption Evaluation for Key Operations
