Skip to content

Conversation

Kevinjil
Copy link
Contributor

@Kevinjil Kevinjil commented Jul 6, 2025

The idea behind this mode is that we are not really shadowing and trust the event feed results. However, we want to judge "interesting" runs locally to get useful information without the judging capacity to judge all testcases due to limited judgehost assignment.

We do not consider 'TLE' or 'AC' interesting, as rerunning will not yield much more information. We consider 'WA' very interesting and prioritize the judging, but allow manual judging to overtake the priority. We consider 'CE' somewhat interesting, but downprioritize them a lot. For other verdicts, keep the normal priority.

@Kevinjil Kevinjil force-pushed the analyst-eval branch 2 times, most recently from 299c4a2 to 60f966d Compare July 6, 2025 11:47
@Kevinjil Kevinjil marked this pull request as ready for review July 6, 2025 15:03
@Kevinjil Kevinjil force-pushed the analyst-eval branch 3 times, most recently from b1429a8 to ca69da5 Compare September 3, 2025 07:15
The idea behind this mode is that we are not really shadowing and trust
the event feed results. However, we want to judge "interesting" runs
locally to get useful information without the judging capacity to judge
all testcases due to limited judgehost assignment.

We do not consider 'TLE' or 'AC' interesting, as rerunning will not
yield much more information. We consider 'WA' very interesting and
prioritize the judging, but allow manual judging to overtake the
priority. We consider 'CE' somewhat interesting, but downprioritize them
a lot. For other verdicts, keep the normal priority.
When the QueueTask is already processed before the arrival of the
upstream verdict, the udpated JudgeTask will never be picked up by a
judge daemon. To ensure that the interesting run is actually picked up,
create a new QueueTask. Note that, in case of an already existing
unprocessed queue task, this should not break anything.
Otherwise there is a hidden dependency between the 'actual' constants, and these constants.
So instead of `-9` use `JudgeTask::PRIORITY_HIGH + 1` and instead of `9` use `JudgeTask::PRIORITY_LOW - 1`.

Suggested-by: Mart Pluijmaekers <[email protected]>
@Kevinjil Kevinjil requested a review from tuupke September 3, 2025 11:22
@Kevinjil Kevinjil added this pull request to the merge queue Sep 3, 2025
Merged via the queue into main with commit 7577142 Sep 3, 2025
42 of 47 checks passed
@Kevinjil Kevinjil deleted the analyst-eval branch September 3, 2025 11:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants