Skip to content

feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill#914

Open
kuojianlu wants to merge 3 commits intomicrosoft:mainfrom
kuojianlu:kuojianlu/eval-skill
Open

feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill#914
kuojianlu wants to merge 3 commits intomicrosoft:mainfrom
kuojianlu:kuojianlu/eval-skill

Conversation

@kuojianlu
Copy link
Contributor

@kuojianlu kuojianlu commented Feb 13, 2026

Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill, which covers:

  • Clarify metrics, dataset
  • Configure judge model
  • Generate evaluation codes
  • Run-fix loop to resolve code issue
  • Generate evaluation.md

Important: This needs to be updated and merged after #824

…undry

Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill, which covers:
- Clarify metrics, dataset
- Configure judge model
- Generate evaluation codes
- Run-fix loop to resolve code issue
- Generate evaluation.md
@kuojianlu kuojianlu changed the title feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill [DRAFT] feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill Feb 13, 2026
@kuojianlu kuojianlu marked this pull request as ready for review February 27, 2026 10:15
Copilot AI review requested due to automatic review settings February 27, 2026 10:15
Comment on lines +47 to +50
const agentMetadata = await run({
prompt: "Evaluate my Foundry agent built with Microsoft Agent Framework using pytest evaluators.",
shouldEarlyTerminate: terminateOnCreate,
});

Check failure

Code scanning / CodeQL

Invocation of non-function Error test

Callee is not a function: it has type undefined.
Comment on lines +75 to +78
const agentMetadata = await run({
prompt: "Add a custom evaluator to assess my agent's task completion using pytest-agent-evals.",
shouldEarlyTerminate: terminateOnCreate,
});

Check failure

Code scanning / CodeQL

Invocation of non-function Error test

Callee is not a function: it has type undefined.
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This draft PR adds a new evaluate/agent-framework sub-skill to the microsoft-foundry skill for evaluating AI agents built with Microsoft Agent Framework using the pytest-agent-evals plugin. The skill provides comprehensive guidance for setting up agent evaluation workflows, including metric selection, dataset creation, judge model configuration, and VS Code Test Explorer integration.

Changes:

  • Adds new agent-framework evaluation sub-skill with comprehensive workflow documentation
  • Includes three reference files documenting code examples, built-in evaluators, and custom evaluators
  • Adds comprehensive test coverage (unit, trigger, and integration tests)

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/SKILL.md Main skill file with evaluation workflow, judge model configuration, and error handling
plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/code-example.md Complete pytest-agent-evals code example with key concepts and generation guidelines
plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/built-in-evaluators.md Catalog of 15+ built-in evaluators for agents, general purpose, RAG, and similarity metrics
plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/custom-evaluators.md Patterns for custom prompt-based (LLM judge) and code-based evaluators
plugin/skills/microsoft-foundry/SKILL.md Adds evaluate sub-skill entry to parent skill's sub-skill table and workflow mapping
tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/unit.test.ts Unit tests validating skill metadata, content sections, and reference files
tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/triggers.test.ts Trigger tests with positive/negative prompts and snapshot validation
tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/integration.test.ts Integration tests measuring skill invocation rates for evaluation prompts
tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/__snapshots__/triggers.test.ts.snap Jest snapshot capturing expected keywords and description format

Comment on lines +3 to +6
description: |
Evaluate AI agents and workflows built with Microsoft Agent Framework using pytest-agent-evals plugin. Supports built-in and custom evaluators with VS Code Test Explorer integration.
USE FOR: evaluate agent, test agent, assess agent, agent evaluation, pytest evaluation, measure agent performance, agent quality, add evaluator, evaluation dataset, judge model.
DO NOT USE FOR: creating agents (use agent/create), deploying agents (use agent/deploy), evaluating non-Agent-Framework agents.
Copy link

Copilot AI Feb 27, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The frontmatter description uses a multiline literal block format (description: |) which is incompatible with skills.sh tooling. According to repository conventions (.github/skills/sensei/references/SCORING.md:140), descriptions MUST use inline double-quoted strings instead. Convert this to: description: "Evaluate AI agents and workflows built with Microsoft Agent Framework using pytest-agent-evals plugin. Supports built-in and custom evaluators with VS Code Test Explorer integration. USE FOR: evaluate agent, test agent, assess agent, agent evaluation, pytest evaluation, measure agent performance, agent quality, add evaluator, evaluation dataset, judge model. DO NOT USE FOR: creating agents (use agent/create), deploying agents (use agent/deploy), evaluating non-Agent-Framework agents."

Copilot generated this review using guidance from repository custom instructions.
@kuojianlu kuojianlu changed the title [DRAFT] feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill Feb 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants