feat: Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill #914

kuojianlu wants to merge 3 commits into microsoft:main from
Conversation
…undry Add agent/evaluate/agent-framework as sub-skill to microsoft foundry skill, which covers:
- Clarify metrics and dataset
- Configure judge model
- Generate evaluation code
- Run-fix loop to resolve code issues
- Generate evaluation.md
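To make the covered workflow concrete, here is a minimal, hypothetical sketch of the kind of pytest evaluation code the skill might generate. The evaluator function, dataset shape, and `fake_agent` stub are all illustrative assumptions, not the actual pytest-agent-evals plugin API.

```python
# Hypothetical sketch of a generated evaluation test; names and dataset
# shape are illustrative, not the real pytest-agent-evals API.

def exact_match_evaluator(response: str, expected: str) -> dict:
    """Code-based evaluator: score 1.0 when the response matches exactly."""
    score = 1.0 if response.strip() == expected.strip() else 0.0
    return {"metric": "exact_match", "score": score}

# A tiny evaluation dataset: each row pairs an input with a reference answer.
DATASET = [
    {"query": "2 + 2", "expected": "4"},
    {"query": "capital of France", "expected": "Paris"},
]

def fake_agent(query: str) -> str:
    """Stand-in for the deployed agent under evaluation."""
    return {"2 + 2": "4", "capital of France": "Paris"}[query]

def test_agent_exact_match():
    # The run-fix loop iterates until every dataset row passes its evaluator.
    for row in DATASET:
        result = exact_match_evaluator(fake_agent(row["query"]), row["expected"])
        assert result["score"] == 1.0
```

A test file in this shape is discoverable by pytest and, by extension, by the VS Code Test Explorer integration the skill describes.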
tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/integration.test.ts
```ts
const agentMetadata = await run({
  prompt: "Evaluate my Foundry agent built with Microsoft Agent Framework using pytest evaluators.",
  shouldEarlyTerminate: terminateOnCreate,
});
```

Check failure — Code scanning / CodeQL: Invocation of non-function (Error, test)
```ts
const agentMetadata = await run({
  prompt: "Add a custom evaluator to assess my agent's task completion using pytest-agent-evals.",
  shouldEarlyTerminate: terminateOnCreate,
});
```

Check failure — Code scanning / CodeQL: Invocation of non-function (Error, test)
Pull request overview
This draft PR adds a new evaluate/agent-framework sub-skill to the microsoft-foundry skill for evaluating AI agents built with Microsoft Agent Framework using the pytest-agent-evals plugin. The skill provides comprehensive guidance for setting up agent evaluation workflows, including metric selection, dataset creation, judge model configuration, and VS Code Test Explorer integration.
Changes:
- Adds new agent-framework evaluation sub-skill with comprehensive workflow documentation
- Includes three reference files documenting code examples, built-in evaluators, and custom evaluators
- Adds comprehensive test coverage (unit, trigger, and integration tests)
Reviewed changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/SKILL.md | Main skill file with evaluation workflow, judge model configuration, and error handling |
| plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/code-example.md | Complete pytest-agent-evals code example with key concepts and generation guidelines |
| plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/built-in-evaluators.md | Catalog of 15+ built-in evaluators for agents, general purpose, RAG, and similarity metrics |
| plugin/skills/microsoft-foundry/foundry-agent/evaluate/agent-framework/references/custom-evaluators.md | Patterns for custom prompt-based (LLM judge) and code-based evaluators |
| plugin/skills/microsoft-foundry/SKILL.md | Adds evaluate sub-skill entry to parent skill's sub-skill table and workflow mapping |
| tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/unit.test.ts | Unit tests validating skill metadata, content sections, and reference files |
| tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/triggers.test.ts | Trigger tests with positive/negative prompts and snapshot validation |
| tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/integration.test.ts | Integration tests measuring skill invocation rates for evaluation prompts |
| tests/microsoft-foundry/foundry-agent/evaluate/agent-framework/__snapshots__/triggers.test.ts.snap | Jest snapshot capturing expected keywords and description format |
```yaml
description: |
  Evaluate AI agents and workflows built with Microsoft Agent Framework using pytest-agent-evals plugin. Supports built-in and custom evaluators with VS Code Test Explorer integration.
  USE FOR: evaluate agent, test agent, assess agent, agent evaluation, pytest evaluation, measure agent performance, agent quality, add evaluator, evaluation dataset, judge model.
  DO NOT USE FOR: creating agents (use agent/create), deploying agents (use agent/deploy), evaluating non-Agent-Framework agents.
```
The frontmatter description uses a multiline literal block format (description: |) which is incompatible with skills.sh tooling. According to repository conventions (.github/skills/sensei/references/SCORING.md:140), descriptions MUST use inline double-quoted strings instead. Convert this to: description: "Evaluate AI agents and workflows built with Microsoft Agent Framework using pytest-agent-evals plugin. Supports built-in and custom evaluators with VS Code Test Explorer integration. USE FOR: evaluate agent, test agent, assess agent, agent evaluation, pytest evaluation, measure agent performance, agent quality, add evaluator, evaluation dataset, judge model. DO NOT USE FOR: creating agents (use agent/create), deploying agents (use agent/deploy), evaluating non-Agent-Framework agents."
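The reviewer's suggested replacement, rendered as frontmatter (the description text is verbatim from the comment above):

```yaml
---
description: "Evaluate AI agents and workflows built with Microsoft Agent Framework using pytest-agent-evals plugin. Supports built-in and custom evaluators with VS Code Test Explorer integration. USE FOR: evaluate agent, test agent, assess agent, agent evaluation, pytest evaluation, measure agent performance, agent quality, add evaluator, evaluation dataset, judge model. DO NOT USE FOR: creating agents (use agent/create), deploying agents (use agent/deploy), evaluating non-Agent-Framework agents."
---
```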
Add agent/evaluate/agent-framework as a sub-skill to the microsoft-foundry skill.
Important: This needs to be updated and merged after #824