
Conversation

@pranath-reddy
Member

Summary 📝

This PR introduces a Dynamic Relevancy Scoring Agent that evaluates repository content against dynamically generated relevance criteria using a three-level scoring rubric. The agent assesses combined repository content (README, description, title, topics) against both required and nice-to-have criteria, producing quantitative relevance scores with detailed qualitative reasoning. This enables precise ranking of search results while providing transparency into scoring decisions through specific content evidence.

Details

Dynamic Relevancy Scoring

Core Functionality

  • Evaluates repository content against query-specific criteria from DynamicRelevancyCriteriaAgent
  • Uses a three-level scoring rubric for each criterion (a scoring sketch follows this list):
    • Fully Satisfies (3.0): Content clearly and comprehensively addresses the criterion
    • Partially Satisfies (1.5): Content somewhat addresses the criterion but incompletely
    • Does Not Satisfy (0.0): Content does not address the criterion
  • Assesses all repository content fields: README text, description, title, topics, metadata
  • Provides detailed reasoning with specific evidence from content for each score
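
A minimal sketch of how the rubric levels could map to numeric scores and roll up into a single relevance metric. The level names and point values come from the rubric above; the aggregation (a weighted mean that favors required criteria) is an illustrative assumption, not necessarily the agent's actual formula.

from enum import Enum

# Rubric levels and point values as listed above
class RubricLevel(float, Enum):
    FULLY_SATISFIES = 3.0
    PARTIALLY_SATISFIES = 1.5
    DOES_NOT_SATISFY = 0.0

# Hypothetical aggregation: normalize per-criterion scores to [0, 1] and
# weight required criteria more heavily than nice-to-have ones.
def aggregate_relevance(
    required_scores: list[RubricLevel],
    nice_to_have_scores: list[RubricLevel],
    required_weight: float = 0.8,
) -> float:
    def normalized_mean(scores: list[RubricLevel]) -> float:
        if not scores:
            return 0.0
        return sum(s.value for s in scores) / (3.0 * len(scores))

    return (
        required_weight * normalized_mean(required_scores)
        + (1 - required_weight) * normalized_mean(nice_to_have_scores)
    )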

Evaluation Outputs

  • Quantitative relevance metric for ranking repositories
  • Qualitative reasoning traces explaining scoring decisions
  • Overall assessment with actionable feedback for query refinement
  • Identifies well-matched aspects, missing elements, and suggestions for improving retrieval

Code Changes

Added system prompt at akd/configs/code_prompts.py:

  • RELEVANCY_SCORING_PROMPT for evidence-based relevancy evaluation logic

New Agent Implementation: RelevancyScoringAgent

  • Extends LiteLLMInstructorBaseAgent with specialized input/output schemas (an output-schema sketch follows this list)
  • Single evaluation call efficiently assesses all criteria
  • Input: Query, content, required criteria, nice-to-have criteria
  • Output: Criterion evaluations, overall assessment with refinement feedback
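
The PR text does not show the output schema, so the sketch below is an assumption about its shape inferred from the bullets above (per-criterion evaluations plus an overall assessment with refinement feedback); the actual class and field names in akd.agents.relevance may differ.

from typing import List
from pydantic import BaseModel, Field

# Hypothetical output shape; field names are illustrative only.
class CriterionEvaluation(BaseModel):
    criterion_name: str
    score: float = Field(..., description="3.0, 1.5, or 0.0 per the rubric")
    reasoning: str = Field(..., description="Evidence drawn from the repository content")

class RelevancyScoringAgentOutputSchema(BaseModel):
    criterion_evaluations: List[CriterionEvaluation]
    overall_score: float
    overall_assessment: str = Field(..., description="Feedback usable for query refinement")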

Extended RelevancyCriterion schema:

  • Added is_required: bool field to distinguish required vs. nice-to-have criteria
  • This extends the base RelevanceCriterion from DynamicRelevancyCriteriaAgent (a model sketch follows this list)
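
A sketch of the extended criterion model, consistent with how RelevancyCriterion is constructed in the usage example below; only the is_required addition is stated in this PR, so treat the rest as assumed.

from pydantic import BaseModel, Field

# Fields mirror the constructor calls in the usage example below;
# is_required is the new field added in this PR.
class RelevancyCriterion(BaseModel):
    name: str
    description: str
    is_required: bool = Field(
        ...,
        description="True for required criteria, False for nice-to-have",
    )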

Usage

from akd.agents.relevance import RelevancyScoringAgent, RelevancyScoringAgentInputSchema, RelevancyCriterion
from akd.configs.code_prompts import RELEVANCY_SCORING_PROMPT

# Initialize scoring agent
scoring_agent = RelevancyScoringAgent(
    system_prompt=RELEVANCY_SCORING_PROMPT,
    model="gpt-4"
)

# Prepare criteria from DynamicRelevancyCriteriaAgent output
required_criteria = [
    RelevancyCriterion(
        name="ml_climate_integration",
        description="Implements ML algorithms for climate science",
        is_required=True
    ),
    RelevancyCriterion(
        name="climate_data_handling",
        description="Processes climate datasets",
        is_required=True
    )
]

nice_to_have_criteria = [
    RelevancyCriterion(
        name="visualization_tools",
        description="Provides visualization for climate outputs",
        is_required=False
    )
]

# Evaluate repository content
result = await scoring_agent.arun(
    RelevancyScoringAgentInputSchema(
        query="machine learning for climate modeling",
        content=repository_content,  # Concatenated README, description, etc.
        required_criteria=required_criteria,
        nice_to_have_criteria=nice_to_have_criteria
    )
)
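
Downstream code could then read the per-criterion evaluations and the overall score from the result. The attribute names below follow the hypothetical output schema sketched earlier and are assumptions, since the actual schema is not shown in this PR.

# Attribute names are hypothetical; adjust to the actual output schema.
for evaluation in result.criterion_evaluations:
    print(evaluation.criterion_name, evaluation.score, evaluation.reasoning)

print("Overall relevance:", result.overall_score)
print("Refinement feedback:", result.overall_assessment)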

Checks

  • Tested Changes
  • Stakeholder Approval

- Add an agent that uses the criteria generated by the dynamic criteria generation agent and outputs a relevancy score
- Add initial system prompt for the relevancy scoring agent
@github-actions

❌ Tests failed (exit code: 2)

📊 Test Results

  • Passed: 0
  • Failed: 0
  • Skipped: 0
  • Warnings: 7
  • Coverage: 0%

⚠️ Note: Test counts are 0, which may indicate parsing issues or early test failure. Check the workflow logs for details.
Parsing strategy used: summary_line

Branch: feature/code-relevancy-score
PR: #283
Commit: f1cad70

📋 Full coverage report and logs are available in the workflow run.

@github-actions

❌ Tests failed (exit code: 2)

📊 Test Results

  • Passed: 0
  • Failed: 0
  • Skipped: 0
  • Warnings: 7
  • Coverage: 0%

⚠️ Note: Test counts are 0, which may indicate parsing issues or early test failure. Check the workflow logs for details.
Parsing strategy used: summary_line

Branch: feature/code-relevancy-score
PR: #283
Commit: 51ad6bf

📋 Full coverage report and logs are available in the workflow run.

@NISH1001
Collaborator

NISH1001 commented Nov 21, 2025

cc: @iamsims check this. I think it'd be better to collaborate and come to a common ground on the score criteria here. We don't want various disconnected components doing the same things.
cc: @pranath-reddy as well. I will let you two discuss this.

@NISH1001
Collaborator

also @pranath-reddy i'd recommend putting the new relevance agents (if we happen to finalize/merge them later) into the already existing module akd.agents.relevancy

- Update base relevancy criterion to match the criteria gen agent
@github-actions

❌ Tests failed (exit code: 2)

📊 Test Results

  • Passed: 0
  • Failed: 0
  • Skipped: 0
  • Warnings: 8
  • Coverage: 0%

⚠️ Note: Test counts are 0, which may indicate parsing issues or early test failure. Check the workflow logs for details.
Parsing strategy used: summary_line

Branch: feature/code-relevancy-score
PR: #283
Commit: 8f51bbd

📋 Full coverage report and logs are available in the workflow run.
