Conversation


@iamsims iamsims commented Nov 19, 2025

Summary

  • Implements an LLM-based reranker that evaluates search results using configurable criteria, weights, and fields.
  • In addition to sorting, provides a label for each criterion.
  • Two implementations:
    - Single LLM call per result, evaluating all criteria simultaneously (commit 02eb523)
    - Multiple LLM calls per result, one per criterion (commit 166f475)
  • Supports weighted multi-criteria scoring with customizable categories (a hypothetical usage sketch follows below).
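As a rough illustration of the intended API, here is a hypothetical usage sketch assembled from names visible in this PR (ScoringCriterion, LLMRerankerToolConfig, create_reranker, RerankerType); the import path, the RerankerType.LLM member, and the field names are assumptions, not the final interface.

from akd.rerankers import (  # assumed module path
    LLMRerankerToolConfig,
    RerankerType,
    ScoringCriterion,
    create_reranker,
)

# Hypothetical weighted multi-criteria config: each criterion carries a
# weight, and the LLM assigns a category/label per criterion per result.
config = LLMRerankerToolConfig(
    criteria=[
        ScoringCriterion(
            name="Relevance",
            description="How directly does this result address the query?",
            weight=0.7,
        ),
        ScoringCriterion(
            name="Recency",
            description="How up to date is this result?",
            weight=0.3,
        ),
    ],
)
reranker = create_reranker(RerankerType.LLM, config=config)  # assumed enum member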

@github-actions

❌ Tests failed (exit code: 1)

📊 Test Results

  • Passed: 548
  • Failed: 5
  • Skipped: 7
  • Warnings: 164
  • Coverage: 79%

Branch: feature/llm-reranker
PR: #278
Commit: 0386629

📋 Full coverage report and logs are available in the workflow run.

@NISH1001 NISH1001 self-requested a review November 20, 2025 21:04
@NISH1001

@iamsims add a detailed # Usage section to the PR description as well.

See some of the older PRs we have done, e.g. #279.


@NISH1001 NISH1001 left a comment

Initial review

Comment on lines +212 to +219
agent_system_prompt: str = Field(
    default=(
        "You are an expert at evaluating search results. "
        "Analyze the provided result for the query against all given criteria and "
        "select the most appropriate category for each. Provide clear reasoning."
    ),
    description="System prompt for the internal scoring agent",
)

have this read from os.getenv as well so that we can globally modify it whenever we want...and default to this value you have. Something like AKD_LLM_RERANKER_SYSTEM_PROMPT
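A minimal sketch of that suggestion, assuming the field lives on a pydantic config model; the env var name is from the comment above, the rest mirrors the snippet being reviewed:

import os

from pydantic import BaseModel, Field

_DEFAULT_SYSTEM_PROMPT = (
    "You are an expert at evaluating search results. "
    "Analyze the provided result for the query against all given criteria and "
    "select the most appropriate category for each. Provide clear reasoning."
)

class LLMRerankerToolConfig(BaseModel):  # actual base class may differ
    # default_factory re-reads the env var at instantiation time and falls
    # back to the hard-coded default when AKD_LLM_RERANKER_SYSTEM_PROMPT
    # is unset.
    agent_system_prompt: str = Field(
        default_factory=lambda: os.getenv(
            "AKD_LLM_RERANKER_SYSTEM_PROMPT",
            _DEFAULT_SYSTEM_PROMPT,
        ),
        description="System prompt for the internal scoring agent",
    )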

Comment on lines +236 to +240
ScoringCriterion(
    name="Processing Level",
    description="How well does this result match the required processing level?",
    weight=0.5,
),

This seems too specific to the data search agent. We need to find some general criteria that can be globally applied to any use case...
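Purely for illustration, domain-agnostic defaults could look something like the following; the names, descriptions, and weights are placeholders, not a proposal from this thread:

# Illustrative only: generic criteria not tied to the data search agent,
# using the ScoringCriterion class from this PR.
GENERIC_CRITERIA = [
    ScoringCriterion(
        name="Relevance",
        description="How directly does this result address the query?",
        weight=0.6,
    ),
    ScoringCriterion(
        name="Completeness",
        description="How fully does this result cover what the query asks for?",
        weight=0.4,
    ),
]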

            debug: Enable debug logging
        """
        super().__init__(config=config, debug=debug)
        self.config: LLMRerankerToolConfig = self.config  # type hint

we don't need this...it will automatically be done through super()...


class ScoringAgent(LiteLLMInstructorBaseAgent):
    input_schema = DummyInput
    output_schema = dynamic_scoring_model

move this to another internal method like _setup_scoring_agent or something

and we can just do `self.scoring_agent = self._setup_scoring_agent(model, ...)`
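A rough sketch of that refactor; the helper name comes from the comment, while the parameter and constructor details are assumptions:

# Sketch only: build the per-instance scoring agent in a helper instead of
# inlining the class definition.
def _setup_scoring_agent(self, scoring_model):
    class ScoringAgent(LiteLLMInstructorBaseAgent):
        input_schema = DummyInput
        output_schema = scoring_model

    return ScoringAgent()  # constructor args (model, config, ...) assumed

# then, in __init__:
# self.scoring_agent = self._setup_scoring_agent(dynamic_scoring_model)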

        Similar to relevancy-ranker.py approach - creates explicit named fields
        for each criterion so LLM can see them in the JSON schema.
        """
        from pydantic import create_model

move to top-level import

        )

        if self.debug:
            print(formatted_prompt)

logger.debug(...)

            },
        ]

        print(messages)

remove print statement...or add if self.debug: logger.debug(...)

 def create_reranker(
     reranker_type: RerankerType,
-    config: RerankerToolConfig | None = None,
+    config: RerankerToolConfig | LLMRerankerToolConfig | None = None,

Technically we don't need LLMRerankerToolConfig here, because it's also a type of RerankerToolConfig. Redundant type hint.

Comment on lines +471 to +477

        response = await self.scoring_agent.get_response_async(
            messages=messages,
        )

        results = {}
        response_dict = response.model_dump()

what if we do this at the agent.arun level? since it's a one-level-higher abstraction. is it possible to do it with the formatted_prompt? you have to convert it to a pydantic input schema
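As a sketch of that suggestion, assuming a hypothetical input schema and an arun(...) method that accepts it:

from pydantic import BaseModel

class ScoringAgentInput(BaseModel):  # hypothetical schema name
    prompt: str

# Wrap the formatted prompt in a pydantic input schema so the call can go
# through the higher-level agent.arun(...) instead of the lower-level
# get_response_async(messages=...).
params = ScoringAgentInput(prompt=formatted_prompt)
response = await self.scoring_agent.arun(params)  # arun signature assumed
results = response.model_dump()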

github-actions bot commented Dec 2, 2025

❌ Tests failed (exit code: 1)

📊 Test Results

  • Passed: 548
  • Failed: 5
  • Skipped: 7
  • Warnings: 169
  • Coverage: 79%

Branch: feature/llm-reranker
PR: #278
Commit: b59272c

📋 Full coverage report and logs are available in the workflow run.

github-actions bot commented Dec 2, 2025

❌ Tests failed (exit code: 1)

📊 Test Results

  • Passed: 548
  • Failed: 5
  • Skipped: 7
  • Warnings: 167
  • Coverage: 79%

Branch: feature/llm-reranker
PR: #278
Commit: ce52969

📋 Full coverage report and logs are available in the workflow run.
