Routing service #1716

madison-evans · 2025-05-20T20:28:18Z

the following PR is a re submission of a previously closed PR

Description

This PR adds a Router micro-service to GenAIComps.
The router decides which downstream LLM endpoint is best suited for an incoming prompt and returns that endpoint URL.
It is stateless and supports two interchangeable controller back-ends:

Controller	Technique	Config file	Typical use-case
RouteLLM	Matrix-factorisation ranking trained on preference datasets (e.g. `gpt4_judge_battles`)	`deployment/docker_compose/configs/routellm_config.yaml`	Fine-grained complexity routing
Semantic-Router	Embedding-similarity / threshold	`deployment/docker_compose/configs/semantic_router_config.yaml`	Simple “weak vs strong” intent separation

How the configs fit together


comps/router/
├─ deployment/docker\_compose/configs/
│  ├─ router.yaml                # *Global* — maps strong/weak endpoints & points to per-controller YAMLs
│  ├─ routellm\_config.yaml       # RouteLLM-specific knobs (datasets, MF checkpoints, thresholds)
│  └─ semantic\_router\_config.yaml# Semantic-Router routes & encoder names
└─ src/opea\_router\_microservice.py
• Reads CONFIG\_PATH (env var) → loads router.yaml
• Uses “controller\_config\_paths” section to locate the chosen sub-config
• Instantiates controller via `ControllerFactory`

At runtime, docker compose mounts ./configs → /app/configs in the container.
CONFIG_PATH=/app/configs/router.yaml tells the service where to start.

Deployment:

Docker Compose bundle (deployment/docker_compose/compose.yaml + deploy_router.sh)

Issues

n/a — new component

Type of change

New feature (non-breaking change which adds new functionality)
Bug fix
Breaking change
Others (enhancement, documentation, validation, etc.)

Dependencies

New PyPI packages added to comps/router/src/requirements.txt:

routellm — an Intel fork of the RouteLLM project (https://github.com/SAPD-Intel/RouteLLM)
semantic-router — embedding-based router
huggingface-hub (pulled transitively)

These depend on the existing stack (FastAPI, Pydantic, etc.).
Runtime requires HF_TOKEN and OPENAI_API_KEY secrets.

Tests

End-to-end validation script tests/router/test_router_routellm_on_xeon.sh

Builds the router image (comps/router/src/Dockerfile)
Spins up the service via Docker Compose
Sends two sample prompts
expects "weak" route for easy math, "strong"
Shuts everything down

This assumes CI pipeline already exposes HF_TOKEN and OPENAI_API_KEY, so the script is invoked automatically.

# local smoke-test
export HF_TOKEN=***   OPENAI_API_KEY=***
chmod +x tests/router/test_router_routellm_on_xeon.sh
tests/router/test_router_routellm_on_xeon.sh

Signed-off-by: Madison Evans <[email protected]>

…ments.txt contents Signed-off-by: Madison Evans <[email protected]>

Signed-off-by: Madison Evans <[email protected]>

…ty str Signed-off-by: Madison Evans <[email protected]>

Signed-off-by: Madison Evans <[email protected]>

lvliang-intel · 2025-05-22T08:10:16Z

@madison-evans,
please fix the CI issue, you need to add your new service to docker build yaml.

comps/router/src/opea_router_microservice.py

ftian1 · 2025-05-27T08:09:33Z

https://github.com/SAPD-Intel/RouteLLM

have you went through security process for this fork repo to be referenced?

joshuayao · 2025-05-28T01:39:37Z

Hi @madison-evans Could you please check the CI failures?

…g 'routellm-e5-base-V2' under OPEA HF group Signed-off-by: Madison Evans <[email protected]>

Signed-off-by: Madison Evans <[email protected]>

madison-evans · 2025-05-28T19:09:05Z

@madison-evans, please fix the CI issue, you need to add your new service to docker build yaml.

for clarity, are you saying that I need to add my compose.yaml to .github/workflows/docker/compose as router-compose.yaml? Do I have that correct?

Signed-off-by: Madison Evans <[email protected]>

…cy. Now pulls from the referenced repo and then applies the patch located at 'comps/router/src/hf_compatibility.patch' Signed-off-by: Madison Evans <[email protected]>

Signed-off-by: Madison Evans <[email protected]>

madison-evans · 2025-05-28T20:30:16Z

example-test seems to be failing...

line 17, in <module>
      from llama_index.llms.openai import OpenAI
    File "/usr/local/lib/python3.11/site-packages/llama_index/llms/openai/__init__.py", line 2, in <module>
      from llama_index.llms.openai.responses import OpenAIResponses
    File "/usr/local/lib/python3.11/site-packages/llama_index/llms/openai/responses.py", line 6, in <module>
      from openai.types.responses import (
  ImportError: cannot import name 'ResponseTextAnnotationDeltaEvent' from 'openai.types.responses' (/usr/local/lib/python3.11/site-packages/openai/types/responses/__init__.py)
  + exit 1
  Error: Process completed with exit code 1.

A bit confused why that is. The commits I've made have been contained within comps/router

codecov · 2025-05-29T17:59:17Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Files with missing lines	Coverage Δ
comps/cores/proto/api_protocol.py	`92.08% <100.00%> (+0.03%)`	⬆️

... and 3 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

chensuyue · 2025-05-30T01:36:37Z

example-test seems to be failing...

line 17, in <module>
      from llama_index.llms.openai import OpenAI
    File "/usr/local/lib/python3.11/site-packages/llama_index/llms/openai/__init__.py", line 2, in <module>
      from llama_index.llms.openai.responses import OpenAIResponses
    File "/usr/local/lib/python3.11/site-packages/llama_index/llms/openai/responses.py", line 6, in <module>
      from openai.types.responses import (
  ImportError: cannot import name 'ResponseTextAnnotationDeltaEvent' from 'openai.types.responses' (/usr/local/lib/python3.11/site-packages/openai/types/responses/__init__.py)
  + exit 1
  Error: Process completed with exit code 1.

A bit confused why that is. The commits I've made have been contained within comps/router

This issue caused by deps update, and has been fixed yesterday. This example-test triggered due to your code update of comps/cores/ folder.

Signed-off-by: Madison Evans <[email protected]>

madison-evans · 2025-05-30T14:21:19Z

all tests are passing now. Ready for review

.github/workflows/docker/compose/router-compose.yaml

comps/router/deployment/docker_compose/configs/routellm_config.yaml

comps/router/deployment/docker_compose/compose.yaml

comps/router/deployment/docker_compose/configs/router.yaml

comps/router/deployment/docker_compose/configs/semantic_router_config.yaml

comps/router/src/Dockerfile

comps/router/src/README.md

...router/src/integrations/controllers/semantic_router_controller/semantic_router_controller.py

comps/router/src/requirements.txt

Signed-off-by: Haim Barad <[email protected]>

haim-barad · 2025-06-09T05:42:43Z

Fixed as per your review comments - @ashahba

Signed-off-by: Haim Barad <[email protected]>

ashahba

LGTM!

haim-barad · 2025-06-10T02:59:02Z

What step (I assume last step) is needed for merge? Is "Update Branch" enough? If yes, which method?

madison-evans requested review from chensuyue, ftian1, lkk12014402, lvliang-intel, minmin-intel and rbrugaro as code owners May 20, 2025 20:28

madison-evans added 6 commits May 20, 2025 20:29

initial file structure created. Populated with unimplemented files

de9facc

Signed-off-by: Madison Evans <[email protected]>

added relevant code to files within comps/router/deployment

2763712

Signed-off-by: Madison Evans <[email protected]>

added Dockerfile, opea_router_microservice.py, README.md, and require…

358e027

…ments.txt contents Signed-off-by: Madison Evans <[email protected]>

added controller components for router instances

4fc9690

Signed-off-by: Madison Evans <[email protected]>

added initial routellm controller test script in router directory

de5cfee

Signed-off-by: Madison Evans <[email protected]>

fixed requirements.txt issue

56b8b2d

Signed-off-by: Madison Evans <[email protected]>

madison-evans force-pushed the routing-service branch from a23d2fb to 710df33 Compare May 20, 2025 20:29

added HUGGINGFACEHUB_API_TOKEN as an env variable

23aeeeb

Signed-off-by: Madison Evans <[email protected]>

madison-evans force-pushed the routing-service branch 2 times, most recently from 5d91ac2 to 47e15f4 Compare May 21, 2025 15:09

removed hard OPENAI dependency and made OPENAI_API_KEY default to emp…

5aab6bc

…ty str Signed-off-by: Madison Evans <[email protected]>

madison-evans force-pushed the routing-service branch from 47e15f4 to 5aab6bc Compare May 21, 2025 15:12

madison-evans added 3 commits May 21, 2025 15:23

removed empty str fallback for OPENAI_API_KEY var

b4c86b0

Signed-off-by: Madison Evans <[email protected]>

target localhost in RouteLLM E2E test to avoid Docker network issues

9262b05

Signed-off-by: Madison Evans <[email protected]>

fixed e2e test issue for routellm test

64c8507

Signed-off-by: Madison Evans <[email protected]>

joshuayao linked an issue May 22, 2025 that may be closed by this pull request

[Feature] RouteLLM #936

Closed

joshuayao mentioned this pull request May 22, 2025

[Feature] RouteLLM #936

Closed

ftian1 reviewed May 27, 2025

View reviewed changes

comps/router/src/opea_router_microservice.py Outdated Show resolved Hide resolved

madison-evans added 2 commits May 28, 2025 18:58

changed the checkpoint path for the custom mf model weights. Now usin…

cf1622c

…g 'routellm-e5-base-V2' under OPEA HF group Signed-off-by: Madison Evans <[email protected]>

moved RouteEndpointDoc class into 'api_protocol.py' under cores/proto

efdd653

Signed-off-by: Madison Evans <[email protected]>

madison-evans added 2 commits May 28, 2025 19:14

added 'router-compose.yaml' to workflows/docker/compose

2d8e71e

Signed-off-by: Madison Evans <[email protected]>

pre commit format updates

9eb977a

Signed-off-by: Madison Evans <[email protected]>

madison-evans requested review from Spycsh, ZePan110 and letonghan as code owners May 28, 2025 19:17

madison-evans added 2 commits May 28, 2025 19:58

removed the forked version of RouteLLM from requirements.txt dependen…

8db8aa2

…cy. Now pulls from the referenced repo and then applies the patch located at 'comps/router/src/hf_compatibility.patch' Signed-off-by: Madison Evans <[email protected]>

updated README to reflect the patch usage for modified RouteLLM repo

beadac5

Signed-off-by: Madison Evans <[email protected]>

Merge branch 'opea-project:main' into routing-service

febad4f

madison-evans added 2 commits May 30, 2025 02:12

added H1 title to README

5316753

Signed-off-by: Madison Evans <[email protected]>

Merge branch 'opea-project:main' into routing-service

a9bf90a

Merge branch 'opea-project:main' into routing-service

cbae17f

ashahba requested changes Jun 6, 2025

View reviewed changes

comply with formatting requests.

96fcda3

Signed-off-by: Haim Barad <[email protected]>

haim-barad force-pushed the routing-service branch from 30501c0 to 96fcda3 Compare June 9, 2025 05:41

fix pre-commit issues: remove trailing whitespace and add newline

357ff33

Signed-off-by: Haim Barad <[email protected]>

ashahba approved these changes Jun 9, 2025

View reviewed changes

lvliang-intel approved these changes Jun 10, 2025

View reviewed changes

ZePan110 approved these changes Jun 10, 2025

View reviewed changes

ashahba merged commit 5e08c3f into opea-project:main Jun 10, 2025
17 checks passed

haim-barad deleted the routing-service branch June 10, 2025 10:03

yinghu5 added this to the v1.4 milestone Jun 11, 2025

Routing service #1716

Routing service #1716

Conversation

madison-evans commented May 20, 2025

Description

Issues

Type of change

Dependencies

Tests

Uh oh!

lvliang-intel commented May 22, 2025

Uh oh!

Uh oh!

ftian1 commented May 27, 2025

Uh oh!

joshuayao commented May 28, 2025

Uh oh!

madison-evans commented May 28, 2025

Uh oh!

madison-evans commented May 28, 2025

Uh oh!

codecov bot commented May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

chensuyue commented May 30, 2025

Uh oh!

madison-evans commented May 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

haim-barad commented Jun 9, 2025

Uh oh!

ashahba left a comment

Choose a reason for hiding this comment

Uh oh!

haim-barad commented Jun 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

codecov bot commented May 29, 2025 •

edited

Loading