Introduce automatic LLM routing #3454
Labels: area: backend; enhancement; rag: retrieval
Currently, we configure LLMs manually and assign the same LLM to every node of a workflow, regardless of how demanding each node's task is.
Implementing automatic routing, for instance using https://github.com/lm-sys/RouteLLM, would let us send simpler requests to a cheaper, faster model and reserve the stronger model for harder ones. This should cut cost and latency while leaving performance largely unchanged.
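To make the idea concrete, here is a minimal sketch of per-node, threshold-based routing between a strong and a weak model. It is not RouteLLM's API: the node names, model names, thresholds, and the word-count scorer are all hypothetical placeholders. In practice the scorer would be replaced by a trained router such as the ones RouteLLM provides.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class RoutingPolicy:
    """Per-node routing settings (all values are illustrative assumptions)."""
    strong_model: str
    weak_model: str
    threshold: float  # scores at or above this go to the strong model

    def choose(self, score: float) -> str:
        return self.strong_model if score >= self.threshold else self.weak_model


def length_based_score(prompt: str) -> float:
    """Placeholder difficulty score in [0, 1]: longer prompts score higher.

    A real router would use a learned model here, not word count.
    """
    return min(len(prompt.split()) / 50.0, 1.0)


def route_node(
    node: str,
    prompt: str,
    policies: Dict[str, RoutingPolicy],
    scorer: Callable[[str], float] = length_based_score,
) -> str:
    """Return the model name to use for this workflow node and prompt."""
    return policies[node].choose(scorer(prompt))


# Hypothetical workflow nodes, each with its own threshold.
policies = {
    "retrieval_query_rewrite": RoutingPolicy("strong-model", "weak-model", 0.8),
    "answer_synthesis": RoutingPolicy("strong-model", "weak-model", 0.3),
}

short_prompt = "hi"
long_prompt = " ".join(["word"] * 20)  # 20 words, score 0.4

print(route_node("answer_synthesis", short_prompt, policies))
print(route_node("answer_synthesis", long_prompt, policies))
print(route_node("retrieval_query_rewrite", long_prompt, policies))
```

Keeping one threshold per node means a cheap step such as query rewriting can default to the weak model, while answer synthesis escalates to the strong model more readily. The thresholds would need to be calibrated against our own quality and cost measurements.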
See also: