Open
Labels
HARD, OSCG26, enhancement (New feature or request), infrastructure (Infrastructure and scalability)
Description
The web backend has no request queue, load balancer, or worker pool: each FastAPI request runs the full pipeline in-process. Under concurrent load, model API calls exhaust the available connections, rate-limit errors stack up, and memory grows with the in-memory caches. A worker, mq_agent.py, already exists but is not integrated into the web backend.
Component
System Architecture, ai_council/worker/mq_agent.py
Acceptance Criteria
Integrate load balancing and the existing message queue worker (mq_agent.py) into the web backend so that concurrent async queries are queued instead of each running the full pipeline in-process. Distribute model API calls across worker pools.
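
To make the criteria above concrete, here is a minimal sketch of the queue-plus-worker-pool pattern, under stated assumptions. It uses only the standard library's asyncio; the in-process `asyncio.Queue` stands in for a real message queue, and `WorkerPool` and `call_model` are hypothetical names that do not reflect the actual interface of mq_agent.py. The point is only to show how a bounded queue and a fixed number of workers cap the number of concurrent model API calls.

```python
import asyncio


async def call_model(query: str) -> str:
    """Hypothetical stand-in for a real model API call."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"answer:{query}"


class WorkerPool:
    """Fixed-size pool of asyncio workers fed by a bounded queue (illustrative only)."""

    def __init__(self, handler, num_workers: int = 4, max_queue: int = 100):
        self._handler = handler          # coroutine function that processes one query
        self._num_workers = num_workers  # upper bound on concurrent handler calls
        self._max_queue = max_queue      # submit() waits once this many jobs are pending
        self._queue = None
        self._workers = []

    async def start(self) -> None:
        # Create the queue inside the running event loop, then spawn the workers.
        self._queue = asyncio.Queue(maxsize=self._max_queue)
        self._workers = [
            asyncio.create_task(self._run()) for _ in range(self._num_workers)
        ]

    async def _run(self) -> None:
        while True:
            query, future = await self._queue.get()
            try:
                result = await self._handler(query)
                if not future.done():
                    future.set_result(result)
            except Exception as exc:
                # Hand the failure back to the caller instead of killing the worker.
                if not future.done():
                    future.set_exception(exc)
            finally:
                self._queue.task_done()

    async def submit(self, query: str):
        """Enqueue a query and wait for its result; applies backpressure when full."""
        future = asyncio.get_running_loop().create_future()
        await self._queue.put((query, future))
        return await future

    async def stop(self) -> None:
        for worker in self._workers:
            worker.cancel()
        await asyncio.gather(*self._workers, return_exceptions=True)


async def main() -> None:
    pool = WorkerPool(call_model, num_workers=2, max_queue=10)
    await pool.start()
    answers = await asyncio.gather(*(pool.submit(f"q{i}") for i in range(5)))
    print(answers)
    await pool.stop()


if __name__ == "__main__":
    asyncio.run(main())
```

In a FastAPI backend, one plausible arrangement would be to start such a pool at application startup and have each request handler await `pool.submit(...)` rather than running the pipeline directly. Scaling across processes or hosts would require replacing the in-process queue with an external broker, which this sketch does not attempt.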