-
Notifications
You must be signed in to change notification settings - Fork 71
[Roadmap] UCM Roadmap Q1 2026 #679
Copy link
Copy link
Open
Description
This roadmap is a living guide to Unified Cache Manager (UCM)’s evolution. We’re shaping its Q1 2026 direction and welcome your input—your feedback will help us prioritize what matters most for the community and production use.
Primary goal for 2026 Q1:
Refactor the UCM Store and sparse attention architectures to enhance modularity and stability, and optimize performance, while broadening northbound and southbound compatibility with more inference engines and storage backends.
Core
- Store
- Refactor UCM Store Architecture
- PipelineStore Framework Upgrade and Optimization
- Add Layerwise Connector
- Adapt Deepseek V4
- KVCompress
- Adapt Ds3fs Store
- Garbage Collection in Posix Store
- Sparse
- Spare Attention Framework Upgrade(v2)
- Spare Attention Offload Performance Optimization
- GSA on device
- Adapt SGLang
- Support Prefix Cache
- Adapt MindIE
- Support Prefix Cache
- CacheBlend Framework Optimization
- Model Window Extrapolation-Rerope
CI/CD
- Workflow (build/unittest/install/e2e inference)
- Correctness test (continuously updated)
Test
- Unified LLM interface and simplified test configuration
- Expanded accuracy validation (offline test + UC Eval)
- Added pre-run environment verification
Others
- Observability: Metrics Optimization
- Add UCM Logger module
Documentation & community management
- Add Contributing Guide
- Add Code of Conduct
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels