-
Notifications
You must be signed in to change notification settings - Fork 60
Open
Description
This roadmap is a living guide to Unified Cache Manager (UCM)’s evolution. We’re shaping its Q1 2026 direction and welcome your input—your feedback will help us prioritize what matters most for the community and production use.
Primary goal for 2026 Q1:
Refactor the UCM Store and sparse attention architectures to enhance modularity and stability, and optimize performance, while broadening northbound and southbound compatibility with more inference engines and storage backends.
Core
- Store
- Refactor UCM Store Architecture
- PipelineStore Framework Upgrade and Optimization
- Add Layerwise Connector
- KVCompress
- Adapt Ds3fs Store
- Sparse
- Spare Attention Framework Upgrade(v2)
- Spare Attention Offload Performance Optimization
- GSA/ESA on device
- Adapt SGLang
- Support Prefix Cache
- CacheBlend Framework Optimization
- Model Window Extrapolation-Rerope
CI/CD
- Workflow (build/unittest/install/e2e inference)
- Correctness test (continuously updated)
Test
- Unified LLM interface and simplified test configuration
- Expanded accuracy validation (offline test + UC Eval)
- Added pre-run environment verification
Others
- Observability: Metrics Optimization
- Add UCM Logger module
Documentation & community management
- Add Contributing Guide
- Add Code of Conduct
Metadata
Metadata
Assignees
Labels
No labels