Skip to content

[Roadmap] UCM Roadmap Q1 2026 #679

@yuanzhg078

Description

@yuanzhg078

This roadmap is a living guide to Unified Cache Manager (UCM)’s evolution. We’re shaping its Q1 2026 direction and welcome your input—your feedback will help us prioritize what matters most for the community and production use.

Primary goal for 2026 Q1:
Refactor the UCM Store and sparse attention architectures to enhance modularity and stability, and optimize performance, while broadening northbound and southbound compatibility with more inference engines and storage backends.

Core

  • Store
    • Refactor UCM Store Architecture
    • PipelineStore Framework Upgrade and Optimization
    • Add Layerwise Connector
    • KVCompress
    • Adapt Ds3fs Store
  • Sparse
    • Spare Attention Framework Upgrade(v2)
    • Spare Attention Offload Performance Optimization
    • GSA/ESA on device
  • Adapt SGLang
    • Support Prefix Cache
  • CacheBlend Framework Optimization
  • Model Window Extrapolation-Rerope

CI/CD

  • Workflow (build/unittest/install/e2e inference)
  • Correctness test (continuously updated)

Test

  • Unified LLM interface and simplified test configuration
  • Expanded accuracy validation (offline test + UC Eval)
  • Added pre-run environment verification

Others

  • Observability: Metrics Optimization
  • Add UCM Logger module

Documentation & community management

  • Add Contributing Guide
  • Add Code of Conduct

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions