Add initial KV Cache benchmark implementation for MLPerf Storage v3 #214
Conversation
This commit introduces a comprehensive KV Cache benchmark suite designed to measure storage system performance under AI/ML inference workloads, specifically targeting Large Language Model (LLM) key-value cache operations.

Key components added:
- Core benchmark scripts (kv-cache.py, kv-cache_sharegpt_replay.py)
- Benchmark wrapper and validation tools (kv-cache-wrapper.sh, validate.sh)
- Comprehensive proposal documentation for MLPerf Storage v3 integration
- README with benchmark overview and usage guidelines

The benchmark simulates realistic LLM inference patterns, including:
- Key-value cache read/write operations
- Mixed sequential and random access patterns
- Multi-threaded concurrent access scenarios
- Conversation-based workload replay using the ShareGPT dataset

This work addresses the growing need to standardize storage performance measurement for AI inference workloads and provides a foundation for the MLPerf Storage v3.0 KV cache benchmark specification.
MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅
FileSystemGuy left a comment:
Initial version being imported.
wvaske left a comment:
We will turn comments into issues once this is merged.
```python
total_prob = sum(chunk_probabilities)
chunk_probabilities = [p / total_prob for p in chunk_probabilities]
retrieved_indices = np.random.choice(
```
TODO: Add support for different random distributions (random, uniform, zipfian)
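The TODO above could be addressed by selecting the probability vector passed to `np.random.choice` based on a distribution name. A minimal sketch, assuming hypothetical names (`sample_chunks`, `num_chunks`, `zipf_s`) that are not part of the benchmark code:

```python
import numpy as np

def sample_chunks(num_chunks, count, distribution="uniform", zipf_s=1.1, rng=None):
    """Draw `count` distinct chunk indices under the given access distribution."""
    rng = rng or np.random.default_rng()
    if distribution == "uniform":
        # Every chunk is equally likely.
        probs = np.full(num_chunks, 1.0 / num_chunks)
    elif distribution == "zipfian":
        # Rank-based Zipf weights: low-index chunks are "hot".
        ranks = np.arange(1, num_chunks + 1)
        probs = 1.0 / ranks ** zipf_s
        probs /= probs.sum()  # normalize, as in the snippet above
    else:
        raise ValueError(f"unknown distribution: {distribution}")
    return rng.choice(num_chunks, size=count, replace=False, p=probs)
```

A Zipfian option is a common choice here because real KV cache reuse is typically skewed toward a small set of hot prefixes rather than uniform.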
```python
# --- Tiering Logic ---
# Decide which tier to write to based on available memory.
with self.memory_lock:
```
New KVs should be written to the top layer and trigger eviction from a tier if sufficient space doesn't exist.
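The policy suggested above (always write to the top tier, evicting older entries downward when space runs out) can be sketched as an LRU tier with demotion. The class and field names below are illustrative assumptions, not the benchmark's actual implementation:

```python
from collections import OrderedDict

class TopTierCache:
    """Writes always land in this tier; least-recently-written entries
    are demoted to the next tier down when capacity is exceeded."""

    def __init__(self, capacity_bytes, lower_tier=None):
        self.capacity = capacity_bytes
        self.used = 0
        self.entries = OrderedDict()  # key -> size, oldest first
        self.lower_tier = lower_tier

    def put(self, key, size):
        # Evict (demote) oldest entries until the new KV block fits.
        while self.used + size > self.capacity and self.entries:
            old_key, old_size = self.entries.popitem(last=False)
            self.used -= old_size
            if self.lower_tier is not None:
                self.lower_tier.put(old_key, old_size)
        self.entries[key] = size
        self.used += size
```

This keeps the top tier as the single write target, with eviction pressure cascading down the hierarchy, rather than routing new writes to whichever tier happens to have free space.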