
Conversation

@kaixuanliu (Contributor)

No description provided.

@Rocketknight1 (Member)

cc @IlyasMoutawwakil maybe?

@kaixuanliu kaixuanliu marked this pull request as draft December 23, 2025 01:23
@kaixuanliu kaixuanliu marked this pull request as ready for review December 23, 2025 09:16
Signed-off-by: Liu, Kaixuan <[email protected]>
@kaixuanliu (Contributor, Author)

@IlyasMoutawwakil Hi, please help review when you are available, thanks!

type=str,
default="cuda",
help="Device to run benchmarks on (cuda, xpu, cpu). If not specified, will auto-detect.",
)
Review comment (Contributor):

Inconsistent: the help text says the device will be auto-detected if not specified, but the default is set to `"cuda"` rather than `None`/auto, so auto-detection never runs.
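One way to resolve the inconsistency is to default the flag to `None` and only then auto-detect. A minimal sketch, assuming a hypothetical `detect_device` helper (not part of the PR) and falling back to CPU when torch is unavailable:

```python
import argparse


def detect_device() -> str:
    # Hypothetical helper: pick the first available accelerator, else "cpu".
    try:
        import torch

        if torch.cuda.is_available():
            return "cuda"
        if hasattr(torch, "xpu") and torch.xpu.is_available():
            return "xpu"
    except ImportError:
        pass
    return "cpu"


parser = argparse.ArgumentParser()
parser.add_argument(
    "--device",
    type=str,
    default=None,  # None (not "cuda") so the help text stays truthful
    help="Device to run benchmarks on (cuda, xpu, cpu). If not specified, will auto-detect.",
)
args = parser.parse_args([])
device = args.device if args.device is not None else detect_device()
```

With `default=None`, an explicit `--device` always wins and auto-detection only fills the gap.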

self.logger = logger if logger is not None else logging.getLogger(__name__)
self.device_type = device_type

# Detect available accelerators
Review comment (Contributor):

You mean "detect the number of available accelerators"?

device_type = None
if hasattr(torch, "cuda") and torch.cuda.is_available():
device_type = "cuda"
elif hasattr(torch, "xpu") and torch.xpu.is_available():
Review comment (Contributor):

Use `is_torch_xpu_available` from transformers.
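A hedged sketch of that suggestion: transformers exposes `is_torch_xpu_available` in `transformers.utils`, which wraps the version and backend checks; the fallback below (mirroring the original `hasattr` check) is only there so the snippet runs when transformers is not installed:

```python
try:
    from transformers.utils import is_torch_xpu_available
except ImportError:
    # Fallback when transformers is unavailable: mirror the original check.
    def is_torch_xpu_available() -> bool:
        try:
            import torch

            return hasattr(torch, "xpu") and torch.xpu.is_available()
        except ImportError:
            return False


if is_torch_xpu_available():
    device_type = "xpu"
else:
    device_type = None  # fall through to the cuda/cpu branches
```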


def __init__(self) -> None:
self.device_name, self.device_memory_total = get_device_name_and_memory_total()
def __init__(self, device_type: str = "cuda") -> None:
Review comment (Contributor):

Default to `None`?

self.gpu_name, self.gpu_memory_total_gb = get_device_name_and_memory_total()
# Auto-detect device type
device_type = None
if hasattr(torch, "cuda") and torch.cuda.is_available():
Review comment (Contributor):

I suppose `torch.cuda.is_available()` is enough; `torch.cuda` always exists, so the `hasattr` check is redundant.

Args:
device_type: The type of device to query ('cuda', 'xpu', etc.)
"""
Review comment (Contributor):

Seems `device_type` is unnecessary; just detect the available device and use a `torch_accelerator_module`, like:

```python
device_type = torch.accelerator.current_accelerator().type if is_torch_accelerator_available() else "cuda"

torch_accelerator_module = getattr(torch, device_type, torch.cuda)

torch_accelerator_module.get_device_properties(0).name
...
```
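A runnable sketch of that pattern, with guards so it degrades gracefully: it assumes PyTorch >= 2.6 for the `torch.accelerator` API and falls back to `"cpu"` on older versions or when torch is absent (the bare fallback in the reviewer's one-liner assumes an accelerator machine):

```python
try:
    import torch

    # torch.accelerator landed in PyTorch 2.6; guard for older versions.
    if hasattr(torch, "accelerator") and torch.accelerator.is_available():
        device_type = torch.accelerator.current_accelerator().type
    else:
        device_type = "cpu"
    # Resolve the backend module (torch.cuda, torch.xpu, ...) by name.
    torch_accelerator_module = getattr(torch, device_type, None)
except ImportError:
    device_type, torch_accelerator_module = "cpu", None
```

This removes the per-backend `if/elif` chains: every later call site goes through `torch_accelerator_module` instead of hard-coding `torch.cuda`.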

f"Time generate done in {e2e_latency:.2f} seconds. Memory usage: {torch.cuda.memory_allocated() / 1024**2:.2f} MB"
)

# Get memory usage based on device type
Review comment (Contributor):

Just use `device_type` to get the `torch_accelerator_module`?
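That would make the hard-coded `torch.cuda.memory_allocated()` in the log line device-agnostic. A minimal sketch, assuming a hypothetical `memory_allocated_mb` helper (not in the PR) that returns 0.0 when torch or the requested backend is unavailable:

```python
def memory_allocated_mb(device_type: str) -> float:
    # Hypothetical helper: report allocated memory for the given backend
    # ("cuda", "xpu", ...) in MB, or 0.0 if it cannot be queried.
    try:
        import torch
    except ImportError:
        return 0.0
    module = getattr(torch, device_type, None)
    if module is None or not hasattr(module, "memory_allocated"):
        return 0.0
    return module.memory_allocated() / 1024**2
```

The log line would then read `f"... Memory usage: {memory_allocated_mb(device_type):.2f} MB"` regardless of backend.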

"""Profile the latency of a call to model.generate() with the given (inputs) and (max_new_tokens)."""
# Build activities list based on available devices
activities = [torch.profiler.ProfilerActivity.CPU]
if hasattr(torch, "xpu") and torch.xpu.is_available():
Review comment (Contributor):

Use `is_torch_xpu_available` from transformers here as well.

@kaixuanliu
Copy link
Contributor Author

Close this PR first.

@kaixuanliu kaixuanliu closed this Jan 6, 2026


3 participants