feat(benchmarks): add LIBERO training benchmark pipeline by pkooij · Pull Request #3337 · huggingface/lerobot

pkooij · 2026-04-09T13:33:56Z

Summary

Add a complete benchmark pipeline to train and evaluate all 9 LeRobot policies (pi0, pi0_fast, pi05, groot, act, diffusion, smolvla, xvla, multi_task_dit) on LIBERO
Generates SLURM sbatch scripts for cluster submission (train + eval + GPU monitoring per policy)
Publishes results as a HuggingFace leaderboard dataset with full training metadata (steps, BS, LR, GPU memory, training time, eval success rates, etc.)
Handles pi0_fast tokenizer pre-training as a dependency job

Test plan

Dry-run generation of all 9 policy scripts verified locally
Run single-policy (ACT) on cluster to validate end-to-end flow
Run collect_results.py on completed outputs
Verify HuggingFace dataset push with correct schema

🤖 Generated with Claude Code

Single-script benchmark that trains and evaluates all 9 LeRobot policies on LIBERO. Each SLURM job self-publishes its result row to a HuggingFace leaderboard dataset — no separate collection step needed. Policies: pi0, pi0_fast, pi05, groot, act, diffusion, smolvla, xvla, multi_task_dit. 5000 steps, BS 256, with per-policy GPU allocation and default LR/scheduler presets. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

github-actions bot added documentation Improvements or fixes to the project’s docs CI Issues related to the continuous integration pipeline github_actions labels Apr 9, 2026

pkooij force-pushed the feat/libero-benchmark branch from 3c74bf6 to fd00e38 Compare April 9, 2026 15:01

github-actions bot removed CI Issues related to the continuous integration pipeline github_actions labels Apr 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(benchmarks): add LIBERO training benchmark pipeline#3337

feat(benchmarks): add LIBERO training benchmark pipeline#3337
pkooij wants to merge 1 commit intomainfrom
feat/libero-benchmark

pkooij commented Apr 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pkooij commented Apr 9, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant