RL training and real-robot deployment for the SO-101 arm — picks up a Jenga block using a projection-based policy trained entirely in simulation.
```bash
git clone [email protected]:ggand0/pick-101.git
cd pick-101
git submodule update --init --recursive
uv sync
```

Assets (STL meshes) are stored via Git LFS and pulled automatically. Alternatively, copy them from SO-ARM100:

```bash
cp -r SO-ARM100/Simulation/SO101/assets models/so101/
```

Hardware checklist:

- SO-101 arm connected via USB
- Printed calibration board (`make_aruco_board.py`) — flat on the table
- Printed Jenga block tag (`make_jenga_tag.py`) — ID 101, attached to the top face of the Jenga block
- Overhead camera pointing down at the board
- Robot base tip positioned at the bottom-centre of the board
```bash
# Calibration board (place flat on table, robot base at bottom-centre)
uv run python make_aruco_board.py      # → aruco_board.pdf

# Jenga block tag — ID 101, sized to fit the 25×75mm block face (use this one)
uv run python make_jenga_tag.py        # → jenga_tag.pdf

# (Optional) Full sheet of general-purpose bordered tags, IDs 100–150
uv run python make_bordered_tags.py    # → bordered_tags.pdf
```

Print `jenga_tag.pdf` at 100% scale (no fit-to-page scaling) and attach it to the top face of the Jenga block.
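For reference, the printable tags are standard ArUco markers. Below is a minimal sketch of rendering one with OpenCV; the dictionary and pixel size are illustrative assumptions, not necessarily what `make_jenga_tag.py` uses:

```python
import cv2

# Assumed dictionary; check make_jenga_tag.py for the one actually used.
dictionary = cv2.aruco.getPredefinedDictionary(cv2.aruco.DICT_4X4_250)

# Render marker ID 101 (the Jenga block tag) at 600×600 px (OpenCV >= 4.7).
marker = cv2.aruco.generateImageMarker(dictionary, 101, 600)
cv2.imwrite("jenga_tag_101.png", marker)
```

The physical tag size must match what the detection code assumes, which is why the PDFs should be printed at 100% scale.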
The best trained model is stored in `best_run/` and committed to git. Restore it into `runs/` so the scripts can find it:

```bash
uv run python export_best_run.py restore
```

Joint calibration maps real robot joint readings to simulation joint angles and produces `calibration/joint_calibration.json` (a usage sketch follows the controls list below).
```bash
# Full calibration (all 6 joints)
uv run python calibrate_joints_real.py --port /dev/tty.usbmodem5A680089441

# Single joint only (e.g. after re-mounting the wrist)
uv run python calibrate_joints_real.py --port /dev/tty.usbmodem5A680089441 --joint wrist_roll
```

Controls during calibration:

- Move the arm by hand to match the reference image shown
- `SPACE` — record this position
- `S` — skip this point
- `Q` — quit
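The calibration JSON pairs recorded real servo readings with their simulation angles. A minimal sketch of how such a mapping might be applied, assuming a per-joint list of real/sim sample pairs; the actual schema may differ, so check the generated file (or `src/robot/joint_calibration.py`):

```python
import json

import numpy as np

# Load the calibration produced by calibrate_joints_real.py.
with open("calibration/joint_calibration.json") as f:
    calib = json.load(f)

def real_to_sim(joint: str, real_angle: float) -> float:
    """Map a real servo reading to a sim joint angle by piecewise-linear
    interpolation over the recorded sample pairs (assumed schema:
    calib[joint] = [{"real": ..., "sim": ...}, ...])."""
    samples = calib[joint]
    real = np.array([s["real"] for s in samples])
    sim = np.array([s["sim"] for s in samples])
    order = np.argsort(real)  # np.interp requires ascending x values
    return float(np.interp(real_angle, real[order], sim[order]))
```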
Position calibration corrects for camera angle so the detected block position matches its true location on the board (a sketch of the underlying computation follows these steps).

```bash
uv run python tests/test_aruco_homography_3d.py
```

- Press `C` to start calibration
- Place the Jenga block (tag ID 101) at each of the 4 interior corners in order: TL → TR → BL → BR
- The block's physical corner should touch the interior corner each time
- Keep block orientation consistent across all 4 corners
- Press `SPACE` at each position to record it
- After 4 corners, the script prints `DELTA_X`/`DELTA_Y` values

Paste the printed values into the `DELTA_X` / `DELTA_Y` constants at the top of `run_real_t1.py` and `visualize_irl_block.py`.
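Conceptually, the correction is a planar homography from camera pixels to board millimetres plus the constant `DELTA_X`/`DELTA_Y` offset. A minimal sketch, assuming OpenCV's ArUco detector and a known board-mm position for each border tag; the actual script (`tests/test_aruco_homography_3d.py`) is more involved, so treat this only as the core idea:

```python
import cv2
import numpy as np

DELTA_X, DELTA_Y = 0.0, 0.0  # mm offsets from the calibration above

detector = cv2.aruco.ArucoDetector(
    cv2.aruco.getPredefinedDictionary(cv2.aruco.DICT_4X4_250),  # assumed dictionary
    cv2.aruco.DetectorParameters(),
)

def block_position_mm(frame, board_mm_by_id):
    """Return the block's (x, y) on the board in mm, or None if not visible.
    `board_mm_by_id` maps border-tag IDs to their known board coordinates."""
    corners, ids, _ = detector.detectMarkers(frame)
    if ids is None:
        return None

    # Pair each detected border tag's pixel centre with its known board position.
    img_pts, brd_pts, block_px = [], [], None
    for tag_id, c in zip(ids.flatten(), corners):
        centre = c[0].mean(axis=0)
        if tag_id == 101:                  # the Jenga block tag
            block_px = centre
        elif tag_id in board_mm_by_id:     # border tags at known positions
            img_pts.append(centre)
            brd_pts.append(board_mm_by_id[tag_id])
    if block_px is None or len(img_pts) < 4:
        return None

    # Homography: camera pixels → board mm, then apply the calibrated offset.
    H, _ = cv2.findHomography(np.array(img_pts, np.float32),
                              np.array(brd_pts, np.float32))
    xy = cv2.perspectiveTransform(block_px.reshape(1, 1, 2).astype(np.float32), H)[0, 0]
    return xy[0] + DELTA_X, xy[1] + DELTA_Y
```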
Fine-tuning manually:
If the corrected position (green dot in the rectified panel) is still slightly off, adjust DELTA_X / DELTA_Y (mm) directly:
| Green dot position | Adjustment |
|---|---|
| Too far right | decrease DELTA_X |
| Too far left | increase DELTA_X |
| Too far down | decrease DELTA_Y |
| Too far up | increase DELTA_Y |
Copy the final values into run_real_t1.py and visualize_irl_block.py.
```bash
uv run python visualize_irl_block.py --camera 0
```

Place the Jenga block on the board and verify the 4 panels:
| Panel | Shows |
|---|---|
| 1 — Camera | Raw camera feed with detected tags |
| 2 — Sim side | Simulation side view with block at detected position |
| 3 — Top-down (homography) | Rectified board view; green dot = corrected block position |
| 4 — Sim top-down | Simulation top-down; block should match panel 3 |
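Panel 3's rectified view is a standard warp of the camera frame through the board homography. A minimal sketch, reusing the camera-pixel → board-mm homography `H` from the earlier sketch and assuming the bottom-centre board origin described at the end of this README:

```python
import cv2
import numpy as np

PX_PER_MM = 4  # arbitrary rendering scale

def rectify_board(frame, H):
    """Warp the raw camera frame into a top-down view of the 180×180mm board.
    H maps camera pixels to board mm (origin at bottom-centre, x in [-90, 90])."""
    shift = np.array([[1.0, 0.0, 90.0],   # move the board origin to a corner
                      [0.0, 1.0, 0.0],
                      [0.0, 0.0, 1.0]])
    scale = np.diag([PX_PER_MM, PX_PER_MM, 1.0])
    size = int(180 * PX_PER_MM)
    return cv2.warpPerspective(frame, scale @ shift @ H, (size, size))
```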
```bash
uv run python run_real_t1.py --no-robot --camera 0
```

Verify all 5 panels look correct before connecting the robot.

```bash
uv run python run_real_t1.py --port /dev/tty.usbmodem5A680089441
```

Video is saved to `real_t1_run.mp4`.
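The committed artifacts (`best_model.zip` plus `vec_normalize.pkl`) follow the Stable-Baselines3 layout. A minimal inference sketch; the algorithm class and the environment import are illustrative assumptions, and `run_real_t1.py` remains the authoritative loader:

```python
from stable_baselines3 import PPO  # assumed algorithm; use the one from training
from stable_baselines3.common.vec_env import DummyVecEnv, VecNormalize

# Hypothetical class name; the env lives in src/envs/lift_cube_projection.py.
from src.envs.lift_cube_projection import LiftCubeProjectionEnv

# Restore the observation-normalization statistics saved during training.
venv = DummyVecEnv([lambda: LiftCubeProjectionEnv()])
venv = VecNormalize.load("best_run/vec_normalize.pkl", venv)
venv.training = False      # freeze running stats at inference time
venv.norm_reward = False

model = PPO.load("best_run/best_model/best_model.zip")

obs = venv.reset()
for _ in range(200):
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, done, info = venv.step(action)
```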
best_run/ is committed to git. runs/ is in .gitignore.
```bash
# Export best model from runs/ into best_run/ (then commit)
uv run python export_best_run.py export

# Restore best_run/ back into runs/ (after fresh clone)
uv run python export_best_run.py restore
```

Training and evaluation:

```bash
# Train and evaluate the T1 projection policy
uv run python train_lift_projection.py --config configs/state_based/curriculum_stage3.yaml
uv run python eval_projection.py --run runs/lift_proj_t1_s3/<timestamp>

# Cartesian variant
uv run python train_lift.py --config configs/state_based/curriculum_stage3.yaml
uv run python eval_cartesian.py --run runs/lift_curriculum_s3/<timestamp>

# Plot learning curves
uv run python plot_learning_curves.py --run runs/lift_proj_t1_s3/<timestamp>
```

Repository layout:

```
models/so101/                       # MuJoCo robot models
├── lift_cube.xml # Main scene
├── so101_new_calib.xml # Robot with finger pads
└── assets/ # STL meshes (Git LFS)
src/
├── envs/
│ ├── lift_cube.py # Cartesian gym environment
│ └── lift_cube_projection.py # Projection-obs environment (used by T1 policy)
├── robot/
│ ├── real_robot.py # SO-101 hardware interface
│ └── joint_calibration.py # Real→sim joint angle mapping
└── training/
└── train_image_rl.py # DrQ-v2 image-based training
configs/state_based/ # Training configs
calibration/ # Generated after running Step 3
└── joint_calibration.json
best_run/ # Committed trained model
├── best_model/best_model.zip
├── vec_normalize.pkl
└── config.yaml
# Key scripts (root)
run_real_t1.py # Real-robot runner (Steps 6–7)
visualize_irl_block.py # Block detection test (Step 5)
tests/test_aruco_homography_3d.py # Homography viewer + position calibration (Step 4)
calibrate_joints_real.py # Joint mapping calibration (Step 3)
export_best_run.py # Export / restore trained model (Step 2)
make_aruco_board.py # Generate printable calibration board (Step 1)
make_jenga_tag.py # Generate printable Jenga block tag ID 101 (Step 1)
make_bordered_tags.py # Generate full sheet of general-purpose bordered tags
train_lift_projection.py # Train T1 policy
eval_projection.py                  # Evaluate T1 policy
```
The board is a 180×180mm square with ArUco border tags. The interior working area is 112×112mm. The robot base tip sits at the bottom-centre of the board.
```
┌─────────────────────────┐ ← 180mm
│   [border ArUco tags]   │
│  ┌───────────────────┐  │
│  │                   │  │
│  │   interior area   │  │
│  │   112 × 112 mm    │  │
│  │                   │  │
│  └───────────────────┘  │
│     ▲ robot base ▲      │
└─────────────────────────┘
```
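Since the 112×112mm interior sits inside the 180×180mm board, the tag border is (180 - 112) / 2 = 34mm on each side. A small sketch of the interior-corner coordinates, assuming a board frame with its origin at the robot base tip (bottom-centre), x to the right and y away from the robot; verify against the frame the scripts actually use:

```python
# Board-frame assumption: origin at the robot base tip (bottom-centre),
# x to the right, y away from the robot, units in mm.
BOARD_MM = 180.0
INTERIOR_MM = 112.0
BORDER_MM = (BOARD_MM - INTERIOR_MM) / 2  # 34 mm of tag border on each side

# The 4 interior corners visited during position calibration (Step 4).
INTERIOR_CORNERS_MM = {
    "TL": (-INTERIOR_MM / 2, BORDER_MM + INTERIOR_MM),  # (-56, 146)
    "TR": (+INTERIOR_MM / 2, BORDER_MM + INTERIOR_MM),  # (+56, 146)
    "BL": (-INTERIOR_MM / 2, BORDER_MM),                # (-56,  34)
    "BR": (+INTERIOR_MM / 2, BORDER_MM),                # (+56,  34)
}
```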