Generate TransitMatters travel-time benchmarks for rapid transit by devinmatte · Pull Request #85 · transitmatters/mbta-performance

devinmatte · 2026-04-17T20:31:58Z

Summary

Adds chalicelib/benchmarks/tm_benchmarks.py — computes TM-defined travel-time benchmarks per (from, to) stop pair for all rapid-transit lines, writing to s3://tm-mbta-performance/Benchmarks-tm/traveltimes/{Color}.json.
Consumes the existing SlowZones archive (per-day p50 travel times + dwells, back to 2016) as the source of truth — no new LAMP reads, no new raw-event processing.
Monthly Chalice cron (historical p50 barely moves week-to-week) with a new IAM policy scoped to SlowZones/* read + Benchmarks-tm/* write.

How it works

For each rapid-transit line:

Load every adjacent stop-pair traveltime CSV and dwell CSV.
Build a directed graph from adjacency filenames (handles branch splits like JFK → Ashmont/Braintree and the Green Line's shared trunk naturally).
DFS forward from each stop, accumulating a per-day Series of move(current→next) + dwell(current). Origin's dwell is skipped (the dashboard measures travel time from origin departure to destination arrival, which excludes origin dwell but includes every intermediate dwell).
At each reachable destination, take the median of the aligned per-day cumulative sums and ceil to 30s.
Pairs with fewer than 365 aligned service-days fall back to the MBTA benchmark on the dashboard side (no entry written).
When multiple paths reach the same stop (Green Line trunk), keep the minimum benchmark.

Running locally

cd mbta-performance
uv run python -m chalicelib.benchmarks.tm_benchmarks

Reads the SlowZones archive and writes the output JSONs directly. Cheaper than invoking the Lambda; intended for ad-hoc refreshes.

Cost impact

Negligible. Monthly lambda run over ~175 small CSVs per line, ~5–10 minutes total. One new IAM role, one new cron, five small JSON objects written to the existing bucket.

Test plan

Run locally and eyeball Red / Orange / Blue / Mattapan / Green output JSONs for sanity (Davis→Porter ≈ 2m, Davis→Kendall ≈ 9m, etc.)
Verify against a known slow-zone pair — TM benchmark should sit below the current MBTA scheduled time.
Deploy to beta, confirm cron fires on 1st of next month.

Adds a new chalicelib/benchmarks module that builds TM-defined travel-time benchmarks from the existing SlowZones archive (per-day p50 travel times and dwells, back to 2016-01-15). A directed graph built from adjacent stop-pair filenames is walked via DFS from each stop, summing per-day p50 move + dwell series along each path. Intermediate dwells are included; origin dwell is not (matches how t-performance-dash measures travel time: departure at origin to arrival at destination). The TM benchmark per pair is the median of the aligned per-day sums, ceil'd to 30s. Pairs with fewer than 365 aligned service-days are skipped. Output is one small JSON per rapid-transit line at s3://tm-mbta-performance/Benchmarks-tm/traveltimes/{Color}.json. Scheduled monthly (historical p50 barely moves week-to-week). Can also be run locally with AWS creds: `uv run python -m chalicelib.benchmarks.tm_benchmarks`.

Local run across all 5 rapid-transit lines finishes in ~1m 45s with modest memory. 900s / 4096MB was overprovisioned.

devinmatte mentioned this pull request Apr 17, 2026

Use TransitMatters benchmark as replacement for MBTA on travel-time chart transitmatters/t-performance-dash#1157

Open

6 tasks

devinmatte commented Apr 19, 2026

View reviewed changes

Comment thread mbta-performance/.chalice/config.json Outdated

Reduce TM benchmark lambda resources (300s / 1024MB)

b3c5350

Local run across all 5 rapid-transit lines finishes in ~1m 45s with modest memory. 900s / 4096MB was overprovisioned.

devinmatte marked this pull request as ready for review April 19, 2026 22:23

devinmatte requested review from a team, ankoure and hamima-halim as code owners April 19, 2026 22:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate TransitMatters travel-time benchmarks for rapid transit#85

Generate TransitMatters travel-time benchmarks for rapid transit#85
devinmatte wants to merge 2 commits intomainfrom
tm-travel-time-benchmarks

devinmatte commented Apr 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

devinmatte commented Apr 17, 2026

Summary

How it works

Running locally

Cost impact

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant