College Football Data warehouse using dlt pipelines and Supabase Postgres.
- Copy .dlt/secrets.toml.example to .dlt/secrets.toml and add your credentials:
  - CFBD API key from https://collegefootballdata.com/key
  - Supabase Postgres connection string
- Install dependencies:
  pip install -e ".[dev]"
- Run the schemas:
  psql $DATABASE_URL -f src/schemas/001_reference.sql
  # ... additional schema files
- Load data:
  python -m src.pipelines.run --source reference
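
The commands in this README pass `--source`, `--mode`, `--years`, and `--status` to `src.pipelines.run`. The actual entry point is not shown here, so the following is only a minimal sketch of how such flags could be parsed with `argparse`. The function names `build_parser` and `parse_args`, the help texts, and the validation rules are assumptions for illustration, not the project's code.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags used in this README."""
    parser = argparse.ArgumentParser(prog="python -m src.pipelines.run")
    parser.add_argument("--source", help="source to load, e.g. reference or games")
    # Only the two modes that appear in the README; None means "source default".
    parser.add_argument("--mode", choices=["incremental", "backfill"], default=None)
    parser.add_argument("--years", nargs="+", type=int, default=[])
    parser.add_argument("--status", action="store_true", help="report pipeline status")
    return parser


def parse_args(argv):
    """Parse argv and apply two assumed sanity checks."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not args.status and args.source is None:
        parser.error("either --source or --status is required")
    if args.mode == "backfill" and not args.years:
        parser.error("--mode backfill requires --years")
    return args
```

Under these assumptions, `parse_args(["--source", "games", "--mode", "backfill", "--years", "2020", "2021"])` yields `years == [2020, 2021]`, while a backfill without `--years` exits with a usage error.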
# Load reference data (full refresh)
python -m src.pipelines.run --source reference
# Load current season games (incremental)
python -m src.pipelines.run --source games --mode incremental
# Backfill historical data
python -m src.pipelines.run --source games --mode backfill --years 2020 2021 2022
# Check pipeline status
python -m src.pipelines.run --status

- Source: CFBD API (61 endpoints)
- ETL: dlt pipelines with year-based iteration
- Destination: Supabase Postgres (10 schemas, ~35 tables)
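
"Year-based iteration" can be pictured as a generator that requests one season at a time and yields its rows. The sketch below uses only the standard library and takes the fetch function as a parameter. `iter_games_by_year`, `fetch_year`, and the added `season` field are hypothetical names chosen for illustration, not the project's actual resources.

```python
from typing import Callable, Iterable, Iterator


def iter_games_by_year(
    years: Iterable[int],
    fetch_year: Callable[[int], list[dict]],
) -> Iterator[dict]:
    """Yield records season by season, calling fetch_year once per distinct year."""
    # De-duplicate and sort so each season is requested exactly once, oldest first.
    for year in sorted(set(years)):
        for record in fetch_year(year):
            # Tag each row with its season so a later load can merge or replace by year.
            yield {**record, "season": year}
```

With dlt, a generator of this shape would typically be wrapped with `@dlt.resource` and passed to `pipeline.run(...)`. Keeping the fetch function injectable also makes the iteration easy to test without touching the API.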
The CFBD API key is on Tier 3 (75,000 calls/month). A full historical backfill requires ~1,100 calls.
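
To put those numbers in proportion, here is a small arithmetic sketch. The constants come from the figures above, and the helper names `quota_share` and `calls_remaining` are illustrative only.

```python
MONTHLY_QUOTA = 75_000  # Tier 3 limit stated above
BACKFILL_CALLS = 1_100  # approximate cost of a full historical backfill


def quota_share(calls: int, quota: int = MONTHLY_QUOTA) -> float:
    """Fraction of the monthly quota consumed by `calls` requests."""
    return calls / quota


def calls_remaining(calls_used: int, quota: int = MONTHLY_QUOTA) -> int:
    """Requests left this month, never negative."""
    return max(quota - calls_used, 0)


print(f"{quota_share(BACKFILL_CALLS):.1%}")  # 1.5%
print(calls_remaining(BACKFILL_CALLS))       # 73900
```

A full backfill therefore uses roughly 1.5% of the monthly allowance, leaving ample headroom for incremental loads.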