truth-bot 🔍

Automated political rhetoric fact-checker. Feed it a transcript; get back a structured, scored, shareable fact-check report.

What It Does

Ingest — Accepts speech transcripts as text, files, or URLs. Normalizes format and extracts metadata (speaker, date, venue).
Extract — Uses an LLM (Anthropic Claude) to decompose rhetoric into atomic, verifiable claims.
Verify — Checks each claim against credible sources: government data APIs (BLS, FRED, Census, CBO), Brave Search, and existing fact-check databases (PolitiFact, FactCheck.org).
Score — Assigns each claim a verdict from the taxonomy below, with a confidence level and supporting evidence.
Publish — Generates an HTML report, Bluesky thread, RSS feed entry, and JSON API response.

Verdict Taxonomy

Verdict	Meaning
`True`	Accurate and supported by primary sources
`Mostly True`	Accurate but missing nuance or context
`Misleading`	Technically accurate framing that implies something false
`Exaggerated`	Directionally correct but overstated
`False`	Contradicted by credible evidence
`Unverifiable`	Insufficient evidence to confirm or deny

Confidence levels: High / Medium / Low
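
One way to keep verdict labels consistent across the pipeline is to model the taxonomy as enums. The sketch below is illustrative only: the names `Verdict`, `Confidence`, `ScoredClaim`, and `parse_verdict` are assumptions for this example, and the actual Pydantic models in `models.py` may be shaped differently.

```python
from dataclasses import dataclass
from enum import Enum


class Verdict(str, Enum):
    """The six verdicts from the taxonomy table above."""
    TRUE = "True"
    MOSTLY_TRUE = "Mostly True"
    MISLEADING = "Misleading"
    EXAGGERATED = "Exaggerated"
    FALSE = "False"
    UNVERIFIABLE = "Unverifiable"


class Confidence(str, Enum):
    HIGH = "High"
    MEDIUM = "Medium"
    LOW = "Low"


@dataclass(frozen=True)
class ScoredClaim:
    """Hypothetical container pairing a claim with its verdict."""
    text: str
    verdict: Verdict
    confidence: Confidence


def parse_verdict(label: str) -> Verdict:
    """Map a free-form label (for example from an LLM response) to a Verdict.

    Collapses whitespace and normalizes case; raises ValueError when the
    label is not part of the taxonomy.
    """
    normalized = " ".join(label.split()).title()
    return Verdict(normalized)
```

Rejecting unknown labels with a `ValueError` keeps an off-taxonomy model response from silently reaching a published report.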

Architecture

Transcript → [Ingest] → [Extract Claims] → [Verify per Claim]
                                               ↓
                                    [Score + Rubric] → [Cache]
                                               ↓
                              [Publish: Web / Bluesky / RSS / API]
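
The flow in the diagram can be sketched as a small orchestrator that takes each stage as a callable. This is a minimal sketch under stated assumptions, not the project's `pipeline.py`: `ClaimResult`, `cache_key`, and `run_pipeline` are hypothetical names, the cache is a plain dict, and checking the cache before verification is an assumption about where deduplication happens.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ClaimResult:
    """Hypothetical per-claim record produced by the scoring stage."""
    claim: str
    verdict: str
    confidence: str
    evidence: list[str] = field(default_factory=list)


def cache_key(claim: str) -> str:
    """Hash a case- and whitespace-normalized claim for deduplication."""
    normalized = " ".join(claim.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def run_pipeline(
    transcript: str,
    extract: Callable[[str], list[str]],
    verify: Callable[[str], list[str]],
    score: Callable[[str, list[str]], tuple[str, str]],
    cache: dict[str, ClaimResult],
) -> list[ClaimResult]:
    """Ingest -> extract -> verify -> score, reusing cached results."""
    results = []
    for claim in extract(transcript.strip()):  # strip() stands in for ingest
        key = cache_key(claim)
        if key not in cache:
            # Only claims not seen before are verified and scored.
            evidence = verify(claim)
            verdict, confidence = score(claim, evidence)
            cache[key] = ClaimResult(claim, verdict, confidence, evidence)
        results.append(cache[key])
    return results
```

Passing the stages in as callables lets stub implementations stand in for the LLM and the source connectors during testing; the returned list is what a publish step would render.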

Source trust hierarchy, from most to least trusted:

Government primary data (BLS, FRED, CBO, Census)
Wire services (AP, Reuters)
Established outlets (NYT, WaPo, BBC)
Academic / NGO
Other
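
The hierarchy above can be expressed as an ordered enum so that evidence is ranked by tier. The following is a hedged sketch: `SourceTier`, `TIER_BY_SOURCE`, `tier_for`, and `rank_sources` are hypothetical names, and the lookup table covers only the sources named in this README. Academic and NGO sources have no named examples here, so a real connector would need its own way to classify them.

```python
from enum import IntEnum


class SourceTier(IntEnum):
    """Lower value means more trusted."""
    GOVERNMENT_PRIMARY = 1
    WIRE_SERVICE = 2
    ESTABLISHED_OUTLET = 3
    ACADEMIC_NGO = 4
    OTHER = 5


# Only the sources named in the hierarchy above; anything else is OTHER.
TIER_BY_SOURCE = {
    "BLS": SourceTier.GOVERNMENT_PRIMARY,
    "FRED": SourceTier.GOVERNMENT_PRIMARY,
    "CBO": SourceTier.GOVERNMENT_PRIMARY,
    "Census": SourceTier.GOVERNMENT_PRIMARY,
    "AP": SourceTier.WIRE_SERVICE,
    "Reuters": SourceTier.WIRE_SERVICE,
    "NYT": SourceTier.ESTABLISHED_OUTLET,
    "WaPo": SourceTier.ESTABLISHED_OUTLET,
    "BBC": SourceTier.ESTABLISHED_OUTLET,
}


def tier_for(source: str) -> SourceTier:
    """Return the trust tier for a source name, defaulting to OTHER."""
    return TIER_BY_SOURCE.get(source, SourceTier.OTHER)


def rank_sources(sources: list[str]) -> list[str]:
    """Order sources from most to least trusted; ties keep input order."""
    return sorted(sources, key=tier_for)
```

For example, `rank_sources(["BBC", "blog.example", "FRED", "Reuters"])` puts `FRED` first and the unrecognized `blog.example` last.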

Setup

Requirements

Python 3.11+
API keys (see below)

Install

git clone git@github.com:jackiemclean/truth-bot.git
cd truth-bot
python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"

Environment Variables

Copy .env.example to .env and fill in your keys:

cp .env.example .env

Variable	Required	Description
`ANTHROPIC_API_KEY`	✅	Claude API key for claim extraction + verdict synthesis
`BRAVE_API_KEY`	✅	Brave Search API for web evidence gathering
`BLUESKY_HANDLE`	optional	Your Bluesky handle (e.g. `yourname.bsky.social`)
`BLUESKY_APP_PASSWORD`	optional	Bluesky app password (not your main password)
`FRED_API_KEY`	optional	FRED (Federal Reserve) economic data API

Run

# Check a transcript file
truthbot --transcript speech.txt --speaker "Speaker Name" --date 2025-01-20

# Or pipe text
echo "Unemployment is at a 50-year low." | truthbot --speaker "Politician"

# Output formats
truthbot --transcript speech.txt --output html --output-dir ./reports/
truthbot --transcript speech.txt --post-bluesky

Development

pytest -v                    # Run all tests
ruff check src/ tests/       # Lint
black src/ tests/            # Format

Project Structure

src/truthbot/
├── config.py          — Settings from environment variables
├── models.py          — Pydantic data models (Transcript, Claim, Evidence, Verdict, Report)
├── pipeline.py        — End-to-end orchestrator
├── ingest/            — Transcript ingestion and normalization
├── extract/           — LLM-powered claim extraction
├── verify/            — Evidence gathering and verdict synthesis
│   └── sources/       — Pluggable source connectors
├── scoring/           — Verdict rubric and confidence scoring
├── cache/             — Claim deduplication and caching
└── publish/           — Output: HTML, Bluesky, RSS, JSON API

Status

🚧 Alpha: the core architecture is in place, but LLM integration and the source connectors are still stubs awaiting implementation.

License

MIT
