An open source agentic build system. Detailed plans go in. A planner sizes the work and shapes the pipeline - small intent runs through a fast path, large intent compiles into a dependency graph of sub-plans that build in parallel across isolated worktrees and merge in topological order. Implementation, blind review, and validation are always separated across agents and stages. The build phase runs in the background while you plan the next thing.
Drive eforge from Claude Code or Pi. Pipeline stages delegate to either the Claude Agent SDK or pi-agent-core - the interface you drive and the harness that executes are independent. A Claude Max subscription covers the SDK path; an OpenAI Codex subscription covers the Pi path, which also reaches 20+ other providers including local models.
The name: E from the Expedition-Excursion-Errand methodology + forge - shaping code from plans.
Status: This is a young project moving fast. Used daily to build real features (including itself), but expect rough edges - bugs are likely, change is expected, and YMMV. Source is public so you can read, learn from, and fork it. Not accepting issues or PRs at this time.
Traditional build systems transform source code into artifacts. An agentic build system transforms specifications into source code - then verifies its own output.
The key insight: a single AI agent writing and reviewing its own code will almost always approve it. Quality requires separation of concerns - distinct agents for planning, building, reviewing, and evaluating.
An agentic build system applies build-system thinking to this multi-agent pipeline:
- Spec-driven - Input is a requirement, not a code edit. The system decides how to implement it.
- Multi-stage pipeline - Planning, implementation, review, and validation are separate stages with separate agents, not one conversation.
- Blind review - The reviewer operates without builder context (see below).
- Dependency-aware orchestration - Large work decomposes into modules with a dependency graph. Plans build in parallel across isolated git worktrees, merging in topological order.
- Adaptive complexity - The system assesses scope and selects the right workflow: a one-file fix doesn't need architecture review, and a cross-cutting refactor shouldn't skip it.
Plan a feature interactively, then hand it off to eforge with /eforge:build. A daemon picks up the plan and runs planning, building, blind review, and validation autonomously. A web monitor (port assigned deterministically per project in the 4567-4667 range) tracks progress, cost, and token usage in real time.
Because the coding agent you drive from and the agent library eforge delegates to are independent, a few ways this plays out:
- Plan and execute on Claude. Drive eforge from Claude Code and delegate to the Claude Agent SDK. Your Claude Max subscription covers both sides.
- Plan and execute on OpenAI. Drive eforge from Pi and delegate to pi-agent-core with GPT. Your OpenAI Codex subscription covers both sides.
- Plan with GPT in Pi, build on Claude with Opus and Sonnet. Drive eforge from Pi for planning, then hand off to the Claude Agent SDK so the build runs against your Max subscription.
- Run builds on local models when subscription limits hit. Switch to a profile that delegates to a local model like Qwen 3.6 27B via pi-agent-core - work keeps moving while quotas reset, with no API spend.
eforge also runs standalone. By default, eforge build enqueues and a daemon processes it. Use --foreground to run in the current process instead.
Formatting and enqueue - eforge accepts input from multiple sources: a CLI prompt, rough notes, a session plan, a playbook, or a detailed PRD file. Playbooks and session plans are reusable input artifacts that the daemon compiles to ordinary build source via @eforge-build/input before reaching the engine queue. The engine always receives normalized build source and does not know whether that source originated from a playbook, session plan, wrapper app, CLI prompt, or PRD file. The normalized PRD is committed to a queue directory on the current branch; the daemon watches this queue and picks up new PRDs to build.
Workflow profiles - The planner assesses complexity and selects a profile:
- Errand - Small, self-contained changes. Passthrough compile, fast build.
- Excursion - Multi-file features. Planner writes a plan, blind review cycle, then build.
- Expedition - Large cross-cutting work. Architecture doc, module decomposition, cohesion review across plans, parallel builds in dependency order.
Blind review - Every build gets reviewed by a separate agent with no builder context. Separating generation from evaluation dramatically improves quality - solo agents tend to approve their own work regardless. A fixer applies suggestions, then an evaluator accepts strict improvements while rejecting intent changes. The goal is fidelity to the plan - minimizing drift and slop so the code that lands is what was specified, not a reinterpretation.
Parallel orchestration - Each plan builds in an isolated git worktree. Expeditions run multiple plans in parallel, merging in topological dependency order. Post-merge validation runs with auto-fix.
Queue and merge - Completed builds merge back to the base branch as merge commits via --no-ff, preserving the full branch history while keeping first-parent history clean. When the next build starts from the queue, the planner re-evaluates against the current codebase - so plans adapt to changes that landed since they were enqueued.
For a deeper look at the engine internals, see the architecture docs. For context on the workflow shift that motivated eforge, see The Handoff.
Prerequisites: Node.js 22+, Claude Code or Pi, and an LLM provider credential - Anthropic API key or Claude subscription for the claude-sdk harness, or a provider-specific API key or OAuth token for the pi harness
Claude Code plugin:
/plugin marketplace add eforge-build/eforge
/plugin install eforge@eforge
/eforge:init
Pi package:
pi install npm:@eforge-build/pi-eforge
/eforge:initAdd -l to pi install if you want to write to project settings (.pi/settings.json) instead of your global Pi settings:
pi install -l npm:@eforge-build/pi-eforgeThe main @eforge-build/eforge npm package is the standalone CLI and daemon runtime. The Pi integration is published separately as @eforge-build/pi-eforge.
The /eforge:init command creates eforge/config.yaml with sensible defaults and adds .eforge/ to your .gitignore. If you already have user-scope profiles in ~/.config/eforge/profiles/, it offers to activate one of those instead of creating a new project profile. Otherwise it walks you through a Quick setup (one harness and model for every tier) or a Mix-and-match flow (different harness, provider, or model per tier). In Claude Code you choose between claude-sdk and pi; in Pi the harness is pinned to pi and you pick from available providers and models. For further customization, run /eforge:config --edit.
The Pi package also provides native interactive commands for agent runtime profile management (/eforge:profile, /eforge:profile-new) and config viewing (/eforge:config) with interactive overlay UX. Both the Claude Code plugin and the Pi extension expose /eforge:plan for structured planning conversations - exploring scope, code impact, architecture, design decisions, documentation, and risks - before handing off to /eforge:build. Both surfaces also expose /eforge:playbook for creating, editing, running, and managing reusable automation playbooks that encode recurring workflows as named, version-controlled templates.
Standalone CLI:
npx @eforge-build/eforge build "Add rate limiting to the API"
npx @eforge-build/eforge build plans/my-feature-prd.md
# Run a saved playbook
npx @eforge-build/eforge play docs-sync
# Manage playbooks
npx @eforge-build/eforge playbook list
npx @eforge-build/eforge playbook run docs-sync --after q-abc
npx @eforge-build/eforge playbook promote tech-debt-sweepOr install globally: npm install -g @eforge-build/eforge
For standalone use, run /eforge:init (in Claude Code or Pi) to create both eforge/config.yaml and an active agent runtime profile under eforge/profiles/<name>.yaml. A profile configures one harness, model, and effort level per build tier (planning → implementation → review → evaluation). A minimal Claude Agent SDK profile looks like:
# eforge/profiles/claude-sdk.yaml
agents:
tiers:
planning:
harness: claude-sdk
model:
id: claude-opus-4-7
effort: high
implementation:
harness: claude-sdk
model:
id: claude-sonnet-4-6
effort: medium
review:
harness: claude-sdk
model:
id: claude-opus-4-7
effort: high
evaluation:
harness: claude-sdk
model:
id: claude-opus-4-7
effort: highFor Pi (OpenAI Codex, OpenRouter, local models, and more), set harness: pi on each tier and add a pi: { provider: <provider> } block. See docs/config.md for the full schema.
eforge supports three config tiers, merged lowest to highest priority:
| Tier | Path | Committed? | Purpose |
|---|---|---|---|
| User | ~/.config/eforge/config.yaml |
No | Cross-project, personal |
| Project | eforge/config.yaml |
Yes | Team-canonical |
| Project-local | .eforge/config.yaml |
No (gitignored) | Dev-personal override, highest priority |
The project-local tier (.eforge/) is automatically gitignored by /eforge:init. Use it for personal tuning that should not be committed - it deep-merges over the project and user tiers. Agent runtime profiles, custom workflow profiles, hooks, MCP servers, and plugins are all configurable. Scope precedence and lookup behavior live in @eforge-build/scopes; reusable input artifact protocols (playbooks, session plans) live in @eforge-build/input. The build engine consumes normalized build source and does not know whether the source originated from a playbook, session plan, wrapper app, CLI prompt, or PRD file. Agent runtime profiles follow the same three-tier pattern: eforge/profiles/ (project scope), ~/.config/eforge/profiles/ (user scope), and .eforge/profiles/ (project-local scope, highest precedence). Playbooks follow the same pattern using eforge/playbooks/, ~/.config/eforge/playbooks/, and .eforge/playbooks/ respectively - higher-precedence tiers shadow lower ones by name. Use eforge playbook list to see available playbooks with their source and shadow chain, eforge playbook run <name> (or the shortcut eforge play <name>) to enqueue one, and eforge playbook edit <name> to modify it in $EDITOR. eforge playbook promote moves a playbook from project-local to project scope (staged for commit); eforge playbook demote moves it back. See docs/config.md and docs/hooks.md.
pnpm build # Bundle all workspace packages
pnpm test # Run unit tests (vitest)
pnpm type-check # Type check without emittingThe eforge plugin uses npx -y @eforge-build/eforge to invoke the CLI. This ensures the plugin works for all users regardless of install method - global install, npx, or local development. The -y flag auto-confirms install prompts, which is required because the MCP server runs headless and cannot prompt interactively.
When developing eforge locally, pnpm build compiles the CLI to dist/cli.js and makes eforge available on PATH via the bin entry in package.json. After making changes to the engine or CLI, rebuild with pnpm build so the daemon picks up the latest code.
To restart the daemon after a local rebuild, use /eforge:restart from Claude Code. This calls the daemon's MCP tool to safely stop and restart, checking for active builds first.
For the eforge repository itself, the /eforge-daemon-restart project-local skill rebuilds from source and restarts the daemon in one step.
See eforge-build/eval for the end-to-end evaluation harness.
eforge is licensed under Apache-2.0.
eforge's harness abstraction allows different AI providers. Each harness carries its own license terms:
-
Claude Agent SDK (
@anthropic-ai/claude-agent-sdk) is proprietary software owned by Anthropic PBC. By using eforge with this harness, you agree to Anthropic's Commercial Terms (API users) or Consumer Terms (Free/Pro/Max users), plus the Acceptable Use Policy. See Anthropic's legal page for details.Note: If you are building a product or service on top of eforge, Anthropic requires API key authentication through Claude Console - OAuth tokens from Free, Pro, or Max plans may not be used for third-party products.
-
Pi harness (
@mariozechner/pi-ai,@mariozechner/pi-agent-core,@mariozechner/pi-coding-agent) - a fully open-source harness alternative supporting 20+ LLM providers (OpenAI, Google, Mistral, Groq, xAI, Bedrock, Azure, OpenRouter, and more). All three packages are MIT licensed from the pi-mono monorepo.
eforge's Apache 2.0 license applies to eforge's own source code. It does not extend to or override the license terms of its dependencies.



