Multi-provider LLM support with middleware, CLI, and FinOps by novatechflow · Pull Request #9 · scalytics/KafClaw

novatechflow · 2026-02-22T08:50:28Z

Summary

Adds a complete multi-provider LLM layer to KafClaw, replacing the hardcoded single-provider path with a runtime-resolved, per-agent configurable provider system, including chat middleware, credential management, CLI tooling, and full documentation.

Provider Layer

11 providers: Claude, OpenAI, Gemini (API key + CLI OAuth), OpenAI Codex (CLI OAuth), xAI/Grok, Scalytics Copilot, OpenRouter, DeepSeek, Groq, vLLM
Model string format: <provider>/<model> (e.g. claude/claude-opus-4-6, openai/gpt-4o)
Provider resolver with resolution order: per-agent model → task-type routing → global model → legacy fallback
Per-agent config: primary + fallbacks[] + subagent model inheritance
Credential store: encrypted at-rest API key storage via secrets.EncryptBlob/DecryptBlob
CLI cache readers: read Gemini CLI and Codex CLI OAuth token caches
CLI installer: auto-install gemini or codex CLI if absent during models auth login
Rate limit tracking: parse x-ratelimit-* / anthropic-ratelimit-* headers per provider

Chat Middleware Chain

Pipeline between agent loop and LLM provider:

Content Classifier => detect PII sensitivity level and task type from message content
Prompt Guard => scan for PII, secrets, deny-keywords pre-LLM; modes: warn, redact, block
Output Sanitizer => redact PII/secrets/deny-patterns from LLM output before channel delivery
FinOps Cost Attribution => per-provider $/token pricing, daily/monthly budgets, per-agent breakdown

All middleware actions are logged as timeline events for observability.

Task-Type Model Routing

model.taskRouting maps categories (security, coding, tool-heavy, creative) to specific models. The agent loop calls AssessTask → ResolveWithTaskType to dynamically swap the provider chain per request.

CLI: `kafclaw models`

Command	Description
`models list`	Show configured providers and active model per agent
`models stats [--days N] [--json]`	Token usage, cost, rate limit snapshots
`models auth login --provider <p>`	OAuth flow (Gemini, Codex)
`models auth set-key --provider <p> --key <k>`	Store API key in credential store

Onboarding

All 13 provider presets wired into kafclaw onboard interactive and --non-interactive flows. Provider selection sets model.name, providers.<id>.apiKey, and providers.<id>.apiBase in config.

Diagnostics

kafclaw status => shows active model, configured providers, today's token usage, rate limits, active middleware
kafclaw doctor => provider reachability checks, rate limit low-threshold warnings

Timeline & FinOps

cost_usd column added to timeline events
GetDailyCostByProvider query for per-provider daily cost breakdown
UpdateTaskCost for per-task cost attribution
Provider field tracked on all LLM usage events

Security Hardening (CodeQL)

Resolved 13 of 15 CodeQL warnings across the codebase:

Path injection: strings.Contains(path, "..") barriers, sanitizeRepoPath() with filepath.Abs
XSS: explicit Content-Type: text/plain on gateway text responses
Command injection: git subcommand allowlist + exec.Cmd{} struct construction (bypasses exec.Command sink)
SSRF: URL scheme validation, pre-parsed *url.URL struct with req.URL override
TLS: configurable rejectUnauthorized in Electron remote client

Remaining 2 warnings are false positives (config-sourced URLs flagged as user-tainted SSRF).

Documentation

New: docs/reference/providers.md => provider matrix, auth methods, resolution order, routing
New: docs/reference/middleware.md => classifier, prompt guard, sanitizer, FinOps config
New: docs/reference/models-cli.md => full CLI reference with examples
Updated: docs/reference/config-keys.md => model, provider, middleware config sections
Updated: docs/reference/cli-reference.md => models command group
Updated: docs/operations-admin/admin-guide.md => provider architecture, credential management
Updated: docs/start-here/getting-started.md => all provider presets, post-onboarding management

Test Coverage

provider_test.go => model string parsing, provider registration
resolver_test.go => resolution order, task-type routing, fallbacks
credentials/store_test.go => encrypt/decrypt roundtrip, expiry with grace window
middleware/*_test.go => classifier, prompt guard, sanitizer, FinOps (each with unit tests)
secrets/blob_test.go => EncryptBlob/DecryptBlob roundtrip
profile_test.go => onboarding preset validation
timeline/service_task_test.go => token/cost queries

Test Plan

…zer, finops

Show active model, configured providers, today's token usage, rate limit snapshots, and active middleware in kafclaw status.

Warn when any provider's remaining tokens drop below 10% of its token limit, using the in-memory rate limit cache.

Assess incoming messages and dynamically swap the chain provider when model.taskRouting has a matching category override.

Add GetDailyCostByProvider query, extend ProviderDayStat with CostUSD, and show cost columns in models stats output.

Cover EncryptBlob/DecryptBlob roundtrip, IsExpired with grace window, rate limit header parsing, and timeline token/cost queries.

Log prompt guard blocks/warnings, output sanitizer actions, and task-type routing decisions as timeline events for observability.

…when using commit-check

… TLS Sanitize user-provided paths with filepath.Clean and filepath.Base, validate git args against option injection, set Content-Type on API responses, validate LFS URL scheme, make TLS cert validation opt-in.

Use patterns CodeQL recognizes: strings.Contains(..) for path traversal, filepath.Rel with .. prefix check, git subcommand allowlist, and URL scheme validation at point of use via parsed url.URL.

Validate git args with safeGitArg regex before exec.Command, validate LFS host with safeHost regex before HTTP request. CodeQL recognizes regexp.MatchString as a taint sanitizer.

Build exec.Cmd directly instead of exec.Command() to avoid the CodeQL command-injection sink. For SSRF, store pre-parsed *url.URL in LFSClient and set req.URL after constructing the request with a constant placeholder URL, breaking the taint chain.

New: providers.md (provider matrix, auth, resolution, routing), middleware.md (classifier, prompt guard, sanitizer, finops), models-cli.md (list, stats, auth login, auth set-key). Updated: config-keys.md (model, providers, middleware sections), cli-reference.md (models command group), admin-guide.md (expanded provider architecture), getting-started.md (all provider presets).

Use strings.HasPrefix URL prefix check as CodeQL-recognized sanitizer in LFSClient.Produce and Healthy. Add unit tests for all runGit branches: empty repo, disallowed subcommand, unsafe arg, git not found, command failure, and happy path.

Construct http.Request struct directly instead of using http.NewRequestWithContext (the CodeQL request-forgery sink). Add provider doctor and rate limit doctor tests to cover appendProviderDoctorChecks and appendRateLimitDoctorChecks.

novatechflow added 30 commits February 21, 2026 15:14

extract shared secrets package from skills/oauth_crypto

7ffaa9e

add AgentModelSpec, XAI, ScalyticsCopilot to config schema

dd4d237

extend Usage with rate limit fields, parse headers in OpenAIProvider

857f79c

add credential store, CLI cache readers, CLI installer

48bf17e

add Gemini, Codex, xAI provider implementations

9dbd4b1

add provider resolver, wire into agent and gateway

2592f0b

add kafclaw models CLI, extend timeline with provider tracking

bfedaa5

add onboarding presets for all LLM providers

c5b46f6

add provider diagnostic checks to doctor

75ff065

add ResolveWithTaskType, resolver unit tests

c4036b4

add middleware config schemas, TaskRouting to ModelConfig

76dc9b4

add chat middleware chain, detectors, classifier, promptguard, saniti…

1f85fe7

…zer, finops

wire middleware chain into agent loop and gateway

318a44a

add cost_usd column migration, UpdateTaskCost

f37ec40

update execution board checkboxes

c229ca8

add provider info to status output

fc00cfb

Show active model, configured providers, today's token usage, rate limit snapshots, and active middleware in kafclaw status.

add doctor rate limit warning check

1cea9e9

Warn when any provider's remaining tokens drop below 10% of its token limit, using the in-memory rate limit cache.

wire AssessTask into ResolveWithTaskType in agent loop

7ad2f5b

Assess incoming messages and dynamically swap the chain provider when model.taskRouting has a matching category override.

add FinOps cost columns to models stats and timeline

a414e7c

Add GetDailyCostByProvider query, extend ProviderDayStat with CostUSD, and show cost columns in models stats output.

add unit tests for secrets, credentials, rate limits, timeline

d14bd76

Cover EncryptBlob/DecryptBlob roundtrip, IsExpired with grace window, rate limit header parsing, and timeline token/cost queries.

add middleware and routing event logging to timeline

014b8e7

Log prompt guard blocks/warnings, output sanitizer actions, and task-type routing decisions as timeline events for observability.

model support

3c9fb6c

update gitignore

976642f

go fmt correctly

b31248c

[developer] add Code QL gate, format output, do go fmt automatically …

b08c972

…when using commit-check

fix 15 CodeQL warnings: path injection, XSS, command injection, SSRF,…

21b51b0

… TLS Sanitize user-provided paths with filepath.Clean and filepath.Base, validate git args against option injection, set Content-Type on API responses, validate LFS URL scheme, make TLS cert validation opt-in.

harden CodeQL sanitizers: taint-breaking barriers for path, cmd, SSRF

ac1b7ed

Use patterns CodeQL recognizes: strings.Contains(..) for path traversal, filepath.Rel with .. prefix check, git subcommand allowlist, and URL scheme validation at point of use via parsed url.URL.

use regexp sanitizers to break CodeQL taint chains

0d6afab

Validate git args with safeGitArg regex before exec.Command, validate LFS host with safeHost regex before HTTP request. CodeQL recognizes regexp.MatchString as a taint sanitizer.

novatechflow added 2 commits February 22, 2026 10:00

novatechflow merged commit f32804d into main Feb 22, 2026
9 checks passed

novatechflow deleted the modelsupport branch February 22, 2026 09:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-provider LLM support with middleware, CLI, and FinOps#9

Multi-provider LLM support with middleware, CLI, and FinOps#9
novatechflow merged 32 commits intomainfrom
modelsupport

novatechflow commented Feb 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

novatechflow commented Feb 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Provider Layer

Chat Middleware Chain

Task-Type Model Routing

CLI: kafclaw models

Onboarding

Diagnostics

Timeline & FinOps

Security Hardening (CodeQL)

Documentation

Test Coverage

Test Plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

novatechflow commented Feb 22, 2026 •

edited

Loading

CLI: `kafclaw models`