Skip to content

test(eval): add safety invariant check for PR approvals#95

Open
dembrane-sam-bot wants to merge 1 commit into
mainfrom
sam/approval-guardrails-eval
Open

test(eval): add safety invariant check for PR approvals#95
dembrane-sam-bot wants to merge 1 commit into
mainfrom
sam/approval-guardrails-eval

Conversation

@dembrane-sam-bot
Copy link
Copy Markdown
Contributor

What this changes

Adds a structural test in tests/eval/test_structural.py to verify that src/capabilities/github.md explicitly retains the safety boundary forbidding Sam from approving its own PRs or repository.

This test is robust across both the existing main branch and the pending PR #91 branch (it asserts on either of the safety definitions).

Context

Opened in response to daily-maintenance Opus mentor review on 2026-05-27. Loosening prose-only safety rules without corresponding structural gates/invariants is a high-risk failure mode (violating the 'prose explains, runtime enforces' design pattern from self-maintenance.md). This test acts as a structural gate.

Confidence

Confidence: high. The test evaluates and runs cleanly on both the existing main state and the pending PR #91 branch state.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant