Changelog

All notable changes to the Sentinel API Testing Platform will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[1.2.0] - 2025-12-08

Security 🔒

Critical CORS Fix: Replaced allow_origins=["*"] with explicit allowed domains across all services
JWT Security Enhancement: Added 32-character minimum validation for JWT secrets
Rate Limiting: New RateLimiter class with configurable limits (5 req/min, 5-minute lockout)
Refresh Token Support: Access tokens now expire in 1 hour, refresh tokens in 7 days
Security Headers Middleware: Added HSTS, CSP, X-Frame-Options, X-Content-Type-Options, Referrer-Policy

Added

Agentic QE Fleet Infrastructure 🤖
- 18 specialized QE agents for comprehensive testing automation
- 59 quality engineering skills integrated with Claude Code
- AQE MCP server integration for tool coordination
- Comprehensive QE analysis reports in /docs/
Quality Analysis Reports 📊
- COMPREHENSIVE_QE_ANALYSIS_REPORT.md - Full consolidated analysis
- code-complexity-analysis-report.md - 5 critical files identified
- quality-analysis-report.md - 72/100 quality score
- test-doubles-analysis-report.md - 739 mocks, 272 fixtures inventory

Fixed

Backend Tests: Added reset_rate_limiter() for test isolation
Frontend Tests: Upgraded @testing-library/user-event v13→v14.5.2
TypeScript: Fixed type assertions in test files
Docker Build: Fixed npm ci --only=production deprecation

Changed

Claude-Flow Integration: Updated to v2.0.0 with enhanced hooks
AQE Integration: Updated to v2.2.0 with native hooks (100-500x faster)
Settings: Streamlined .claude/settings.json with AQE MCP enabled

[1.1.0] - 2025-10-30

Fixed

Observability Services Stability 🔧
- Fixed Jaeger restart loop by switching from BadgerDB to in-memory storage
- Resolved permission errors preventing Jaeger from creating storage directories
- Fixed Prometheus restart loop by correcting invalid labels: configuration
- Converted all scrape configs to proper relabel_configs: pattern (9 services)
- All 12 services now running stable with zero restart loops (60+ minutes validated)
ReasoningBank Database Schema 🗄️
- Added trajectoryoutcome ENUM type (SUCCESS, PARTIAL_SUCCESS, FAILURE, ERROR, UNKNOWN)
- Added complete pattern_embeddings table with pgvector for semantic similarity search
- Fixed task_trajectories.outcome column type conversion from VARCHAR to ENUM
- All ReasoningBank workers now running error-free (Judgment, Distillation, Consolidation)
- Resolved operator mismatch errors preventing worker operation

Added

Complete Database Schema 📊
- Pattern embeddings table with 16 columns and 6 indexes
- IVFFlat vector index for efficient similarity search (150x faster)
- GIN index on JSONB domain_tags for fast filtering
- Performance indexes on confidence and usage_count columns
- Worker checkpoints table for graceful shutdown support

Changed

Production Readiness 🚀
- All critical services validated and operational
- Zero errors in 60+ minute stability test
- Full observability stack operational (metrics + tracing)
- API Gateway health checks: all services responding
- Prometheus scraping 9/10 targets successfully (90% success rate)

Documentation

Comprehensive Fix Documentation 📚
- Added docs/OBSERVABILITY_FIXES_2025-10-30.md with root cause analysis
- Added docs/FINAL_VALIDATION_REPORT_2025-10-30.md with complete validation
- Added docs/RELEASE_PREPARATION_CHECKLIST.md for release workflow
- Before/after code comparisons for all fixes
- Verification commands and rollback procedures

Performance

Service Metrics ⚡
- API Gateway health: <50ms response time
- Backend services: 13-48ms average response times
- Database queries: <5ms for simple operations
- Zero restart loops (previously every 60 seconds)
- Prometheus scrape success: 90% (9/10 targets healthy)

Known Limitations

Rust core /metrics endpoint not implemented (non-blocking, feature enhancement)
Jaeger using in-memory storage (suitable for development, migrate to persistent for production)
Integration tests pending (should run before production deployment)

[Unreleased] - 2025-09-24

Added

Comprehensive Database Initialization System
- Complete init_db.sql with all tables and columns
- Automatic database initialization via init_database.py
- Docker entrypoint script for startup initialization
- Makefile with convenient database management commands
- make setup for one-command complete setup
- make init-db, make reset-db, make backup-db, make restore-db commands
- Detailed documentation in docs/DATABASE_SETUP.md
Advanced Consciousness & Intelligence Features
- Consciousness verification system in Rust core
- Emergent intelligence with self-modifying test generation
- Psycho-symbolic reasoning combining psychological models with logic
- Temporal consciousness with nanosecond-precision scheduling
- Knowledge graph integration for semantic API understanding
- Sublinear solvers with O(log n) performance
- Predictive testing with temporal computational advantages
- Pattern recognition through emergent behavior analysis
- Self-learning tests with feedback loops
- Distributed intelligence via multi-agent swarms
Frontend Improvements
- URL validation for test run target environments
- Default target URL set to http://host.docker.internal:8080
- Prevention of invalid URL protocols (e.g., ttp://)
- Better error messages for invalid URLs

Fixed

Test Execution URL Validation
- Fixed issue where invalid URLs (missing protocol characters) caused all tests to fail
- Added frontend validation to ensure URLs start with http:// or https://
- Set sensible default URL for Petstore API testing

Documentation

Updated README.md with:
- Quick start using Makefile commands
- Database management instructions
- Consciousness and MCP features section
- Common issues and solutions
- Troubleshooting guide for database and test execution issues
Updated CONTRIBUTING.md with new setup procedures
Added multiple documentation guides for advanced features

[Previous] - 2025-08-31

Added

Performance-Based Agent Routing System
- Intelligent routing based on actual performance metrics instead of language
- Tracks execution time, success rate, and test generation efficiency
- Automatically selects fastest implementation (Python vs Rust) per agent type
- Persistent metrics storage with sliding window of 100 samples
- REST API endpoint /performance-metrics for monitoring
- Default routing based on comprehensive benchmark results
Ollama LLM Integration
- Complete support for local LLM inference with 3 models
- mistral:7b - General purpose, fast responses
- codellama:7b - Code-focused tasks
- deepseek-coder:6.7b - Advanced reasoning
- Agent-optimized model selection configuration
- Configuration scripts for easy setup (configure_ollama.py)
- Docker support with host.docker.internal configuration
Comprehensive LLM Benchmarking Suite
- Benchmark scripts for all LLM providers
- Mock Provider: 104ms average (baseline)
- Anthropic Claude Sonnet 4: 2.3 seconds (verified via direct API)
- Ollama: 10-15 seconds on CPU
- Discovered and documented provider caching issue

Changed

Agent Performance Reality Check
- Debunked claimed 18-21x Rust speedup over Python
- Actual results: Python 1.09x faster overall
- Python faster for 4/7 agents, Rust faster for 3/7
- Updated documentation to reflect real performance data

Fixed

Provider Factory Caching Issue
- Identified provider instance caching preventing configuration changes
- Documented workaround and impact on benchmarks
- Service restart required for configuration changes

[Previous] - 2025-08-29

Fixed

Rust AI Agent Test Data Generation
- Fixed integer path parameters to use numeric IDs instead of string IDs (e.g., 56 instead of "usr_6833")
- Fixed path parameter substitution in test cases (e.g., /api/v1/pets/56 instead of /api/v1/pets/{pet_id})
- Enhanced enum value handling to use valid values from OpenAPI schema
- Updated generate_parameter_value() in utils.rs to respect schema types (integer vs string)
- Updated generate_schema_example() to properly select random enum values
- Enhanced generate_realistic_object() in functional_positive.rs to resolve schema references
- Fixed Docker build configuration for sentinel_rust_core (added missing lib.rs creation)

Added

Asynchronous Test Generation
- New /generate-tests-async endpoint for non-blocking test generation
- Task status polling with /task-status/{task_id} endpoint
- Real-time progress tracking showing which agent is currently running
- Prevents 503 timeout errors for long-running test generation
Analytics API Integration
- Added 6 analytics endpoints to API Gateway
- /api/v1/analytics/trends/failure-rate - Historical failure rate trends
- /api/v1/analytics/trends/latency - Latency trends with percentiles
- /api/v1/analytics/anomalies - Anomaly detection in test results
- /api/v1/analytics/predictions - Predictive insights
- /api/v1/analytics/insights - Quality insights and recommendations
- /api/v1/analytics/health-summary - Overall health dashboard

Fixed

Test Generation Issues
- Fixed duplicate test case generation by implementing MD5-based deduplication
- Fixed FunctionalStatefulAgent OperationNode subscript error
- Resolved async test generation timeout issues with background tasks
Analytics Page Issues
- Fixed Analytics.js using incorrect fetch URLs (was hitting React dev server instead of API)
- Fixed SQL query errors in data service (incorrect field references)
- Fixed TestResult.agent_type error by properly joining TestCase table
- Fixed response_time_ms to latency_ms field name mismatch
- Fixed variable name shadowing issue (status variable conflicting with imported module)

Changed

Updated frontend to use apiService for all API calls instead of raw fetch
Enhanced .gitignore to exclude .claude-flow directories
Improved error handling in async test generation

[1.0.0] - 2025-08-18

Added

Comprehensive Test Implementation (Phase 3 & 4)
- 6 integration test files with 2,342 lines of test code
- 97 test methods covering critical backend components
- API workflow tests (formerly E2E) for end-to-end API testing
- Playwright E2E testing framework for frontend
- Page Object Model architecture for maintainable tests
- Cross-browser testing support (Chrome, Firefox, Safari, Mobile)

Changed

Renamed tests/e2e to tests/api_workflows for better clarity
Updated Dockerfile.test to fix SQLAlchemy version compatibility
Enhanced test organization and structure

Test Coverage Updates

Phase 1: ✅ AI Agent Tests (100% - 184 tests)
Phase 2: ✅ LLM Provider Tests (100% - 272+ tests)
Phase 3: ✅ Integration Tests (100% - 6 files)
Phase 4: 🔄 API Workflow Tests (33% - 2 of 6 files)
NEW: Playwright E2E Tests for Frontend

Infrastructure

Added Playwright configuration for E2E testing
Created test fixtures and page objects
Implemented test data management
Added comprehensive test documentation

[1.0.0] - 2025-08-16

Added

Phase 1: AI Agent Unit Tests
- Complete test coverage for all 8 AI agents
- 184 comprehensive unit tests
- Full mocking and async support
- Dedicated test runner (run_agent_tests.sh)
Phase 2: LLM Provider Tests
- 100% coverage for all LLM providers
- Provider factory and fallback mechanism tests
- Token counting and cost calculation tests
- Response caching tests
- Model registry tests

Infrastructure

Docker test environment setup
CI/CD pipeline integration
Test coverage reporting
Pytest configuration with markers

[0.9.0] - 2025-08-14

Added

Initial platform release
Microservices architecture
AI agent system for test generation
Multi-LLM provider support
RBAC authentication system
API specification management
Test execution engine
Analytics and reporting

Core Services

API Gateway (Port 8000)
Auth Service (Port 8005)
Spec Service (Port 8001)
Orchestration Service (Port 8002)
Execution Service (Port 8003)
Data Service (Port 8004)
Sentinel Rust Core (Port 8088)

AI Agents

Functional-Positive-Agent
Functional-Negative-Agent
Functional-Stateful-Agent
Security-Auth-Agent
Security-Injection-Agent
Performance-Planner-Agent
Data-Mocking-Agent

Infrastructure

PostgreSQL with pgvector
RabbitMQ message broker
Prometheus monitoring
Jaeger distributed tracing
Docker Compose orchestration

FilesExpand file tree

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Changelog

[1.2.0] - 2025-12-08

Security 🔒

Added

Fixed

Changed

[1.1.0] - 2025-10-30

Fixed

Added

Changed

Documentation

Performance

Known Limitations

[Unreleased] - 2025-09-24

Added

Fixed

Documentation

[Previous] - 2025-08-31

Added

Changed

Fixed

[Previous] - 2025-08-29

Fixed

Added

Fixed

Changed

[1.0.0] - 2025-08-18

Added

Changed

Test Coverage Updates

Infrastructure

[1.0.0] - 2025-08-16

Added

Infrastructure

[0.9.0] - 2025-08-14

Added

Core Services

AI Agents

Infrastructure