📊 Lock File Statistics Report - November 10, 2025 #3555
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 week ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - November 10, 2025
This comprehensive statistical analysis examines all 77 lock files (.lock.yml) in the githubnext/gh-aw repository, revealing usage patterns, structural characteristics, and best practices for GitHub's agentic workflows.
Executive Summary
The gh-aw repository contains 77 agentic workflow lock files representing a diverse ecosystem of automated agents. These workflows total 16.09 MB with an average size of 214 KB, demonstrating consistent structure and maturity. The analysis reveals strong patterns in trigger usage (40% use scheduled triggers), job architecture (5.4 jobs per workflow on average), and security practices (264 explicit permission declarations, predominantly read-only).
Key highlights include the dominance of the lightweight
ubuntu-slimrunner (65% usage), conservative timeout settings (12.4 minute average), and a highly modular job architecture centered around core patterns:agent,activation, anddetectionjobs appearing in the majority of workflows.Full Statistical Report
File Size Analysis
Overall Statistics
Size Distribution
Key Finding: 71.4% of lock files fall in the 200-300 KB range, indicating highly consistent workflow structure and complexity across the repository.
File Size Extremes
Smallest Files (typically test or shared imports):
Largest Files (feature-rich agents):
Trigger Analysis
Trigger Type Distribution
Total Trigger Entries: 95 across 77 files
Unique Trigger Types: 11
Schedule Patterns
Top scheduled execution times (cron patterns):
0 9 * * *0 0,6,12,18 * * *0 6 * * 00 2 * * 1-50 15 * * 10 0 * * *Insight: Scheduled workflows favor off-peak times (midnight, early morning) and business day patterns, optimizing for resource availability and timely reporting.
Multi-Trigger Workflows
Many workflows respond to multiple event types, enabling flexible activation patterns. Common combinations include:
issues+issue_comment+pull_request(interactive agents)schedule+workflow_dispatch(automated + manual)discussion+discussion_comment(discussion-based bots)Job Architecture Analysis
Job Complexity Statistics
Most Common Job Names
Standard Job Pattern
A typical agentic workflow follows this job dependency pattern:
Key Observations:
activationandagentjobsdetectionandmissing_tooljobs provide safety and observabilityupdate_reactionprovides user feedbackPermission Patterns
Permission Distribution
contents: readpull-requests: readissues: readactions: readdiscussions: readsecurity-events: readrepository-projects: readTotal Permission Declarations: 264
Write Permission Ratio: ~5% (estimated, mostly in output jobs)
Security Posture
Excellent: The workflows demonstrate strong security practices:
write-all)Safe Outputs Analysis
Safe Output Usage
Note: The safe-outputs pattern is embedded in job logic rather than always explicit in
with:sections, explaining the low direct count. Manual inspection confirms widespread use of safe output patterns across create_discussion, create_issue, and comment jobs.Output Strategy
Workflows primarily use these output strategies:
Infrastructure Patterns
Runner Usage
Total Runner Declarations: 416
Insight: Strong preference for
ubuntu-slim(65%) indicates cost optimization and faster startup times for lightweight agent tasks.Timeout Configuration
Timeout Distribution:
Insight: Conservative timeout settings (median 10 minutes) prevent runaway processes while allowing complex analysis when needed.
Concurrency Control
Common Concurrency Patterns:
gh-aw-${{ github.workflow }}-${{ github.event.issue.number }}- Per-issue concurrencygh-aw-${{ github.workflow }}- Workflow-level serializationgh-aw-copilot-${{ github.workflow }}- Engine-specific groupsPurpose: Prevent duplicate agent runs on the same issue/PR, optimizing resource usage and avoiding conflicting updates.
Structural Characteristics
Average Lock File Profile
Based on statistical analysis, a typical
.lock.ymlfile has:Workflow Naming Patterns
Common naming conventions:
daily-news,weekly-issue-summary,audit-workflowsarchie,brave,scout,grumpy-reviewercopilot-*,go-*,python-*duplicate-code-detector,schema-consistency-checkerMCP Server Usage
Finding: No explicit MCP server configuration detected in lock files at the infrastructure level. MCP (Model Context Protocol) integration appears to be handled at the agent/application level rather than workflow configuration, or embedded in shared imports (e.g.,
shared/mcp/arxiv.lock.yml,shared/mcp/context7.lock.yml).Interesting Findings
Remarkable Consistency: 71% of files within a narrow 100KB range (200-300KB) indicates mature, standardized workflow patterns.
Schedule Dominance: 40% of workflows use scheduled triggers, suggesting strong emphasis on proactive automation vs. reactive responses.
Job Pattern Convergence: The
agent/activation/detectiontriple appears in 82% of workflows, representing a stable architectural pattern.Security-First Design: 95%+ read-only permissions with write access strictly isolated to safe-output jobs demonstrates security best practices.
Resource Optimization: 65% use of
ubuntu-slimand 50% of jobs using 10-minute timeouts shows cost-conscious infrastructure choices.Persona-Based Naming: Many workflows use persona names (Archie, Brave, Scout) suggesting agent identity and role clarity.
Test Coverage: 11 dedicated test workflows (14% of total) including firewall, secret masking, and OAuth testing.
Shared Components: Presence of
shared/directory indicates code reuse and modular design patterns.Minimal Safe Output Variance: Only 2 safe output types detected in direct usage (
add-comment,create-issue), suggesting standardization on proven patterns.Schedule Time Optimization: Most scheduled workflows run during off-peak hours (midnight-9am UTC), optimizing for resource availability.
Recommendations
Based on this analysis, we recommend:
For New Workflows
pre_activation → activation → agent → outputpatternFor Optimization
For Security
For Monitoring
Methodology
Analysis Tools
Data Sources
.lock.ymlfiles in.github/workflows/and subdirectoriesCache Memory
/tmp/gh-aw/cache-memory/scripts//tmp/gh-aw/cache-memory/history/Validation
Historical Context
This is the initial comprehensive statistical analysis of the gh-aw lock file ecosystem. Future analyses will track:
Baseline metrics captured in
/tmp/gh-aw/cache-memory/history/2025-11-10.jsonfor comparison.Generated by Lockfile Statistics Analysis Agent
Analysis Date: November 10, 2025
Repository: githubnext/gh-aw
Lock Files Analyzed: 77 (16.09 MB)
Beta Was this translation helpful? Give feedback.
All reactions