
Operational patterns for AI agents and infrastructure, from solo to enterprise scale. Applies DevOps + SRE principles to the intersection: infrastructure FOR AI + AI FOR infrastructure. An alpha framework extracting meta-patterns from mission-critical environments.


boshu2/12-factor-agentops


12-Factor AgentOps

Operational principles for reliable AI agents



The Improvement Cycle



How do we take AI agents from 80% reliable to production-grade?


I've watched too many teams follow the same pattern: build exciting AI features, ship them to production, watch them break, add manual review, hit a bottleneck, and abandon the project.

The framework isn't the problem—the operational model is.

Deterministic Code

  • Same input → same output
  • Predictable, reproducible errors
  • Unit tests, edge cases
  • Code review, testing

AI Agents

  • Same input → different output
  • Novel, context-dependent errors
  • Probabilistic sampling
  • Observability, validation, fallbacks
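The "observability, validation, fallbacks" approach can be sketched in a few lines. This is a minimal illustration, not part of the framework: `call_with_validation`, `fake_agent`, `is_json_object`, and `safe_default` are hypothetical names, and the fake agent stands in for a real model call whose output varies between attempts.

```python
import json
from typing import Callable, Dict, Tuple


def call_with_validation(
    generate: Callable[[str], str],
    validate: Callable[[str], bool],
    fallback: Callable[[str], str],
    prompt: str,
    max_attempts: int = 3,
) -> Tuple[str, Dict[str, object]]:
    """Retry a non-deterministic generator until its output validates.

    Returns the output plus a small record for observability. If no
    attempt validates, the deterministic fallback is used instead.
    """
    record: Dict[str, object] = {"attempts": 0, "used_fallback": False}
    for _ in range(max_attempts):
        record["attempts"] += 1
        output = generate(prompt)
        if validate(output):
            return output, record
    record["used_fallback"] = True
    return fallback(prompt), record


# Hypothetical stand-in for an agent: first answer is malformed, second is valid.
_outputs = iter(["not json", '{"status": "ok"}'])


def fake_agent(prompt: str) -> str:
    return next(_outputs)


def is_json_object(text: str) -> bool:
    """Validation gate: accept only text that parses to a JSON object."""
    try:
        return isinstance(json.loads(text), dict)
    except json.JSONDecodeError:
        return False


def safe_default(prompt: str) -> str:
    """Fallback: a fixed answer that hands the task to a person."""
    return '{"status": "needs_human_review"}'


result, record = call_with_validation(
    fake_agent, is_json_object, safe_default, "summarize the deploy"
)
print(result, record)
```

The record makes each call observable (how many attempts, whether the fallback fired), which is the kind of signal unit tests alone cannot give for probabilistic output.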

The answer is 20 years of DevOps/SRE wisdom—validation gates, observability, zero-trust, pattern extraction—finally applied to the right problem.

The operational layer of the Vibe Ecosystem. Gene Kim and Steve Yegge's Vibe Coding is the methodology for working with AI. Dex Horthy's 12-Factor Agents shows how to build autonomous systems. This shows how to operate them reliably.


Quick Load for AI Sessions — Compressed context (~10k tokens)
| Document | Tokens | Coverage |
| --- | --- | --- |
| docs/00-SUMMARY.md | ~10k | 99% |

The summary compresses the full corpus into ~10k tokens while keeping the essentials (about 99% coverage). Start here for AI-assisted work.


The 12 Factors

Foundation (I-IV)

Build reliability from the ground up

I. Automated Tracking
Persistent memory. No tribal knowledge.

II. Context Loading
Load only what you need. Stay under 40% of the context window.

III. Focused Agents
One agent, one job. Compose specialists.

IV. Continuous Validation
Check at every step. Catch errors early.
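As a rough illustration of Factor II, here is a sketch of loading documents in priority order while staying under a budget. It assumes the 40% figure refers to the share of the model's context window in use. The window size, the four-characters-per-token estimate, and the names `estimate_tokens` and `select_context` are all assumptions made for the example, not part of the framework.

```python
from typing import List, Tuple

CONTEXT_WINDOW_TOKENS = 200_000  # assumption: depends on the model in use
BUDGET_FRACTION = 0.40           # the "stay under 40%" threshold


def estimate_tokens(text: str) -> int:
    """Crude heuristic (assumption): roughly 4 characters per token."""
    return max(1, len(text) // 4)


def select_context(
    documents: List[Tuple[str, str]],
    window: int = CONTEXT_WINDOW_TOKENS,
    fraction: float = BUDGET_FRACTION,
) -> Tuple[List[str], int]:
    """Pick documents in the given priority order without exceeding the budget.

    `documents` is a list of (name, text) pairs, most important first.
    Documents that would push usage over the budget are skipped.
    """
    budget = int(window * fraction)
    used = 0
    selected: List[str] = []
    for name, text in documents:
        cost = estimate_tokens(text)
        if used + cost > budget:
            continue  # too large for what is left; try the next one
        selected.append(name)
        used += cost
    return selected, used


# Toy example with a 1,000-token window, so the budget is 400 tokens.
docs = [
    ("summary", "x" * 800),    # ~200 tokens
    ("runbook", "x" * 1200),   # ~300 tokens, would exceed the budget
    ("notes", "x" * 400),      # ~100 tokens
]
selected, used = select_context(docs, window=1000)
print(selected, used)
```

A real setup would use the model's own tokenizer rather than a character count, but the gate itself stays the same: measure before loading, and skip what does not fit.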

Operations (V-VIII)

Scale and maintain in production

V. Measure Everything
If you can't measure it, you can't improve it.

VI. Resume Work
Save state. Pick up where you left off.

VII. Smart Routing
Send tasks to the right specialist.

VIII. Human Validation
Humans approve critical steps.
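Factor VI (Resume Work) can be sketched as a checkpoint file that records completed steps. This is a minimal illustration under stated assumptions: the JSON file layout and the names `load_checkpoint`, `save_checkpoint`, and `run_steps` are hypothetical, and the `stop_after` parameter exists only to simulate an interrupted session.

```python
import json
import os
import tempfile
from pathlib import Path
from typing import List, Optional


def load_checkpoint(path: Path) -> dict:
    """Return saved state, or a fresh state if no checkpoint exists yet."""
    if path.exists():
        return json.loads(path.read_text())
    return {"completed": []}


def save_checkpoint(path: Path, state: dict) -> None:
    """Write to a temp file, then rename it over the checkpoint.

    An interruption mid-write then leaves the previous checkpoint intact.
    """
    tmp = path.with_suffix(path.suffix + ".tmp")
    tmp.write_text(json.dumps(state))
    os.replace(tmp, path)


def run_steps(steps: List[str], path: Path, stop_after: Optional[int] = None) -> dict:
    """Run steps in order, skipping any already recorded as completed.

    stop_after simulates an interruption after that many new steps.
    """
    state = load_checkpoint(path)
    done_now = 0
    for step in steps:
        if step in state["completed"]:
            continue
        if stop_after is not None and done_now >= stop_after:
            break
        # Real agent work for `step` would happen here.
        state["completed"].append(step)
        save_checkpoint(path, state)
        done_now += 1
    return state


with tempfile.TemporaryDirectory() as workdir:
    checkpoint = Path(workdir) / "session.json"
    plan = ["research", "plan", "implement", "learn"]
    first = run_steps(plan, checkpoint, stop_after=2)  # interrupted session
    second = run_steps(plan, checkpoint)               # resumed session
    print(first["completed"])
    print(second["completed"])
```

Saving after every step costs a small write each time, but it means a resumed session never repeats finished work.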

Improvement (IX-XII)

Continuous learning and adaptation

IX. Mine Patterns
Search history. Find what works.

X. Small Iterations
Improve continuously. Small tweaks compound.

XI. Fail-Safe Checks
Prevent repeating mistakes. Add guardrails.

XII. Package Patterns
Bundle what works. Reuse it.
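One way to picture Factor XI (Fail-Safe Checks) is a list of guardrails, each derived from a past mistake, that every proposed action is checked against before it runs. The sketch below is illustrative only: the `Guardrail` type, the `check_action` helper, and both example rules are hypothetical and not taken from the framework or its plugins.

```python
import re
from typing import List, NamedTuple


class Guardrail(NamedTuple):
    name: str
    pattern: str  # regex matching an action that caused trouble before
    reason: str


# Hypothetical rules, each standing for a lesson learned from an incident.
GUARDRAILS = [
    Guardrail(
        "no-force-push",
        r"git push .*--force",
        "A force push once overwrote shared history.",
    ),
    Guardrail(
        "no-prod-delete",
        r"kubectl delete .*(-n|--namespace)[ =]prod",
        "Deleting resources in prod requires human approval.",
    ),
]


def check_action(command: str, guardrails: List[Guardrail]) -> List[Guardrail]:
    """Return every guardrail the proposed command violates (empty if none)."""
    return [g for g in guardrails if re.search(g.pattern, command)]


for proposed in [
    "git push origin main --force",
    "kubectl delete pod api -n prod",
    "kubectl get pods",
]:
    violations = check_action(proposed, GUARDRAILS)
    if violations:
        print("BLOCKED:", proposed, [g.name for g in violations])
    else:
        print("ok:", proposed)
```

Regex matching on command strings is a deliberately simple choice here. The point is the loop: each failure adds a rule, so the same mistake gets caught before it repeats.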


12-Factor AgentOps Landscape



Getting Started

Start with your pain point:

| Problem | Solution |
| --- | --- |
| Context problems? | Factor II: Context Loading |
| Reliability issues? | Factor IV: Continuous Validation |
| No visibility? | Factor V: Measure Everything |

Go deeper:


Plugins — Claude Code marketplace implementing these factors

agentops — Production-ready plugins:

| Plugin | What It Does |
| --- | --- |
| constitution | Nine Laws, 40% Rule, Git discipline |
| core-workflow | Research → Plan → Implement → Learn cycle |
| session-management | Context bundling, progress tracking |
| vibe-coding | 5 core metrics, 6 difficulty levels |
| devops-operations | Kubernetes, Helm, ArgoCD patterns |
| software-development | Python, JavaScript, Go with TDD |

Credit

12-Factor App (Adam Wiggins) • 12-Factor Agents (Dex Horthy) • Vibe Coding (Gene Kim & Steve Yegge) • Beads (Steve Yegge)


License: CC BY-SA 4.0 (content) / Apache 2.0 (code)
