Archon: The YAML Harness That Makes Claude Code Run the Same Every Time

Every dev who used Claude Code for more than a week hit this wall: you run the same task twice and get completely different results. Different file structure, skipped tests, PR descriptions that don’t meet team standards. That’s not a Claude problem — it’s an architecture problem. And Archon just became the most starred open source fix for it.

What is Archon?

Archon (github.com/coleam00/Archon) is a workflow orchestration engine that wraps AI coding agents — Claude Code, Codex CLI, and others — inside pipelines defined in YAML. You define the sequence: plan, implement, validate, review, open the PR. The AI writes all the code. But the structure is deterministic. Same YAML, same sequence, every execution.

It just surpassed 19,700 stars on GitHub and is sitting at #2 trending this week, following a complete rewrite announced on April 7th. The Python code was archived and Archon reemerged as a TypeScript engine with cleaner architecture and sharper focus: not an “agent that builds agents”, but a harness that makes the agents you already use behave predictably.

Four types of nodes, one workflow

Archon workflows are directed acyclic graphs made up of four basic blocks:

  • AI nodes — pass a prompt to the agent so it can reason, plan, or generate code
  • Deterministic nodes — run scripts, git commands, linting
  • Loop nodes — keep iterating until a condition is met (for example, tests passing)
  • Interactive nodes — pause and wait for human approval before continuing

This hybrid design is the core idea: AI generates, deterministic nodes validate, loops ensure quality, and humans approve before anything ships. You’re not limiting the model — you’re channeling it.

Proof in real production

Stripe merges over 1,300 AI-generated pull requests every week. Zero lines written by humans. Their internal agents (“Minions”) run on a harness architecture: isolated environments, hard iteration limits, automatic diff extraction, no direct pushes to main. What makes that possible isn’t a better model — it’s structure.

The data backs this up: two teams using identical models can see task completion rates of 60% vs. 98% based purely on harness quality. The gap between models is shrinking. The gap between harness quality across teams is growing.

Get started in 5 minutes

Archon comes with 17 pre-built workflows: bug fixes, feature development, PR review, refactoring, architectural improvements. To get going:

curl -fsSL https://archon.diy/install | bash
cd your-project
archon init
archon run workflows/fix-bug.yml

Workflows are YAML files that live in your repository, versioned alongside your code. New dev joining the team? Run the workflow. Compliance audit? Show the YAML.

The honest take

Archon is in beta — the repo itself says it: expect not everything to work 100%. The April 7th rewrite is very recent: 106 open issues, 45 pending PRs. If you need solid stability today, wait a few weeks. If you want to help define what the harness engineering standard looks like, this is the moment to get involved.

The stars tell you the community sees something real here. The beta label tells you to adjust expectations accordingly.