There's a pattern on Hacker News that keeps coming up: "LLMs work best when the user defines their acceptance criteria first."
It's a great insight for chat prompts. But for autonomous agents, it's 10x more critical.
The Problem
When a human uses an LLM, they course-correct in real time. When an AI agent runs on a cron schedule with no supervision, there's no course-correction. Without defined acceptance criteria, agents either stop too early, run too long, or do the "right" thing in the wrong context.
The Fix: explicit done_when criteria
Every agent in our system has this in its config:
```json
{
  "agent": "content-agent",
  "task": "draft_tweet",
  "done_when": [
    "tweet is under 280 characters",
    "includes a link",
    "hook uses a specific number or metric"
  ],
  "fail_when": [
    "tweet makes promises we can't keep",
    "sentiment is hype rather than educational"
  ],
  "timeout_after": "3 loops"
}
```
The agent reads this before acting and checks against it before outputting.
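For the criteria that are mechanically verifiable, the pre-output check can be plain code rather than another LLM call. Here's a minimal sketch of that idea; the function name and regexes are illustrative, not our actual system:

```python
import re

def check_tweet(text: str) -> dict:
    """Return a pass/fail verdict for each verifiable done_when criterion."""
    results = {
        "under_280_chars": len(text) <= 280,
        "includes_link": bool(re.search(r"https?://\S+", text)),
        # "hook" here means the first line of the tweet
        "hook_has_number": bool(re.search(r"\d", text.splitlines()[0])),
    }
    results["done"] = all(results.values())
    return results

verdict = check_tweet(
    "We cut agent failures 40% with done_when checks: https://example.com"
)
```

Criteria like "sentiment is hype rather than educational" can't be checked this way; those go to a judge step. But pushing every mechanically checkable criterion into code keeps the expensive checks to a minimum.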
Why This Matters
Without acceptance criteria, agents default to plausible completion — they do what looks right. With criteria, they aim at verifiable completion — they do what's measurably right.
The pattern shows up everywhere:
- Consistency: same quality on loop #1 and loop #1,000
- Debuggability: trace failures to specific criteria that weren't met
- Multi-agent handoffs: Agent A knows exactly what Agent B expects
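The handoff case is worth spelling out. One way to enforce it (a hypothetical sketch, not our production code) is a gate that validates Agent A's output against Agent B's done_when list before B ever runs:

```python
def handoff(output, downstream_done_when, check):
    """Accept a handoff only if every downstream criterion passes.

    Returns (ok, failed_criteria) so the caller can retry or escalate.
    """
    failed = [c for c in downstream_done_when if not check(output, c)]
    return (len(failed) == 0, failed)

# Illustrative criteria for a downstream agent that expects a markdown draft
checks = {
    "has_title": lambda o: o.strip().startswith("#"),
    "nonempty_body": lambda o: len(o.splitlines()) > 1,
}

ok, failed = handoff(
    "# Draft\nBody text",
    list(checks),
    lambda output, criterion: checks[criterion](output),
)
```

The point is that B's acceptance criteria double as A's exit criteria, so a bad artifact never crosses the boundary silently.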
The Three Fields Every Agent Task Needs
- done_when — specific, verifiable conditions
- fail_when — hard stops that override completion
- timeout_after — max loops before escalation
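The three fields compose into one control loop. A sketch, assuming hypothetical generate/done/fail callables:

```python
def run_task(generate, done, fail, max_loops=3):
    """Retry until done_when passes, abort on fail_when, escalate on timeout."""
    for attempt in range(1, max_loops + 1):
        draft = generate(attempt)
        if fail(draft):          # fail_when overrides completion
            return ("aborted", draft)
        if done(draft):          # done_when met: ship it
            return ("done", draft)
    return ("escalated", None)   # timeout_after hit: hand off to a human

status, result = run_task(
    generate=lambda n: "ok" if n == 2 else "bad",  # succeeds on loop 2
    done=lambda d: d == "ok",
    fail=lambda d: False,
)
```

Note the ordering: fail_when is checked before done_when, so a draft that technically satisfies the criteria but trips a hard stop never ships.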
We run 5 agents 24/7 on this pattern. It's what separates "the agent kind of worked" from "the agent reliably works."
Full config templates across 5 agent types → askpatrick.co