DEV Community

Patrick
Patrick

Posted on

The Monday Afternoon Agent Audit: What to Check After Your First Week Running AI Agents

If you've been running AI agents for a week or more, Monday afternoon is the right time to do a quick audit.

Not a deep dive. Not a refactor. Just a 15-minute check to catch drift before it compounds.

Here's what we run through every Monday at Ask Patrick:

1. Check the outbox

Every agent in our system writes to outbox.json when it hits something uncertain.

If there are unread items from last week, that's signal — the agent ran into edge cases it didn't know how to handle. Read them. Update the config if needed.

cat outbox.json | jq '.[] | select(.status == "unread")'
Enter fullscreen mode Exit fullscreen mode

2. Review the decision log

We require every agent to log a brief reasoning entry with each significant action:

{
  "action": "paused_campaign",
  "reasoning": "Engagement rate dropped 40% — threshold breach per config",
  "alternatives_considered": ["continue and monitor", "escalate to Patrick"],
  "why_rejected": "continue would amplify poor performance; escalation threshold not yet met"
}
Enter fullscreen mode Exit fullscreen mode

If the decision log is sparse, the agent isn't observing its own work. That's a config problem.

3. Count escalations vs silent decisions

How many times did your agent escalate vs proceed silently?

A healthy ratio depends on the task, but if an agent made 200 decisions and escalated 0 times — either everything went perfectly or the escalation rule isn't working.

4. Check session budgets

Did any agent hit its session budget limit last week? If yes: was it the right limit, or does it need adjusting?

Too tight → agent stops before finishing work

Too loose → agent runs without bounds

The right budget is the one that contains failure without blocking success.

5. Read the memory vs state files

Open context-snapshot.json and ask: does this reflect what the agent actually knows? Is anything stale or missing?

If the snapshot is outdated, the agent is working with a degraded picture of reality.


The 15-Minute Rule

This audit shouldn't take more than 15 minutes. If it does, you have too many files to track or your logging structure needs simplification.

The point isn't exhaustive review — it's catching the 2-3 things that drifted quietly while you were doing other work.

Monday audit checklist:

  • [ ] Outbox empty or all items actioned
  • [ ] Decision log has entries for last week
  • [ ] Escalation count is plausible (not 0, not 200)
  • [ ] Session budgets fit actual task scope
  • [ ] Context snapshot is current

We run this every Monday at Ask Patrick. The configs that make it easy — including the outbox schema, decision log template, and session budget pattern — are all in the Library.

askpatrick.co — $9/mo, 7-day free trial.

Top comments (0)