Jack M

Posted on Jun 25

AI Agent Rollback Plan: Undo Bad Actions Before Users Lose Trust

#agents #ai #architecture #systemdesign

A reliable AI agent is not the one that never makes a mistake. It is the one that can stop, explain what happened, and recover before a small error becomes a customer-facing mess.

That sounds obvious until an agent updates the wrong CRM field, sends a duplicate webhook, retries a payment lookup five times, or writes a half-finished configuration into production. The model may look smart. The workflow may even return success. But your system is now carrying damage that a normal retry cannot fix.

If you are building agents that touch real customer data, you need a rollback plan before you need a rollback incident. This guide gives you a practical blueprint.

Why rollback is becoming an AI agent feature

Recent AI platform activity points in one direction: agents are moving closer to real work.

Developer tools are adding shared coding agents inside team chat. Workflow platforms are making it easier to connect agents to docs, boards, forms, CRMs, browsers, and internal APIs. Model gateways and multi-model platforms are pushing teams to route work across providers. Open-source agent frameworks keep improving long-running execution, memory, tool calling, and local deployment.

That is useful. It is also risky.

The moment an AI agent can mutate state, rollback stops being a “nice backend concern” and becomes part of the product experience. Users do not care whether the failure came from a prompt, a tool timeout, a stale cache, or a retry race. They care whether the system can recover cleanly.

Common agent failure modes include:

The agent calls the right tool with the wrong record ID.
A retry repeats a non-idempotent action.
A model switch changes the shape of a tool argument.
A browser agent clicks through a page after the UI changes.
A workflow resumes with stale memory.
A partial tool sequence leaves the system in an inconsistent state.
A background job finishes after the user already cancelled the task.

Traditional apps already need transactions, queues, logs, and recovery playbooks. AI agents need all of that plus a readable explanation layer, because the failure path often includes natural language decisions.

Start with an action ledger

You cannot roll back what you did not record.

Every state-changing agent tool call should create an action ledger entry before and after execution. This ledger is not just an observability log. It is the source of truth for recovery.

type AgentActionLedgerEntry = {
  actionId: string;
  runId: string;
  tenantId: string;
  userId: string;
  toolName: string;
  targetType: 'ticket' | 'contact' | 'invoice' | 'document' | 'setting';
  targetId: string;
  idempotencyKey: string;
  status: 'planned' | 'approved' | 'running' | 'succeeded' | 'failed' | 'compensated';
  beforeSnapshotRef?: string;
  afterSnapshotRef?: string;
  compensationActionId?: string;
  createdAt: string;
};

The key fields are boring on purpose. actionId gives each action a stable identity, runId connects actions to one workflow, tenantId prevents cross-customer recovery mistakes, and idempotencyKey stops duplicate writes. Do not bury this in raw model traces only. Traces are helpful for debugging. The ledger is for operations.

Classify actions by rollback difficulty

Not every tool call can be reversed in the same way. Before giving an agent write access, classify each action.

Action type	Example	Rollback strategy
Read-only	Search docs, fetch CRM record	No rollback needed; log access
Draft-only	Create email draft, generate report	Delete or archive draft
Internal update	Change ticket priority	Restore previous value from snapshot
External reversible	Create calendar event	Delete event or update status
External irreversible	Send email, submit form, charge card	Require approval and use compensation, not true rollback
Multi-step workflow	Update CRM, notify Slack, create task	Use saga-style compensating actions

The dangerous category is “looks reversible but is not.” Sending a Slack message can be deleted in some workspaces, but not always. Sending an email is not meaningfully reversible. A payment can often be refunded, but that is not the same as never charging the card.

For irreversible actions, the rollback plan should be prevention plus compensation:

Ask for human approval before execution.
Show the exact action preview.
Use scoped credentials.
Store a clear audit trail.
Provide a follow-up compensation path.

Use idempotency keys for every write

AI agents retry more often than users realize. They retry after tool timeouts, provider errors, queue restarts, browser navigation failures, and model fallback events.

That is fine if your writes are idempotent. It is painful if they are not.

An idempotency key lets the tool layer say, “I have already performed this logical action.” The agent can ask again without duplicating the side effect.

import crypto from 'crypto';

function createIdempotencyKey(input: {
  tenantId: string;
  runId: string;
  toolName: string;
  targetId: string;
  logicalOperation: string;
}) {
  return crypto
    .createHash('sha256')
    .update(JSON.stringify(input))
    .digest('hex');
}

Use the key at the boundary where the write happens:

async function updateTicketPriority(args: {
  tenantId: string;
  ticketId: string;
  priority: 'low' | 'normal' | 'high';
  idempotencyKey: string;
}) {
  const existing = await db.agentWrites.findUnique({
    where: { idempotencyKey: args.idempotencyKey }
  });

  if (existing?.status === 'succeeded') {
    return existing.result;
  }

  const before = await db.ticket.findFirstOrThrow({
    where: { id: args.ticketId, tenantId: args.tenantId }
  });

  await db.agentWrites.create({
    data: {
      idempotencyKey: args.idempotencyKey,
      tenantId: args.tenantId,
      targetId: args.ticketId,
      status: 'running',
      beforeSnapshot: before
    }
  });

  const after = await db.ticket.update({
    where: { id: args.ticketId, tenantId: args.tenantId },
    data: { priority: args.priority }
  });

  return after;
}

The important detail: the model does not enforce idempotency. Your tool runtime does.

Add compensating actions, not just database rollback

Database transactions help when the work is local and short. Agent workflows are often long and distributed. They may call your database, a vector store, a ticketing API, a calendar API, a messaging system, and a browser session.

For that, use a saga pattern: every forward action has a compensating action where possible.

Example:

Forward action	Compensation
Create task	Delete or mark cancelled
Update record field	Restore previous value
Add user to segment	Remove user from segment
Create draft	Archive draft
Post internal message	Post correction or delete if allowed
Send external email	Send correction, not rollback

A compensating action should be a real tool with the same safety rules as any other tool. Do not let the model invent undo behavior from scratch during an incident.

type CompensationPlan = {
  originalActionId: string;
  compensationTool: string;
  riskLevel: 'low' | 'medium' | 'high';
  requiresApproval: boolean;
  args: Record<string, unknown>;
};

For high-risk compensation, show the user what will happen:

“I changed ticket T-182 from normal to high. I can restore it to normal and add a note explaining the correction. Do you want me to do that?”

That moment builds trust. Silent recovery can be useful, but visible recovery is better when user data changed.

Design the workflow as checkpoints

A rollback plan works best when the agent workflow has checkpoints.

A checkpoint is a safe pause point where the system has enough information to resume, retry, or roll back without guessing.

For example:

Understand the request.
Fetch relevant records.
Plan tool actions.
Ask for approval when needed.
Execute action group A.
Verify result.
Execute action group B.
Summarize outcome.

Each checkpoint should store:

Current workflow state
Planned actions
Completed actions
Pending actions
Approval state
Recovery instructions
Trace links
User-visible summary

This is where frameworks such as LangGraph-style state graphs, durable workflow engines, queues, and custom orchestration layers become useful. The specific framework matters less than the state discipline.

A simple checkpoint object can look like this:

{
  "runId": "run_91x",
  "checkpoint": "after_ticket_update",
  "completedActionIds": ["act_1", "act_2"],
  "pendingActionIds": ["act_3"],
  "requiresUserApproval": false,
  "recoveryMode": "resume_or_compensate",
  "lastVerifiedAt": "2026-06-26T02:31:00+05:30"
}

If the worker dies here, the next worker should not ask the model to “figure out where we were.” It should load the checkpoint.

Separate model retry from tool retry

One production mistake is treating every failure as a reason to ask the model again.

Do not do that.

Separate retries into layers:

Model retry: The response was malformed, incomplete, or violated a schema.
Tool retry: The API timed out, returned a transient error, or hit a rate limit.
Workflow retry: A worker crashed or a queue job resumed.
Human retry: The user corrected the plan or approved a different path.

Tool retries should usually reuse the same validated tool arguments and idempotency key. They should not ask the model to regenerate the action unless the failure was caused by bad arguments.

Bad pattern:

Tool timeout → ask model to create a new action → duplicate write risk

Better pattern:

Tool timeout → retry same actionId with same idempotencyKey → verify final state

This reduces accidental drift. The agent does not get a new chance to reinterpret the task every time the network sneezes.

Verify after every important action

Rollback is not only about undo. It is also about detecting when undo is needed.

After a state-changing action, run a verification step that does not depend on the model’s confidence.

Examples:

Read the record back and compare expected fields.
Check the external API status.
Confirm the action ledger changed from running to succeeded.
Validate that the tenant ID and target ID match the boundary.
Run a small policy check on the result.
Ask for human confirmation when the result has external impact.

async function verifyTicketPriority(args: {
  tenantId: string;
  ticketId: string;
  expectedPriority: string;
}) {
  const ticket = await db.ticket.findFirstOrThrow({
    where: { id: args.ticketId, tenantId: args.tenantId }
  });

  if (ticket.priority !== args.expectedPriority) {
    return {
      ok: false,
      reason: 'priority_mismatch',
      actualPriority: ticket.priority
    };
  }

  return { ok: true };
}

If verification fails, do not let the workflow continue blindly. Move the run into a recovery queue.

Build a recovery queue for humans and agents

A recovery queue is where failed or suspicious runs go for review. It should be boring, searchable, and operationally useful.

Include:

Run ID
Customer or workspace
User request
Failed action
Risk level
Current state
Suggested compensation
Required approval
Trace link
Time since failure

The queue should support three actions:

Resume: Continue from the last safe checkpoint.
Compensate: Run the predefined undo or correction path.
Close: Mark as expected or already handled.

For low-risk internal changes, the system may auto-compensate. For high-risk actions, ask a human.

This is especially important for solo developers and small teams. You may not have a full operations department, but you can still create a small admin page that prevents recovery work from living in logs, Slack threads, or memory.

Show users a clear correction trail

When an agent makes a visible mistake, the worst response is vague language.

Avoid:

“Something went wrong. Please try again.”

Better:

“I updated the wrong ticket priority during a retry. I restored ticket T-182 to normal, left ticket T-204 unchanged, and saved the incident in the activity log.”

A correction trail should explain:

What changed
Why it was corrected
What was restored
What could not be undone
Whether any human reviewed it
Where the audit record lives

This does not need to expose internal chain-of-thought or raw prompts. It should expose operational facts.

A practical implementation sequence

You do not need to build the perfect rollback platform in one sprint. Start with the risky paths.

Step 1: Inventory write tools

List every agent tool that changes state. Include internal APIs, external APIs, browser actions, file writes, messages, and workflow triggers.

Step 2: Add risk levels

Mark each write as low, medium, or high risk. High-risk actions require approval or a stronger verification path.

Step 3: Add idempotency

Make duplicate writes boring. This one change prevents many ugly incidents.

Step 4: Store before snapshots

For reversible internal changes, save the previous state before execution.

Step 5: Create compensation tools

Define explicit undo or correction tools. Do not rely on prompt instructions alone.

Step 6: Add checkpoints

Store workflow state after important phases. Resume from state, not from model memory.

Step 7: Build a small recovery queue

Even a basic admin table is better than searching logs during an incident.

How this fits into a broader AI architecture

Rollback is the recovery layer beside observability, approval gates, tenant isolation, evals, failover, and output provenance. A strong agent architecture does not assume the model will always choose correctly. It assumes the system must constrain, verify, and recover from model behavior.

Final takeaway

Agents are becoming more capable, more connected, and more trusted with real work. That makes rollback a product feature, not just an engineering cleanup task.

If your AI agent can change customer data, it should also be able to answer three questions:

What exactly did I change?
Can I safely undo or compensate for it?
Can I prove the recovery path worked?

Build that before the incident. Your future self will be grateful.

FAQ

What is an AI agent rollback plan?

An AI agent rollback plan is a set of technical patterns for recovering from bad or partial agent actions. It usually includes action ledgers, idempotency keys, before snapshots, compensating actions, checkpoints, verification steps, and a recovery queue.

Can every AI agent action be rolled back?

No. Internal updates are often reversible, but external actions such as emails, payments, submitted forms, and third-party messages may only be compensatable. For those actions, use approval gates, previews, audit logs, and correction workflows.

Why are idempotency keys important for AI agents?

AI agents often retry tool calls after timeouts, queue restarts, or provider errors. Idempotency keys prevent the same logical action from creating duplicate side effects, such as duplicate tickets, messages, or record updates.

Should rollback logic live in the prompt?

No. Prompts can describe recovery behavior, but rollback logic should live in the tool runtime, workflow engine, database, and recovery UI. The model can suggest a path, but the system should enforce safe recovery.

How do I test agent rollback behavior?

Test failure cases directly: crash workers after writes, force tool timeouts, replay duplicate jobs, resume from stale checkpoints, cancel runs mid-flow, and verify that compensation actions restore or correct the right resources.

What is the difference between rollback and compensation?

Rollback restores a previous state, usually inside systems you control. Compensation performs a follow-up action when true undo is impossible, such as sending a correction message, cancelling a task, or issuing a refund.

DEV Community