The Hype vs Reality
AI agents are everywhere in 2025. But deploying an LLM for every automation task is like using a jackhammer to hang a picture frame.
Understanding where agents excel—and where they don't—is the difference between building useful software and chasing trends.
What Makes Something an "AI Agent"
An agent:
- Has access to tools (functions it can call)
- Decides which tools to use and when
- Iterates until a goal is achieved
import Anthropic from '@anthropic-ai/sdk';
const anthropic = new Anthropic();
const tools: Anthropic.Tool[] = [
{
name: 'search_codebase',
description: 'Search for code patterns in the repository',
input_schema: {
type: 'object',
properties: { query: { type: 'string' } },
required: ['query'],
},
},
{
name: 'run_tests',
description: 'Execute the test suite',
input_schema: { type: 'object', properties: {} },
},
{
name: 'edit_file',
description: 'Edit a file in the repository',
input_schema: {
type: 'object',
properties: {
path: { type: 'string' },
content: { type: 'string' },
},
required: ['path', 'content'],
},
},
];
// Agentic loop
async function runAgent(task: string) {
const messages: Anthropic.MessageParam[] = [
{ role: 'user', content: task },
];
while (true) {
const response = await anthropic.messages.create({
model: 'claude-sonnet-4-6',
max_tokens: 4096,
tools,
messages,
});
if (response.stop_reason === 'end_turn') {
return response; // Agent is done
}
if (response.stop_reason === 'tool_use') {
const toolUses = response.content.filter(b => b.type === 'tool_use');
const toolResults = await Promise.all(toolUses.map(executeTool));
messages.push({ role: 'assistant', content: response.content });
messages.push({ role: 'user', content: toolResults });
// Loop continues
}
}
}
When Agents Win
Ambiguous tasks:
// Agent: "Refactor this authentication system to use JWT"
// Needs to: understand current code, identify what to change,
// make multiple edits, verify tests pass, fix issues
// → Agent iterates until done
Multi-step research:
Task: "Summarize the competitive landscape for SaaS billing tools"
Agent:
1. search('SaaS billing tools 2025')
2. fetch(competitor URLs)
3. search('customer reviews Stripe vs Paddle')
4. synthesize findings
Adaptive workflows:
When the next step depends on the result of the previous step, agents handle branching logic naturally.
When Traditional Automation Wins
Deterministic processes:
// DON'T: agent to send welcome email
// DO: webhook → email function
stripe.on('checkout.session.completed', async (event) => {
await sendWelcomeEmail(event.data.object.customer_email);
});
High-frequency, low-variance tasks:
// DON'T: agent to process 10,000 CSV rows
// DO: batch processing script
for (const row of csvRows) {
await processRow(row); // predictable, fast, cheap
}
Real-time requirements:
LLM inference takes 500ms-5s. A traditional function takes 1ms.
The Cost Reality
Traditional function: ~$0.0001 per invocation
LLM agent (claude-haiku-4-5-20251001, simple task): ~$0.001
LLM agent (claude-sonnet-4-6, complex task): ~$0.05-0.50
For 10,000 daily automations:
Traditional: $1/day
Agent: $10-5,000/day
Agents are 10-10,000x more expensive. Use them when that cost buys you capabilities you can't get otherwise.
The Decision Framework
Is the task:
Deterministic? → Traditional automation
Rule-based? → Traditional automation
High-frequency? → Traditional automation
Time-sensitive (<100ms)? → Traditional automation
Is the task:
Ambiguous / requires judgment? → Agent
Multi-step with unknown branches? → Agent
Requires understanding natural language? → Agent
Complex enough to justify 10-1000x cost? → Agent
MCP: The Right Abstraction
Model Context Protocol lets you build tools once and use them across any LLM:
// MCP tool definition
export const searchTool = {
name: 'search_docs',
description: 'Search documentation',
inputSchema: z.object({ query: z.string() }),
execute: async ({ query }) => vectorSearch(query),
};
// Works with Claude, GPT-4, Gemini via MCP
Building MCP tools is how you create reusable AI capabilities without locking into one provider.
MCP tools for AI agents, workflow automation, and developer productivity: Whoff Agents — see our Workflow Automator MCP at $15/mo.
Top comments (0)