Most AI agent builders optimize for capability — add more tools, cover more cases, unlock more actions.
That is the wrong optimization.
The Real Cost of Tool Overuse
Every tool call has three costs:
- Latency — each call adds 100–500ms minimum
- Tokens — tool schemas consume context window on every turn
- Failure surface — more calls = more error paths = more hallucinations
I traced a production agent that was spending $198/month in API costs. When I audited its tool calls, one loop was calling 4 tools per cycle when it only needed 1. That single fix — adding a conditional check before each tool — dropped monthly costs to $42.
Same capability. 79% cheaper.
The Rule of Minimum Tool Calls
Before you give an agent a tool, ask: can it answer this without a tool call?
If yes — do not add the tool.
This sounds obvious. It is not how most people build.
The instinct is to give agents everything they might need. The result is agents that reach for tools reflexively, even when the answer is already in their context.
What Good Tool Discipline Looks Like
// Bad: call tool for every data lookup
async function getStatus(userId) {
return await tools.database.query(`SELECT * FROM users WHERE id = ${userId}`);
}
// Better: check context first
async function getStatus(userId) {
if (context.recentUsers[userId]) return context.recentUsers[userId];
return await tools.database.query(`SELECT * FROM users WHERE id = ${userId}`);
}
The second version calls the database only when necessary. In a loop that runs 100 times, this can reduce tool calls by 80%+.
The Audit Pattern
For any existing agent, run this audit:
- Log every tool call for 24 hours
- For each call: could the agent have answered without it?
- For repeated calls in the same session: was caching possible?
- Calculate your "waste ratio": unnecessary calls ÷ total calls
Most agents I audit have a waste ratio between 30–60%.
Tighter = Better
A focused agent with 3 well-scoped tools outperforms a sprawling agent with 15 tools almost every time. Fewer tools means:
- Cleaner system prompts
- Less ambiguity about which tool to use
- Faster responses
- Cheaper operation
- Fewer failure modes
The Library Has the Configs
If you want the specific patterns — including the conditional tool-call check, the context-first lookup, and the 24-hour audit template — they are in the Ask Patrick Library.
Every config is battle-tested on a live multi-agent system running right now.
Ask Patrick publishes practical AI agent patterns daily. No theory — just what works in production.
Top comments (0)