If you've written C, used Linux, or shipped software in the last 30 years, you already understand how AI agents work. You just don't know it yet.
The Problem Nobody Tells You About
Every blog post about LLM agents starts with breathless excitement: "Agents can use tools! Agents can reason! Agents can do anything!"
Then you actually try to build one. And you hit a wall.
You have an LLM. You want to give it 50 capabilities — search, summarize, query a database, send emails, generate images, parse PDFs, the works. The naive approach: stuff every capability description into the system prompt. Done, right?
Wrong. A couple dozen capability descriptions in, your context window is bloated. The model gets confused. Latency climbs. Costs spike. The agent that was supposed to be "intelligent" starts ignoring half its tools and hallucinating the rest.
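To make the failure mode concrete, here's a minimal sketch of the naive approach. The `TOOLS` registry and the token arithmetic are hypothetical, purely for illustration:

```python
# Naive approach: inline every capability description into the system prompt.
# TOOLS is a hypothetical registry standing in for your 50 real capabilities.
TOOLS = [
    {"name": f"tool_{i}", "description": f"Does task {i}. <long spec here>"}
    for i in range(50)
]

system_prompt = "You are a helpful agent. Available tools:\n" + "\n".join(
    f"- {t['name']}: {t['description']}" for t in TOOLS
)

# If each real description runs ~150 tokens, that's ~7,500 tokens of
# boilerplate sent on every request, before the user types a single word.
```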
This is where you discover Agent Skills: Anthropic's framework for packaging agent capabilities, with similar ideas appearing in various forms across OpenAI, Google, and the broader agentic AI ecosystem. The marketing copy says skills are a "new paradigm for AI capabilities."
Here's the secret: skills aren't new at all. They're four old ideas from systems programming, reapplied. If you've ever written a header file or used a Unix pipe, you already understand the engineering principles.
Let me show you.
Idea 1: Skills Are Header Files
In C, you don't put implementation in your .h files. You put declarations:
```c
// search.h
Document* search_documents(const char* query, int limit, int* count);
```
The header tells the compiler: "Here's what this function looks like. Here's how to call it. The actual code lives elsewhere."
When you #include "search.h", you load the contract — not the implementation. The compiler validates calls against the contract. The linker resolves the actual implementation later.
A skill works exactly the same way:
```yaml
# search_documents skill metadata
name: search_documents
description: Retrieves documents matching a query
parameters:
  query: string
  limit: integer
returns: array of {id, title, snippet}
```
The agent sees this contract. It doesn't see the implementation. It reasons: "I have a tool called search_documents. It takes a query string and returns documents. I should use it for this user request."
When the agent decides to invoke the skill, the runtime resolves the actual implementation — exactly like a linker resolving a symbol.
Why this matters: You don't need to teach the model how search works internally. You just declare the contract. The same separation of interface from implementation that made C scalable makes agents scalable.
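Here's a minimal sketch of that contract/implementation split in Python. The `SkillContract` and `SkillRegistry` names are my own illustration, not any framework's actual API:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class SkillContract:
    """The 'header file': the only part the agent ever sees."""
    name: str
    description: str
    parameters: dict  # param name -> type, e.g. {"query": "string"}

class SkillRegistry:
    """The 'linker': resolves a declared name to its implementation."""
    def __init__(self):
        self._contracts: dict[str, SkillContract] = {}
        self._impls: dict[str, Callable] = {}

    def register(self, contract: SkillContract, impl: Callable):
        self._contracts[contract.name] = contract
        self._impls[contract.name] = impl

    def contracts(self) -> list[SkillContract]:
        # Only the contracts go into the agent's context window.
        return list(self._contracts.values())

    def invoke(self, name: str, **kwargs):
        # Symbol resolution happens at call time, not at prompt time.
        return self._impls[name](**kwargs)

registry = SkillRegistry()
registry.register(
    SkillContract(
        name="search_documents",
        description="Retrieves documents matching a query",
        parameters={"query": "string", "limit": "integer"},
    ),
    impl=lambda query, limit=10: [],  # real implementation lives elsewhere
)
docs = registry.invoke("search_documents", query="quarterly report", limit=5)
```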
Idea 2: Skill Loading Is Virtual Memory Paging
Here's where it gets interesting. You have 500 skills available. You can't fit them all in the context window. So what do you do?
The same thing operating systems have been doing since the 1960s: paging.
In virtual memory, your program "sees" more memory than physically exists. The OS keeps the actively-used parts in fast RAM, and pages out the rest to slow disk. When the program touches a missing page, the OS faults, swaps something out, swaps the needed page in, and resumes execution. The program doesn't know any of this is happening.
Skills work identically:
| Virtual Memory | Agent Skills |
|---|---|
| Program sees infinite memory | Agent sees all 500 skills |
| Reality: only 4MB RAM | Reality: ~200K token window |
| Pages swap in/out of RAM | Skill specs load/unload from context |
| Eviction policy (LRU, etc.) | Relevance-based selection |
| Page fault on missing page | Lazy load on skill invocation |
| Thrashing if working set > RAM | Confusion if skill set > context |
The agent's "working set" — skills it actually needs for the current task — is small. Usually 2-3 skills per query. Even if you have 500 available, the runtime only loads the relevant ones.
Why this matters: Apply 60 years of OS optimization research to your agent. Cache hot skills. Prefetch related ones. Detect thrashing (rapidly cycling skills) as a sign your selection algorithm is broken. The patterns are proven.
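A minimal sketch of that paging loop, assuming skill specs live in a plain dict and using a crude 4-characters-per-token heuristic. The `SkillPager` class is hypothetical, not a real framework API:

```python
from collections import OrderedDict

class SkillPager:
    """Keeps the hot 'working set' of skill specs inside a token budget,
    evicting least-recently-used specs, like an OS page cache."""

    def __init__(self, budget_tokens: int, spec_store: dict[str, str]):
        self.budget = budget_tokens
        self.store = spec_store        # name -> full spec text (the "disk")
        self.resident = OrderedDict()  # name -> spec text (the "RAM")

    def _tokens(self, text: str) -> int:
        return len(text) // 4          # crude heuristic: ~4 chars per token

    def _used(self) -> int:
        return sum(self._tokens(s) for s in self.resident.values())

    def fault_in(self, name: str) -> str:
        """Page fault: load a spec into context, evicting LRU specs to fit."""
        if name in self.resident:
            self.resident.move_to_end(name)  # mark as recently used
            return self.resident[name]
        spec = self.store[name]
        while self.resident and self._used() + self._tokens(spec) > self.budget:
            self.resident.popitem(last=False)  # evict least-recently-used
        self.resident[name] = spec
        return spec
```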
Idea 3: Skill Composition Is Unix Pipes
Unix philosophy, distilled: small programs that do one thing well, composed via pipes.
```bash
cat data.txt | grep "error" | sort | uniq -c
```
Each command is dumb. grep doesn't know about sort. sort doesn't know about uniq. The shell orchestrates them by piping output to input.
Agent skills should follow the same pattern. A search_documents skill doesn't know what happens to its results. A summarize_text skill doesn't care where its input came from. The agent — playing the role of the shell — decides how to compose them:
search_documents → summarize_text → store_results → notify_user
This works because each skill has a clean, typed interface. Output of one matches input of the next. The agent reasons about composition; the skills stay simple.
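A sketch of that shell-style orchestration in Python. The three skill functions are hypothetical stand-ins; any real skills with matching input/output types would compose the same way:

```python
from functools import reduce

# Each "skill" does one thing and knows nothing about its neighbors.
def search_documents(query: str) -> list[dict]:
    return [{"id": 1, "title": "Q3 report", "snippet": "error rates fell"}]

def summarize_text(docs: list[dict]) -> str:
    return " ".join(d["snippet"] for d in docs)

def store_results(summary: str) -> str:
    print(f"stored: {summary}")
    return summary

# The agent plays the shell: it pipes each output into the next input.
def pipe(value, *skills):
    return reduce(lambda acc, skill: skill(acc), skills, value)

result = pipe("error rates", search_documents, summarize_text, store_results)
```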
Why this matters: Resist the temptation to build mega-skills. A skill that "searches, summarizes, and stores" is a monolith — hard to test, hard to reuse, hard to debug. Small skills compose. Big skills don't.
Idea 4: Skill Versioning Is Semantic Versioning
You ship search_documents v1.0. Six months later, you want to add a filters parameter. Do you break every existing agent? Or do you handle it like a normal library?
Use SemVer:
- v1.0.0 → original release
- v1.1.0 → adds optional `filters` (backward compatible: minor)
- v2.0.0 → renames `query` to `queries` (breaking: major)
Agents pin to versions. Registries track compatibility. Deprecation warnings precede removal. Nothing here is novel; it's exactly how npm, pip, and cargo already work.
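A sketch of what pinning could look like, using the real third-party `packaging` library for specifier matching. The published-versions list and the skill registry itself are hypothetical:

```python
from packaging.specifiers import SpecifierSet
from packaging.version import Version

# Hypothetical registry: versions of search_documents published over time.
PUBLISHED = ["1.0.0", "1.1.0", "2.0.0"]

# An agent pins a compatible range, exactly like a package.json or
# Cargo.toml dependency: any 1.x at or above 1.1, but never 2.x.
pin = SpecifierSet(">=1.1.0,<2.0.0")

compatible = sorted((v for v in PUBLISHED if v in pin), key=Version)
print(compatible[-1] if compatible else "no compatible version")  # -> 1.1.0
```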
Why this matters: Skills are a software supply chain. Treat them like one. Version them. Audit them. Apply the same discipline you'd apply to any production dependency.
The Punchline
When Anthropic introduced Agent Skills, the framing was: "Here's a new way to extend LLMs."
But what's actually happening is more interesting. The agentic AI community is rediscovering systems programming principles and applying them to a new substrate.
- Headers solve interface/implementation separation → so does skill metadata
- Virtual memory solves resource scarcity via dynamic loading → so does skill paging
- Unix pipes solve composition via clean interfaces → so do skill chains
- SemVer solves evolution via explicit contracts → so does skill versioning
This is good news. It means you don't need to learn an entirely new mental model. You need to recognize the patterns you already know, applied to a new constraint: the LLM context window as the new "RAM."
What This Means for You
If you're a software engineer approaching agentic AI:
Stop reading hype articles. Read the agent's source code. You'll see headers, paging, pipes, and versioning — just dressed in new vocabulary.
Apply your existing instincts. When you see a 5,000-token system prompt with 50 inline tool descriptions, your reaction should be the same as seeing a 5,000-line `main.c` with no header files: this won't scale.

Demand the right primitives. If your agentic framework doesn't support lazy loading, versioning, and composition, it's incomplete. The history of systems programming has already shown what good abstractions look like.
Bridge for your team. When non-technical stakeholders ask "what are AI skills?", you don't need buzzwords. Tell them: "Header files, but for AI." They won't understand. But your engineers will.
The next decade of software engineering will be built on agentic systems. The engineers who succeed won't be the ones who learn the most jargon. They'll be the ones who recognize that most of the hard problems were solved 30-50 years ago — and who know how to translate those solutions to a new domain.
Skills aren't magic. They're old engineering, applied with discipline.
That's the best news you'll get all year.
Discussion welcome. What other systems programming patterns map cleanly to agentic AI? Where does the analogy break down?