PracticeOverflow
Securing the Agentic Era: AI Agents as First-Class Security Principals


47 Agents, 6 Approvals

I was looking at a dashboard last month with a security architect I know. We were staring at a list of every AI agent running in her production environment.

The count was 47.

Her team had formally reviewed, pentested, and approved exactly six of them. The other 41 were out there in the wild — shipping traffic, scraping customer data, and chaining API calls. Nobody with a security background had ever looked at their code.

That isn't an outlier. That is the median enterprise in 2026.

We have spent the last two years treating agents like "advanced chatbots." We gave them user credentials and hoped for the best. But this week, the industry finally admitted what we've all secretly known: an agent isn't a user, and it isn't a service account. It's a new kind of principal, and our current identity stack is completely failing to contain it.

GitHub, Microsoft, Cloudflare, and Scalekit all dropped major architectures this week. They agree on the diagnosis. But the solutions? They are fighting three different wars.


The "Infinite Session" Problem

Here is why your CISO is losing sleep.

When I log into a system, I use OAuth. I click a button, I get a token, I do my work, and eventually, the session dies or I log out. The assumption baked into the protocol is that a human represents the session boundary.

Agents break that assumption.

An agent doesn't log out. It runs a loop. It takes that OAuth token — which was designed for a 30-minute browser session — and keeps it alive indefinitely. It passes that token to other tools. It explores. It makes decisions I didn't explicitly authorize.

We handed out master keys to interns who never go home.

82% of executives think their current policies cover this. Meanwhile, only 14.4% of agents shipping today have gone through a real security review. That gap is where the next major breach is going to happen.


Three Philosophies, One Diagram

If you look at the architectures released this week, you'll see three distinct schools of thought. I'm going to be honest: you can't mix and match these easily. You have to pick a philosophy.

The Isolationist (GitHub): Trust nothing. Put the agent in a padded cell (chroot jail). Give it zero secrets. If it wants to talk to the outside world, it has to ask a proxy.

The Governor (Microsoft): Register everything. Every agent gets a badge and a file in HR (Agent 365). Centralized control, centralized audit.

The Granularist (OAuth Community): Check every move. You don't get a session key; you get a single-use ticket for one specific ride.


So, How Do We Actually Build This?

The "Zero-Secret" Container

GitHub's post-mortem on their Copilot Coding Agent is the most pragmatic thing I read all week. They didn't try to prompt-engineer their way to safety. They used Linux primitives.

They run the agent in a container with zero secrets. None. The agent literally cannot exfiltrate credentials because it doesn't have them. Authentication tokens live in a separate proxy container. LLM keys live in a gateway.

The agent says "Open a PR." The proxy checks if that's allowed, attaches the token, and sends the request. The agent never touches the raw string.
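Here is a minimal sketch of that pattern. The class and action names are my own illustration, not GitHub's actual implementation; the point is that the token lives only inside the proxy, and the agent can only name an action from an allowlist:

```python
# Sketch of GitHub-style credential isolation: the agent process holds no
# secrets; a proxy validates each outbound action and attaches the token.
# Class names, action names, and URLs are illustrative, not GitHub's API.

class ActionDenied(Exception):
    pass

class CredentialProxy:
    """Runs in a separate container. The agent never sees the raw token."""

    ALLOWED_ACTIONS = {
        "open_pr":    ("POST", "https://api.github.com/repos/{repo}/pulls"),
        "read_issue": ("GET",  "https://api.github.com/repos/{repo}/issues/{n}"),
    }

    def __init__(self, token: str):
        self._token = token  # the secret lives only inside the proxy

    def forward(self, action: str, **params) -> dict:
        # The agent sends an action name, never a raw request with headers.
        if action not in self.ALLOWED_ACTIONS:
            raise ActionDenied(f"agent requested disallowed action: {action}")
        method, url_template = self.ALLOWED_ACTIONS[action]
        return {
            "method": method,
            "url": url_template.format(**params),
            "headers": {"Authorization": f"Bearer {self._token}"},
        }

# In practice the agent container talks to the proxy over a local socket;
# here we just call it directly to show the shape of the exchange.
proxy = CredentialProxy(token="gh_secret_lives_here")
req = proxy.forward("open_pr", repo="octo/demo")
```

Notice what the agent gets back: nothing. In the real setup the proxy sends the request itself; the agent only ever learns whether the action succeeded.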

This is the only way to sleep at night. If you are putting OPENAI_API_KEY in your agent's environment variables, you have already lost.

The Identity Crisis

Scalekit laid out the identity problem perfectly. We are trying to shove agents into OAuth, and it hurts.

They outlined four patterns, but the one that caught my eye is Capability Tokens. Instead of giving an agent a "Read Email" scope for an hour, you issue a token that can read one specific email and then immediately self-destructs.

It adds latency. It's annoying to implement. But for high-stakes operations — like transferring money or deleting infrastructure — "ambient authority" is a death sentence. You need per-action authorization.
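To make the pattern concrete, here is a toy in-memory version of a capability token: one action, one resource, short TTL, consumed on first use. The function names and log format are my own; a real deployment would use signed tokens and a shared store, not a module-level dict:

```python
import secrets
import time

# Sketch of the capability-token pattern: instead of a broad scope with a
# long TTL, each token authorizes one action on one resource and is
# destroyed the moment it is redeemed. Illustrative only.

_TOKENS: dict[str, dict] = {}

def issue(action: str, resource: str, ttl_seconds: float = 30.0) -> str:
    """Mint a single-use token for exactly one (action, resource) pair."""
    token = secrets.token_urlsafe(16)
    _TOKENS[token] = {
        "action": action,
        "resource": resource,
        "expires": time.monotonic() + ttl_seconds,
    }
    return token

def redeem(token: str, action: str, resource: str) -> bool:
    """Validate and consume the token. A second redeem always fails."""
    grant = _TOKENS.pop(token, None)  # pop, not get: single use
    if grant is None or time.monotonic() > grant["expires"]:
        return False
    return grant["action"] == action and grant["resource"] == resource
```

The `pop` is the whole trick: redemption and revocation are the same operation, so a leaked token is worthless the instant the legitimate action runs.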

The Unflushable Data

Cisco made a point that terrified me.

In a traditional breach, if a database gets stolen, you rotate keys, patch the hole, and apologize. You clean up.

But what happens when an attacker injects malicious data into your agent's context or fine-tuning set? That data becomes part of the model's weights or its long-term memory vector store. You can't just DELETE FROM users WHERE id=1. The bad data is now part of the agent's brain.

Cisco calls this "Data Permanence." I call it a nightmare. Standard incident response playbooks assume you can scrub the system. With AI, you might have to burn the model down and retrain from scratch.


My Take: Stop Trusting the Model

There is a temptation to use "System Prompts" as security. We tell the model: "You are a helpful agent. Do not delete production databases."

That is not security. That is a suggestion.

Adversarial research shows that fine-tuning attacks bypass model guardrails 57% to 72% of the time. If your security relies on the LLM being "good," you have baked a majority failure rate into your architecture.

The hard truth: You have to move security out of the prompt and into the infrastructure.

If I were building an agent platform today, I would adopt GitHub's Isolation-first model. Use firewalls. Use read-only filesystems. Use proxy containers for auth.

Yes, the Centralized Governance model (Microsoft) offers better visibility for compliance teams. And yes, Per-Action Auth (Scalekit) is more elegant. But isolation is the only thing that stops a confused agent from becoming an attacker.

The gap between "what the agent should do" and "what the agent can do" is the attack surface. Shrink it.


What You Should Do on Monday

1. Inventory the shadows.
Run a scan. Look for processes calling OpenAI, Anthropic, or Azure OpenAI endpoints. Find the 41 agents your security team doesn't know about. You cannot secure what you haven't found.
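If you already collect egress or DNS logs, the first pass can be crude and still useful. The hostname list and log shape below are assumptions to adapt to your own telemetry, not a standard:

```python
# Sketch of the inventory step: given egress logs (from a proxy, DNS
# resolver, or flow collector), flag sources that called a known LLM API
# endpoint. The hostnames and log format here are assumptions.

AI_ENDPOINT_HINTS = (
    "api.openai.com",
    "api.anthropic.com",
    "openai.azure.com",
    "generativelanguage.googleapis.com",
)

def find_shadow_agents(egress_log: list[dict]) -> set[str]:
    """Return source identifiers that reached a known LLM endpoint."""
    return {
        entry["source"]
        for entry in egress_log
        if any(hint in entry["dest_host"] for hint in AI_ENDPOINT_HINTS)
    }
```

Run it over a week of logs and compare the result against your approved-agent list. The difference is your shadow inventory.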

2. Kill the "Service Account" pattern.
Stop creating generic service accounts for agents. If you have an agent, give it a unique identity. If it gets compromised, you want to revoke that agent's access, not the access of every automation script in your org.
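The revocation argument is easiest to see in code. A minimal sketch, with invented names, of why per-agent credentials beat a shared service account:

```python
import secrets

# Sketch of per-agent identity: each agent gets its own credential, so a
# compromise is revocable in isolation. With a shared "svc-automation"
# account, revoking one agent means rotating every agent. Illustrative.

class AgentRegistry:
    def __init__(self):
        self._creds: dict[str, str] = {}

    def register(self, agent_id: str) -> str:
        """Issue a unique credential for this agent only."""
        cred = secrets.token_urlsafe(24)
        self._creds[agent_id] = cred
        return cred

    def is_valid(self, agent_id: str, cred: str) -> bool:
        return self._creds.get(agent_id) == cred

    def revoke(self, agent_id: str) -> None:
        """Kill one agent's access. Every other agent keeps working."""
        self._creds.pop(agent_id, None)
```

The same property is what makes per-agent audit logs possible: one identity, one trail.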

3. Implement the "Two-Person Rule" for writes.
If an agent wants to merge code, delete resources, or send money, a human should have to click "Approve." We aren't ready for fully autonomous write access yet. Not even close.

This isn't about slowing down innovation. It's about ensuring that when we look at that dashboard next year, we aren't terrified of what we see.


Sources & Attribution

Research findings synthesized from:

  • GitHub Engineering — "Under the Hood: Security Architecture of GitHub Agentic Workflows," published March 9, 2026.
  • Microsoft Security Blog — "Zero Trust for AI" (March 19), "Secure Agentic AI End-to-End" (March 20), "Observability for AI Systems" (March 20).
  • Scalekit — "API Access Patterns for AI Agents," published March 16, 2026.
  • Cloudflare Blog — "AI Security for Applications — Now Generally Available," published March 11, 2026.
  • Cisco Security Blog — "AI Incident Response," published March 18, 2026.

Key statistics: 82% executive confidence vs 14.4% full security approval rate sourced from enterprise AI security surveys cited in the Microsoft and Cisco posts. Fine-tuning attack bypass rates (72% Claude Haiku, 57% GPT-4o) from adversarial ML research referenced in the GitHub post.

