DEV Community

Cover image for Autonomous Cyberattacks Are Coming And Our Defenses Were Built for a Different Era
Arashad Dodhiya
Arashad Dodhiya

Posted on

Autonomous Cyberattacks Are Coming And Our Defenses Were Built for a Different Era

The shift isn't AI writing phishing emails. It's AI making decisions.

I've been thinking about a question a colleague raised at a security meetup a few months ago.

He wasn't asking about prompt injection or model poisoning. He asked something simpler: "What happens when attackers stop writing scripts and start deploying AI that figures things out on its own?"

Nobody had a great answer. And honestly, that bothered me more than if someone had said something wrong.

Let's Separate Automation From Autonomy

Automation in security isn't new. It's been around for decades. Scanners, botnets, credential stuffing tools — all of these automate tasks. Someone still decided what the task was, coded it up, and hit run. When it broke, a human fixed it.

Autonomous agents are different in one specific way: they decide what to do next.

Not because they're sentient. Not because they've "learned to think." But because modern AI systems are genuinely good at planning, adapting, and pursuing an objective across multiple steps without needing a human to intervene at each one.

That distinction — task automation versus goal-directed decision-making — is what makes this conversation worth having.

What an Autonomous Attack Actually Looks Like

Forget Hollywood. The realistic version isn't dramatic.

A traditional attack script is linear. Scan this range. If port 443 is open, try this. If that fails, stop. The attacker reviews the output, adjusts, and tries again. Human in the loop throughout.

An agent-driven approach works differently. You give it an objective — "find a way into this network" — and it works toward that goal iteratively. It tries one approach, sees what comes back, decides what to try next. If a direct path is blocked, it looks for another one. If it finds a misconfigured service, it explores that. It doesn't wait for instructions between steps.

The individual techniques aren't new. Most are documented in public vulnerability databases and attack frameworks. What's different is who's choosing which one to try next, and how long they keep trying.

The answer, increasingly, is: nobody. The system just keeps going.

Scale Is the Thing People Are Underestimating

New exploits get a lot of attention. Zero-days, novel techniques, sophisticated malware — these make headlines.

But most breaches don't involve any of that. They involve known vulnerabilities in unpatched software, misconfigured cloud storage, reused passwords, and services that were left exposed because nobody got around to turning them off.

Those problems exist everywhere. And right now, finding them at scale requires human effort. You have to pick which organizations to look at, which systems to probe, which exposed services to investigate.

Autonomous agents remove that constraint. If the cost of targeting an organization drops close to zero — because no human attention is required — then the economics change for every organization connected to the internet. Not just enterprises. Not just government agencies. Everyone.

That's the part of this discussion that tends to get skipped.

The Unreliability Argument Is a Moving Target

The pushback I hear most often: "AI is too unreliable. It hallucinates. It makes mistakes."

Fair enough. Current models do make mistakes. But this argument proves less than it seems to.

Attackers have never needed perfect tools. Spam was unreliable — it still worked. Early ransomware had bugs — it still caused billions in damage. Credential stuffing fails most of the time — the small percentage that succeeds is enough to make it worth running.

The threshold for "good enough" in offensive security is much lower than most people assume. An autonomous agent that succeeds 15% of the time, running continuously across thousands of targets, is a different kind of problem than a human attacker with the same success rate.

Reliability will improve. The question is what we're doing with the defenses we have right now, before it does.

Defenders Are Building This Too

I want to be clear that this isn't a one-sided shift.

Security teams are deploying autonomous agents for investigation, triage, and response. AI that can correlate alerts across a sprawling environment, draft detection rules, and identify which of 10,000 daily events actually warrants a human looking at it — that's real, it's in production at organizations right now, and it's genuinely useful.

The interesting thing is that both sides are pulling from the same toolkit. The same frameworks for building goal-directed AI agents, the same underlying models, the same cloud infrastructure.

Which means the competition isn't really about who has AI. Both sides have it or will have it soon. The competition is about who uses it better — who has cleaner data feeding it, better processes around it, and humans making the right calls when judgment actually matters.

The Assumption Problem

Here's what I think is the real issue, and it's less about AI than it sounds.

Most of our security controls were designed around human attackers. Rate limiting assumes human typing speed. Security awareness training assumes someone on the other end making a social judgment. Incident response runbooks assume an attack unfolds over hours, with humans making decisions at each stage.

Autonomous agents don't operate on human timelines. They don't get frustrated after five failed attempts. They don't take weekends off. They don't decide a target isn't worth the effort.

That's not a guarantee they'll succeed. Plenty of factors still work in defenders' favor. But it does mean that a lot of assumptions embedded in existing security architectures deserve to be revisited — not in a panic, but honestly.

Rate limits set for human speed may be too slow. Alerting thresholds calibrated for human attacker behavior may miss different patterns. Response playbooks designed for human attacker decision points may not map cleanly onto agent-driven campaigns.

These are solvable problems. But you have to notice they exist first.

What to Actually Do About This

I'm not going to end this with a tidy action plan, because I think anyone who gives you one is overstating their certainty.

What I will say is this: the organizations that will handle this transition best aren't the ones with the largest security budgets. They're the ones that are already thinking clearly about the difference between automating tasks and reasoning about goals — both in how they build their own defenses and in how they model what an attacker might actually do.

That means stress-testing assumptions, not just tooling. It means asking "what would a system that doesn't get tired do differently than a human attacker?" It means taking autonomous agents seriously as a threat model even before there's a confirmed case in the wild that makes it undeniable.

The history of cybersecurity is pretty consistent on this: the organizations that wait for confirmed evidence before updating their mental models are always playing catch-up.

A Better Question

The conversation in security tends to stay at "can AI do this?" And the answer is increasingly yes, for more things, more reliably.

The more useful question is: how do we build defenses that don't assume a human is on the other end of every attack?

That's not a solved problem. It's probably the most important open question in defensive security right now.

And the fact that not many people are talking about it seriously yet is, itself, something worth thinking about.

Where do you stand on this — are autonomous attacks already a real operational threat, or are we still a few years out from anything that genuinely changes the defensive calculus? Genuinely curious what people working in security day-to-day are seeing.

Top comments (2)

Collapse
 
arashad_dodhiya_0e4bdba5a profile image
Arashad Dodhiya

Honest question for the room: are we jumping ahead of reality here? I'm genuinely uncertain whether autonomous agent-driven attacks are already operational or still mostly a research/proof-of-concept thing. If anyone has visibility into what's actually being used in the wild right now, I'd love to hear what you're seeing.🤔

Collapse
 
alexshev profile image
Alex Shev

The defense gap is mostly speed plus verification. If attackers automate reconnaissance and chaining, defenders need controls that can verify intent and stop unsafe actions before review, not dashboards that explain the incident afterward.