You're halfway through an engagement, staring at a terminal full of Nmap output, and you think: "I wish I could just ask an AI to help me parse this." So you paste it into ChatGPT. Then you realize you just sent your client's internal network topology to a third-party API.
I've seen this happen more times than I'd like to admit — sometimes to me, sometimes to colleagues. The problem isn't that AI is bad for pentesting. It's genuinely useful for parsing scan output, suggesting next steps, and even helping draft reports. The problem is that most AI-powered workflows send sensitive engagement data to cloud endpoints, which is a confidentiality nightmare.
Let's walk through how to solve this by running your AI pentesting assistant entirely locally.
Why Cloud-Based AI Is a Problem for Security Work
This isn't theoretical hand-wringing. Most penetration testing contracts include strict data handling clauses. When you send scan results, credentials, or network maps to a cloud LLM provider, you're potentially violating:
- NDAs and MSAs — client data leaving your controlled environment
- Compliance requirements — PCI-DSS, HIPAA, and SOC 2 all have opinions about where data goes
- Your own operational security — if you're testing a target, you probably don't want a third party knowing about it
The fix is straightforward in principle: run the LLM locally. In practice, getting a useful local AI assistant wired into your pentesting workflow takes some setup.
The Architecture: Local LLM + Tool Integration
The general pattern for a local AI pentesting assistant looks like this:
```
┌─────────────────────────────────────┐
│   Your Pentesting OS (Parrot/Kali)  │
│                                     │
│  ┌───────────┐    ┌──────────────┐  │
│  │ Local LLM │◄──►│  Assistant   │  │
│  │ (Ollama)  │    │  Framework   │  │
│  └───────────┘    └──────┬───────┘  │
│                          │          │
│            ┌─────────────┼────────┐ │
│            ▼             ▼        ▼ │
│          Nmap          Nikto   Burp │
│       Metasploit      SQLMap    ... │
└─────────────────────────────────────┘
```
The key components are:
- A local LLM runtime — Ollama is the most common choice for running models like Llama, Mistral, or CodeLlama on your own hardware
- A coordination layer — something that takes your natural language input, decides which tools to run, and feeds results back to the LLM
- Standard pentesting tools — the same Nmap, Metasploit, Nikto, etc. you already use
Projects like METATRON are exploring this exact pattern — building an AI-powered pentesting assistant that runs against a local LLM on Linux (specifically Parrot OS). The idea is to keep everything on your machine.
Step 1: Setting Up a Local LLM with Ollama
First, you need a local model runtime. Ollama makes this almost trivially easy:
```shell
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull a model that's good at reasoning and code
# Mistral 7B is a solid starting point for modest hardware
ollama pull mistral

# Verify it's running
ollama list
```
Hardware matters here. For a 7B parameter model, you'll want at least 8GB of RAM (16GB preferred). If you have a decent GPU, Ollama will use it automatically. On a CPU-only setup, expect slower responses but it still works.
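The RAM figure isn't arbitrary: a quantized model needs roughly its parameter count times bits per weight, plus runtime overhead for the KV cache and buffers. Here's a sketch of that arithmetic (the 20% overhead factor is a ballpark assumption of mine, not an Ollama-documented figure):

```python
def model_memory_gb(params_billion, bits_per_weight=4, overhead=1.2):
    """Rough RAM estimate for a quantized model: weight bytes plus ~20%
    overhead for KV cache and runtime buffers. A heuristic, not an exact figure."""
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9

print(f"7B @ 4-bit: ~{model_memory_gb(7):.1f} GB")    # ~4.2 GB
print(f"13B @ 4-bit: ~{model_memory_gb(13):.1f} GB")  # ~7.8 GB
```

A 7B model at 4-bit quantization lands around 4GB for the model alone, which is why 8GB of system RAM is the practical floor.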
You can test it quickly:
```shell
# Quick sanity check — ask it something security-related
curl http://localhost:11434/api/generate -d '{
  "model": "mistral",
  "prompt": "Explain what a SYN scan does in one paragraph",
  "stream": false
}' | python3 -m json.tool
```
If you get a coherent response about TCP handshakes, you're in business.
Step 2: The Coordination Problem
Here's where it gets interesting — and where most people get stuck. Having a local LLM is great, but you need something that can:
- Accept your natural language input ("scan this subnet for web servers")
- Translate that into actual tool commands (`nmap -sV -p 80,443,8080 192.168.1.0/24`)
- Execute the commands safely
- Feed the output back to the LLM for analysis
- Suggest next steps based on findings
This is the agent pattern, and building it from scratch is non-trivial. You need to handle tool calling, output parsing, context management (LLMs have finite context windows), and, critically, safety guardrails so the AI doesn't run `rm -rf /` on your machine.
Projects like METATRON aim to provide this coordination layer out of the box, specifically tailored for security tools on Linux distros like Parrot OS. If you're evaluating tools like this, here's what to look for:
- Does it sandbox command execution? You don't want an LLM with unrestricted shell access
- Does it actually run locally? Check that no API calls are being made to external services
- How does it handle context? Scan output can be massive — the tool needs to summarize or chunk it intelligently
- Is it transparent? You should see every command before it runs
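For the "does it actually run locally" check, you don't have to take a README's word for it. Here's a minimal sketch that lists non-loopback TCP peers by parsing `/proc/net/tcp` (Linux only; the helper names are my own) — run it while the assistant is active and scrutinize any remote endpoint that shows up:

```python
def hex_to_ip(hex_ip):
    """Convert the little-endian hex IPv4 address format used in /proc/net/tcp."""
    b = bytes.fromhex(hex_ip)
    return ".".join(str(x) for x in reversed(b))

def remote_peers(path="/proc/net/tcp"):
    """Return (ip, port) pairs for remote TCP endpoints, ignoring loopback.
    For IPv6 coverage you'd also want /proc/net/tcp6."""
    peers = []
    with open(path) as f:
        next(f)  # skip the header line
        for line in f:
            fields = line.split()
            rem_ip_hex, rem_port_hex = fields[2].split(":")  # rem_address column
            ip = hex_to_ip(rem_ip_hex)
            port = int(rem_port_hex, 16)
            # Ignore loopback and the 0.0.0.0 wildcard (listening sockets)
            if ip not in ("127.0.0.1", "0.0.0.0"):
                peers.append((ip, port))
    return peers

if __name__ == "__main__":
    for ip, port in sorted(set(remote_peers())):
        print(f"outbound peer: {ip}:{port}")
```

If the only peer you ever see is your own Ollama instance on localhost, the tool is behaving. `tcpdump` gives you a stronger guarantee, but this requires no root and no extra packages.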
Step 3: Building a Minimal Version Yourself
If you want to understand the mechanics (or need something custom), here's a stripped-down Python example that talks to Ollama and wraps Nmap:
```python
import shlex
import subprocess

import requests

OLLAMA_URL = "http://localhost:11434/api/generate"

def ask_llm(prompt, model="mistral"):
    """Send a prompt to the local Ollama instance."""
    resp = requests.post(OLLAMA_URL, json={
        "model": model,
        "prompt": prompt,
        "stream": False
    })
    return resp.json()["response"]

def run_nmap(target, flags="-sV"):
    """Run nmap with the given flags. Target must be validated first."""
    # Basic input validation — never trust LLM output directly
    if any(c in target for c in [";", "|", "&", "`", "$"]):
        raise ValueError("Suspicious characters in target")
    # Build an argv list rather than a shell string, so nothing touches a shell
    cmd = ["nmap"] + shlex.split(flags) + [target]
    print(f"[*] Running: {' '.join(cmd)}")  # Always show what's being executed
    result = subprocess.run(
        cmd,
        capture_output=True, text=True, timeout=300
    )
    return result.stdout

# Example workflow
target = "192.168.1.0/24"  # Your authorized test target
scan_output = run_nmap(target)
analysis = ask_llm(
    f"Analyze this Nmap scan output. Identify open services, "
    f"potential vulnerabilities, and suggest next steps.\n\n{scan_output}"
)
print(analysis)
```
This is intentionally simple. A production-grade version needs much more robust input sanitization, proper error handling, and ideally a confirmation step before executing any command the LLM suggests.
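That confirmation step can be as small as one function: never hand the LLM a shell, and gate every suggested command behind an explicit yes/no prompt. A sketch (`confirm_and_run` is a hypothetical helper, not part of any framework):

```python
import subprocess

def confirm_and_run(cmd_list):
    """Show the exact command and require explicit approval before running it.

    cmd_list is an argv list, e.g. ["nmap", "-sV", "10.0.0.5"] — passing a list
    (not a string) to subprocess.run keeps shell metacharacters inert.
    """
    print(f"[?] The assistant wants to run: {' '.join(cmd_list)}")
    answer = input("    Execute? [y/N] ").strip().lower()
    if answer != "y":
        print("[-] Skipped.")
        return None  # caller sees the suggestion was rejected
    result = subprocess.run(cmd_list, capture_output=True, text=True, timeout=300)
    return result.stdout
```

Anything the LLM proposes goes through this choke point, which also gives you a natural place to hang logging.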
Common Pitfalls and How to Avoid Them
Model too small, output is garbage. A 3B parameter model will struggle with complex security analysis. Start with 7B minimum, go to 13B+ if your hardware allows it. The tradeoff is speed vs. quality.
Context window overflow. A full Nmap scan of a /16 subnet produces megabytes of output. You can't dump all of that into a 4K context window. Solutions: summarize scan output before sending it to the LLM, or use models with larger context windows (some support 32K+ tokens).
The LLM hallucinates CVEs. This is the big one. Local LLMs will confidently tell you that Apache 2.4.49 is vulnerable to CVE-XXXX-YYYY, and sometimes that CVE doesn't exist. Always cross-reference suggested vulnerabilities against actual CVE databases. Treat LLM output as suggestions, not findings.
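The cross-referencing can be automated against NVD's public CVE API (the 2.0 REST endpoint is real and unauthenticated, though rate-limited; the helper name here is my own):

```python
import re

NVD_URL = "https://services.nvd.nist.gov/rest/json/cves/2.0"
CVE_RE = re.compile(r"^CVE-\d{4}-\d{4,}$")

def cve_exists(cve_id):
    """Check a CVE id the LLM mentioned against NVD before it reaches a report."""
    if not CVE_RE.match(cve_id):
        return False  # malformed ids are hallucinations by definition
    import requests  # deferred so the format check works without the dependency
    resp = requests.get(NVD_URL, params={"cveId": cve_id}, timeout=30)
    resp.raise_for_status()
    return resp.json().get("totalResults", 0) > 0
```

Existence is only half the check: the CVE may be real but apply to a different version than the one you found, so read the actual advisory before writing it up.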
Forgetting authorization. This should be obvious, but running AI-assisted or not, you need written authorization before scanning anything. The AI doesn't make unauthorized testing any more legal.
Prevention: Building Good Habits
Whether you use an existing tool or build your own:
- Air-gap when possible. For the most sensitive engagements, run your AI assistant on a machine with no internet access after downloading the model
- Audit your tools. Before using any open-source AI pentesting assistant, read the source. Check for telemetry, external API calls, or data exfiltration
- Log everything. Keep a record of what the AI suggested vs. what you actually ran. This matters for your pentest report
- Don't blindly trust output. The AI is a junior analyst that reads fast but makes things up. Verify everything
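The "log everything" habit can be as simple as one JSON line per suggestion, recording what the AI proposed versus what you actually ran. A sketch (the file name and schema are my own choices):

```python
import json
import time

LOG_PATH = "assistant_audit.jsonl"  # hypothetical log location

def log_event(suggested, executed, note=""):
    """Append one JSON line per AI suggestion; executed=None means rejected."""
    entry = {
        "ts": time.strftime("%Y-%m-%dT%H:%M:%SZ", time.gmtime()),
        "suggested": suggested,
        "executed": executed,
        "note": note,
    }
    with open(LOG_PATH, "a") as f:
        f.write(json.dumps(entry) + "\n")
```

An append-only JSONL file is trivial to grep when you're writing the report, and the suggested-versus-executed distinction is exactly what you need if a client ever asks what the AI did on their network.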
Wrapping Up
The core problem — AI is useful for pentesting but cloud APIs are a data leak risk — has a clean solution: run it locally. Tools like Ollama have made local LLM inference accessible, and projects like METATRON are building the pentesting-specific coordination layer on top.
The space is still early. Most of these tools are experimental, and you should treat them accordingly. But the direction is clear: local AI assistants that understand security tooling, run on your hardware, and keep client data where it belongs.
Just please, stop pasting recon output into ChatGPT.