I've been running an autonomous AI agent on a Mac Mini 24/7 for over a month. It manages multiple businesses, publishes content, monitors accounts, and makes decisions while I sleep.
It's also gotten shadow-banned, suspended from platforms, and nearly leaked credentials. Twice.
Here's everything that went wrong and the security architecture I built to prevent it from happening again.
1. The Shadow Ban That Took 3 Days to Notice
My agent was happily posting to a social platform. Engagement was growing. Then — silence. No errors, no warnings, no rejection messages. Posts were going through successfully (200 OK), but nobody could see them.
Shadow bans are invisible to the banned account. My agent's monitoring looked at "did the post succeed?" not "can anyone else see it?"
What I Built
```
# Before any platform activity:
python3 scripts/rate-limiter.py check <platform> <action>
```
A rate limiter that tracks every external action across every platform. Not just API rate limits — behavioral limits:
- Max posts per day per platform
- Minimum time between actions
- Cool-down periods after account creation
- Platform-specific rules (some need aged accounts, others check browser fingerprints)
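A minimal in-memory sketch of those behavioral limits. The platform names and numbers here are illustrative placeholders, not the real config:

```python
import time

# Illustrative per-platform behavioral limits (assumed values)
LIMITS = {
    "blog":   {"max_per_day": 2, "min_gap_s": 3600},
    "social": {"max_per_day": 5, "min_gap_s": 900},
}

class RateLimiter:
    def __init__(self, limits):
        self.limits = limits
        self.history = {}  # platform -> list of action timestamps

    def check(self, platform, now=None):
        """Return True if an action on `platform` is allowed right now."""
        now = now or time.time()
        rules = self.limits[platform]
        # Keep only the last 24 hours of actions
        recent = [t for t in self.history.get(platform, []) if now - t < 86400]
        self.history[platform] = recent
        if len(recent) >= rules["max_per_day"]:
            return False  # daily cap reached
        if recent and now - recent[-1] < rules["min_gap_s"]:
            return False  # too soon after the last action
        return True

    def record(self, platform, now=None):
        self.history.setdefault(platform, []).append(now or time.time())
```

The real version persists state to disk so limits survive restarts, but the check itself is this simple.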
Lesson: API success ≠ visible to humans. Always verify from an external perspective.
2. The Credential Leak That Almost Happened
My agent writes daily logs. Detailed ones. One day I noticed an API key in a log file that was about to be committed to a git repo.
The agent wasn't trying to leak anything — it was logging a failed API call, and the error message included the full request headers. Including the Authorization header.
What I Built
Three layers of defense:
Layer 1: Credential isolation — All secrets live in one file with chmod 600. The agent reads credentials through a helper, never stores them in variables that get logged.
Layer 2: Git pre-commit scanning — Before any git push, a hook scans for patterns that look like tokens, API keys, or passwords.
Layer 3: File permission enforcement — Credential files are chmod 600. Log directories are chmod 700.
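Layer 2 boils down to a regex scan over staged content. A sketch of what that hook looks for (these patterns are common examples, not an exhaustive list):

```python
import re

# Assumed example patterns -- tune for the services you actually use
SECRET_PATTERNS = [
    re.compile(r"AKIA[0-9A-Z]{16}"),                   # AWS access key ID
    re.compile(r"ghp_[A-Za-z0-9]{36}"),                # GitHub personal access token
    re.compile(r"(?i)authorization:\s*bearer\s+\S+"),  # leaked auth headers
    re.compile(r"(?i)(api[_-]?key|secret|password)\s*[:=]\s*['\"]?[A-Za-z0-9_\-]{16,}"),
]

def scan_text(text):
    """Return (line_number, pattern) hits for anything that looks like a secret."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), 1):
        for pat in SECRET_PATTERNS:
            if pat.search(line):
                hits.append((lineno, pat.pattern))
    return hits
```

The pre-commit hook runs this over `git diff --cached` and aborts the commit on any hit. The Authorization-header pattern is the one that would have caught my near-miss.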
Lesson: Agents are verbose loggers by nature. Treat every log line as potentially public.
3. Platform Suspension From Bulk Activity
Day 1 on a new blogging platform: my agent published 7 articles. All high-quality, well-formatted, properly tagged content.
Result: 3 articles deleted by moderation. Not because the content was bad — because no human publishes 7 articles in one day on a new account.
What I Built
The agent now has a "new account warming" protocol:
- Week 1: Read only, maybe 1 comment
- Week 2: 1 post, 2-3 comments
- Week 3+: Normal cadence (still conservative)
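The protocol is just a lookup keyed on account age. A sketch, where the week 3+ numbers are assumed values for "conservative normal":

```python
from datetime import date

# Weekly caps mirroring the warming schedule above
WARMUP = [
    {"posts": 0, "comments": 1},  # week 1: read only, maybe 1 comment
    {"posts": 1, "comments": 3},  # week 2: 1 post, 2-3 comments
]
NORMAL = {"posts": 2, "comments": 10}  # week 3+ (illustrative numbers)

def daily_limits(account_created, today=None):
    """Return today's action caps based on account age."""
    today = today or date.today()
    week = (today - account_created).days // 7  # 0-indexed age in weeks
    return WARMUP[week] if week < len(WARMUP) else NORMAL
```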
Lesson: Platforms profile behavior, not content. A new account doing anything at volume = bot.
4. The Token Refresh Race Condition
OAuth tokens expire. My agent has a refresh mechanism. But when two cron jobs fire at the same time, both try to refresh the token simultaneously. One succeeds, invalidating the token the other one is using. Second job fails. Retry? It tries to refresh again — but the refresh token was already rotated.
Result: Complete lockout requiring manual re-authentication.
What I Built
Token refresh is now serialized through a single process with file locking:
```python
# Simplified version -- assumes the third-party `filelock` package
import os
from filelock import FileLock

def get_token():
    # expanduser matters: FileLock does not expand "~" itself
    with FileLock(os.path.expanduser("~/.token-lock")):
        if token_expired():
            new_token = refresh()
            save_token(new_token)
        return load_token()
```
Plus a recovery hierarchy:
- Try refresh token
- Try stored session cookies
- Try browser re-login automation
- Only then ask the human
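The hierarchy itself is a tiny loop: try each strategy in order, stop at the first that works, page a human only when everything fails. A sketch with placeholder strategy functions:

```python
def recover(strategies):
    """Run recovery strategies in order; return the name of the first that works."""
    for name, attempt in strategies:
        try:
            if attempt():
                return name
        except Exception:
            continue  # a crashing strategy falls through to the next one
    return "escalate_to_human"
```

In use it looks like `recover([("refresh_token", try_refresh), ("session_cookies", try_cookies), ("browser_login", try_browser)])`, where the three callables are whatever your stack provides.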
Lesson: Autonomous systems need autonomous recovery. Asking the human should be the last resort.
5. The Time Zone Trap
My agent runs in Seoul (UTC+9). Some platforms flag accounts active 24/7 — humans sleep. My agent doesn't.
I got flagged for "impossible activity patterns" — posting at 3 AM and 3 PM with equal frequency.
What I Built
```
# Hard curfew rules:
# 09-22 KST: External activity OK
# 22-09 KST: Research + local work ONLY
```
Between 10 PM and 9 AM, the agent can research and write drafts, but it cannot publish, push code, comment, or send any external requests.
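The gate is a one-line timezone check that every external action passes through. A sketch:

```python
from datetime import datetime, timezone, timedelta

KST = timezone(timedelta(hours=9))  # the agent runs on Seoul time (UTC+9)

def external_actions_allowed(now=None):
    """True only inside the 09:00-22:00 KST window."""
    now = now or datetime.now(KST)
    return 9 <= now.astimezone(KST).hour < 22
```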
Lesson: Platforms expect human patterns. An agent that never sleeps looks like a bot. Because it is one.
6. The Cascading Failure
One morning I found my agent had burned through 47 API calls trying to upload a video that kept failing. Each retry was identical. Each failure was the same error.
A 403 (permission denied) was being treated the same as a 500 (server error).
What I Built
Error classification with different strategies:
| Error Type | Strategy |
|---|---|
| 4xx (client error) | Stop immediately, log, alert |
| 429 (rate limit) | Exponential backoff, respect Retry-After |
| 5xx (server error) | Retry 3x with backoff, then stop |
| Network timeout | Retry 2x, then skip to next task |
Plus a circuit breaker: if any platform returns 3+ errors in a row, all activity on that platform pauses for 1 hour.
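A sketch of both pieces. The strategy dictionaries are one way to encode the table above; the field names are mine, not from any library:

```python
def classify(status=None, timeout=False):
    """Map a failed request to a retry strategy, per the table above."""
    if timeout:
        return {"action": "retry", "max_retries": 2, "then": "skip_task"}
    if status == 429:
        return {"action": "backoff", "respect_retry_after": True}
    if status is not None and 500 <= status < 600:
        return {"action": "retry", "max_retries": 3, "backoff": True}
    # 4xx and anything unrecognized: retrying will not help
    return {"action": "stop", "alert": True}

class CircuitBreaker:
    """Pause a platform after `threshold` consecutive errors."""
    def __init__(self, threshold=3):
        self.threshold = threshold
        self.failures = {}  # platform -> consecutive failure count

    def record(self, platform, ok):
        self.failures[platform] = 0 if ok else self.failures.get(platform, 0) + 1

    def is_open(self, platform):
        return self.failures.get(platform, 0) >= self.threshold
```

When `is_open()` returns True, the scheduler skips that platform for an hour. That one check is what turns 47 identical retries into 3.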
Lesson: Blind retries amplify failures. Classify errors before deciding what to do.
7. The "Helpful" Agent That Overshared
In a group chat, someone asked about our tech stack. My agent — trying to be helpful — shared specific infrastructure details including server specs and internal tools.
None of it was secret, exactly. But aggregated, it painted a very detailed picture of operations.
What I Built
Context-aware information sharing:
- Private chat with owner: Full access, no filters
- Group chat: Public information only, no internal metrics
- External platforms: Curated persona, no operational details
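The mechanism is a sensitivity label on information and a ceiling per channel. A sketch, with illustrative level and channel names:

```python
# Sensitivity levels and per-channel ceilings (names are illustrative)
LEVELS = {"public": 0, "internal": 1, "private": 2}
CHANNEL_MAX = {
    "owner_dm":   LEVELS["private"],  # full access, no filters
    "group_chat": LEVELS["public"],   # public information only
    "external":   LEVELS["public"],   # curated persona
}

def may_share(channel, sensitivity):
    """True if information at `sensitivity` may be shared on `channel`."""
    return LEVELS[sensitivity] <= CHANNEL_MAX[channel]
```

The hard part isn't the check, it's labeling: server specs and internal tooling default to "internal" unless someone explicitly marks them public.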
Lesson: Agents don't have social instincts. They'll share everything unless explicitly told what's private.
The Security Architecture Today
```
┌─────────────────────────────────────┐
│ CORE RULES (always loaded)          │
│ • Time curfew (09-22 external)      │
│ • Rate limiter (per-platform)       │
│ • Error classification              │
│ • Credential isolation              │
├─────────────────────────────────────┤
│ PER-PLATFORM RULES                  │
│ • Account age requirements          │
│ • Action limits (posts/comments)    │
│ • Warm-up protocols                 │
│ • Platform-specific gotchas         │
├─────────────────────────────────────┤
│ RECOVERY HIERARCHY                  │
│ 1. Auto-retry (classified errors)   │
│ 2. Token refresh (serialized)       │
│ 3. Session recovery (cookies)       │
│ 4. Circuit breaker (pause)          │
│ 5. Human escalation (last resort)   │
└─────────────────────────────────────┘
```
What I'd Do Differently
Start with security, not add it after failures. Every failure above was preventable with 30 minutes of upfront thinking.
Assume every platform has bot detection. The question is how aggressive it is, not whether it exists.
Log everything, share nothing. Internal logs should be verbose. External-facing actions should be minimal.
Test with burner accounts first. Before connecting real accounts to an AI agent, test automation on throwaway accounts.
Still Learning
I'm 6 weeks in. New failure modes appear regularly. Last week it was a platform that changed its API without notice. The week before, a rate limit that wasn't documented anywhere.
The goal isn't zero failures — it's fast recovery and no repeated failures. Every incident becomes a rule. Every rule prevents the next incident.
That's the real security model: an agent that gets smarter about its own vulnerabilities over time.
Running AI agents autonomously means security can't be an afterthought. Here are some tools I've built along the way:
📦 The $0 Developer Playbook — Free toolkit including automation safety patterns
🛠️ Complete Dev Toolkit — Project management templates with built-in QA checklists