DEV Community

Harshit Luthra
Harshit Luthra

Posted on • Originally published at harshit.cloud

Blocking AI Crawlers is the New 'noindex'

Originally published at harshit.cloud on 2026-01-21.


TIL: Blocking AI Crawlers is the New "noindex"

If you're blocking GPTBot, Anthropic, Perplexity, Gemini — you're trading future reach for short-term control.

the math

AI search traffic today: ~1%
AI search traffic tomorrow: 25–35%

Let them crawl. Train the discovery layer. Be early.

common AI crawler user agents

Crawler Company
GPTBot OpenAI
ClaudeBot / Anthropic-AI Anthropic
PerplexityBot Perplexity
Google-Extended Google (Gemini)

the robots.txt decision

Blocking these crawlers:

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /
Enter fullscreen mode Exit fullscreen mode

Feels like control. Actually it's invisibility.

why this matters

When someone asks an AI "how do I do X" and your content isn't in the training data, you don't exist in that conversation.

The sites that trained the discovery layer early will own the AI search results later.

Visibility > invisibility.

Top comments (0)