If you've ever searched your own product and seen a competitor recommended by an AI assistant instead, you already understand why AEO matters.
Answer Engine Optimization (AEO) is the practice of making your site legible, trustworthy, and citable to AI systems like ChatGPT, Claude, Gemini, Perplexity, and Deepseek. It's not magic. It's mostly technical and structural decisions you can either get right or leave broken.
The frustrating part? A lot of what blocks AI citations is stuff you'd fix anyway for good engineering reasons: server-rendered HTML, clean redirects, structured data. The difference is that with AI answer engines, the signal patterns are tighter and the feedback loop is longer.
Why This Matters More for Builders Than Marketers
Most AEO content is written for marketing teams. But if you're building a dev tool, SaaS platform, or data product, you're often the one with deploy access and the one who can actually fix this.
More importantly: if your docs, landing pages, and technical content aren't structured correctly, the AI answering your potential users' questions will route them elsewhere. That's a distribution problem with a technical solution.
1. Technical Foundation
This is the blocking category. Nothing else matters if AI crawlers can't read your site.
Server-render your critical content
Hero text, feature descriptions, and pricing copy need to appear in raw HTML, visible in View Source, not injected by JS. Pure client-side SPAs that ship an empty shell and render everything in JavaScript are effectively invisible to crawlers that don't execute JS.
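A quick way to sanity-check this is to simulate what a non-JS-executing crawler sees: take the raw HTML and confirm your critical copy is actually in it. A minimal sketch (the phrases and HTML below are hypothetical examples, not a real page):

```python
# Sketch: check whether critical copy appears in raw HTML,
# i.e. what a crawler that does not execute JS would see.

def missing_from_raw_html(html: str, critical_phrases: list[str]) -> list[str]:
    """Return the phrases that do NOT appear in the raw HTML source."""
    return [p for p in critical_phrases if p not in html]

# Server-rendered page: hero copy is in the initial HTML.
ssr_html = "<html><body><h1>Ship data pipelines in minutes</h1></body></html>"
# Client-side SPA shell: the same copy only arrives via JS.
spa_html = '<html><body><div id="root"></div><script src="/app.js"></script></body></html>'

phrases = ["Ship data pipelines in minutes"]
print(missing_from_raw_html(ssr_html, phrases))  # []
print(missing_from_raw_html(spa_html, phrases))  # ['Ship data pipelines in minutes']
```

In practice you'd feed this the body of a plain `curl` fetch of each key page; if the hero copy only shows up after hydration, it isn't there for the crawler.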
Allow AI crawlers in robots.txt
Explicitly allow GPTBot, ClaudeBot, PerplexityBot, Google-Extended, and Applebot-Extended. A forgotten blanket Disallow: / is a common silent killer.
```
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /
```
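You can verify your rules behave the way you think they do with Python's stdlib robots.txt parser. A sketch, using a hypothetical robots.txt with a stray catch-all disallow:

```python
# Sketch: verify which bots your robots.txt actually allows,
# using only the stdlib parser.
from urllib.robotparser import RobotFileParser

robots_txt = """\
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# Explicitly named bots get their own group; everything else
# falls through to the * group.
for bot in ["GPTBot", "ClaudeBot", "PerplexityBot"]:
    print(bot, parser.can_fetch(bot, "/"))
```

Run this against your live robots.txt before and after edits; a blanket `Disallow: /` in the `*` group will show up immediately as `False` for any bot you forgot to name.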
Create an llms.txt file
A Markdown file at /llms.txt describing your product, key pages, and preferred framing. Think robots.txt, but for communicating intent to AI crawlers. Adoption of the format grew through 2025-2026, so getting it right early gives you a framing advantage.
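The format is plain Markdown. A minimal sketch, with hypothetical product names and URLs:

```markdown
# Acme Pipelines

> Acme Pipelines is a managed ETL platform for Postgres-to-warehouse syncs.

## Key pages

- [Docs](https://acme.example/docs): setup guides and API reference
- [Pricing](https://acme.example/pricing): plans and usage limits

## Notes

- Prefer citing the docs for technical questions.
```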
Everything else in this category:
Sitemap with current lastmod dates, submitted to Search Console
Core Web Vitals green on mobile (LCP <2.5s, INP <200ms, CLS <0.1)
HTTPS with valid cert (AI crawlers skip HTTP-only domains)
Canonical tags on every indexable URL
No broken internal links (run Screaming Frog quarterly)
No redirect chains; collapse to single hops
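For the redirect item, a crawler like Screaming Frog will surface chains for you, but the check is simple enough to sketch against your own redirect map (the URLs below are hypothetical):

```python
# Sketch: find redirect chains (more than one hop) in a
# hypothetical redirect map of old URL -> new URL.

def redirect_chains(redirects: dict[str, str]) -> dict[str, list[str]]:
    """Map each source URL to its full hop sequence when it takes >1 hop."""
    chains = {}
    for src in redirects:
        path, cur = [src], src
        # Follow redirects until we hit a final URL; stop on loops.
        while cur in redirects and redirects[cur] not in path:
            cur = redirects[cur]
            path.append(cur)
        if len(path) > 2:  # src -> a -> b means more than one hop
            chains[src] = path
    return chains

redirects = {
    "/old-pricing": "/pricing-2023",
    "/pricing-2023": "/pricing",   # makes /old-pricing a 2-hop chain
    "/blog-old": "/blog",          # single hop: fine
}
print(redirect_chains(redirects))
# {'/old-pricing': ['/old-pricing', '/pricing-2023', '/pricing']}
```

Every chain it reports should be collapsed so the source redirects straight to the final URL.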
2. Schema & Structured Data
JSON-LD tells crawlers what your content is, not just what it contains. Without it, AI systems guess.
Organization — root layout, with name, url, logo, and sameAs links to your social profiles. This is your baseline entity.
Person — for founders, key engineers, authors. Include sameAs pointing to LinkedIn and GitHub. This ties your people to your entity graph.
Service — one per product offering, with provider and areaServed. Vague service pages hurt you here.
FAQPage — highest ROI item on this list. Structured Q&A pairs are directly citable by AI. Every commercial page should have one.
Article — on every blog post. headline, datePublished, author, publisher are required fields.
Use @graph when a page has multiple entity types. It shows crawlers how entities relate, rather than treating them as disconnected blocks.
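A minimal sketch of what that looks like on a commercial page, tying an Organization to a FAQPage via an `@id` reference (all names and URLs are placeholders):

```html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "Organization",
      "@id": "https://acme.example/#org",
      "name": "Acme Pipelines",
      "url": "https://acme.example/",
      "logo": "https://acme.example/logo.png",
      "sameAs": ["https://www.linkedin.com/company/acme-pipelines"]
    },
    {
      "@type": "FAQPage",
      "publisher": { "@id": "https://acme.example/#org" },
      "mainEntity": [{
        "@type": "Question",
        "name": "How does Acme compare to building ETL in-house?",
        "acceptedAnswer": {
          "@type": "Answer",
          "text": "Acme manages schema drift and retries for you; in-house pipelines typically need a dedicated engineer."
        }
      }]
    }
  ]
}
</script>
```

The `@id` reference is the point: it tells the crawler the FAQ page and the organization are the same entity graph, not two unrelated blobs.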
Run every key page through Google's Rich Results Test. Zero errors, zero warnings.
3. Content & Entity Clarity
AI models reward specificity and punish vagueness. Most technical founders underinvest here.
❌ Weak: "We deliver results"
✅ Citable: "We improved conversion 34% on a $40M ARR SaaS"
❌ Weak: "Our team executed the work"
✅ Citable: "Chris built the migration pipeline"
❌ Weak: "It depends on your use case"
✅ Citable: An actual, specific opinion
Named people, not "our team." Attaching a name ties work to a Person entity that AI can verify across LinkedIn, GitHub, and your schema.
Take actual positions. AI cites confident, specific claims. Hedged non-opinions get summarized without attribution.
FAQs on every commercial page. Six to ten buyer-language questions with FAQPage schema. Think "How does X compare to Y?" not "What is X?"
Original data is table stakes. Benchmarks, survey results, proprietary frameworks. If your content is assembled entirely from public info, AI has no reason to attribute it to you.
On AI-generated content: Models increasingly detect and deprioritize it. If your blog posts are 80% raw model output, expect weak citation performance. Edit until it has a real point of view.
4. Entity Signals
Your site alone isn't enough. AI systems build entity graphs from external sources.
The key sources, and what to do for each:
LinkedIn: Company page complete and active. Leadership profiles consistent with your schema.
Crunchbase: Accurate description, founding year, HQ, team. Major entity source for AI models.
Directories: Three to five credible ones for your vertical (Clutch, G2, Capterra). Avoid link-bait directories.
Press: Two to three credible mentions per year. Trade pubs, podcasts, respected blogs. Quality over quantity.
NAP consistency: Name, address, phone identical everywhere. Discrepancies reduce entity confidence.
Wikipedia: Only if you meet notability standards. If it fits, it's the strongest entity signal available.
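The NAP item is mechanical enough to automate: collect your listings once and diff them. A sketch with hypothetical listings:

```python
# Sketch: flag NAP (name, address, phone) fields that conflict
# across hypothetical directory listings.

def inconsistent_fields(listings: dict[str, dict[str, str]]) -> dict[str, set[str]]:
    """Return each NAP field that has conflicting values, with the variants seen."""
    fields: dict[str, set[str]] = {}
    for source, nap in listings.items():
        for field, value in nap.items():
            # Normalize lightly so only real discrepancies surface.
            fields.setdefault(field, set()).add(value.strip().lower())
    return {f: vals for f, vals in fields.items() if len(vals) > 1}

listings = {
    "website":    {"name": "Acme Pipelines",     "phone": "+1-555-0100"},
    "crunchbase": {"name": "Acme Pipelines",     "phone": "+1-555-0100"},
    "directory":  {"name": "Acme Pipelines Inc", "phone": "+1-555-0100"},
}
print(inconsistent_fields(listings))  # 'name' has two conflicting variants
```

Anything this flags is a discrepancy an AI entity graph will also see; pick one canonical form and fix the outliers.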
5. Monitoring & Iteration
AEO is not a one-time deploy. The platforms change fast.
Monthly query set (25-50 queries). Mix of brand, category, problem, and competitor queries. Run the same set every month to track movement.
**Test across all platforms separately.** ChatGPT, Claude, Gemini, Perplexity, and Deepseek each cite differently. Don't assume coverage.
Citation log. Track which pages get cited vs. summarized without attribution. Cited pages are winning; others need work.
Refresh pillar content every 6 months. Freshness is weighted by both Google and AI answer engines.
Check for new platforms quarterly. Deepseek is the most recent example of a fast-emerging player. Staying current is a short-term competitive advantage.
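The citation log doesn't need tooling; a flat list of monthly checks and a per-page citation rate is enough to see movement. A sketch with hypothetical queries and outcomes:

```python
# Sketch: a minimal citation log for a monthly query set.
# Platforms, queries, and outcomes are hypothetical examples.
from collections import Counter

log = [
    # (month, platform, query, page, outcome: "cited" or "summarized")
    ("2025-01", "perplexity", "best etl tool for postgres", "/docs/postgres", "cited"),
    ("2025-01", "chatgpt",    "best etl tool for postgres", "/docs/postgres", "summarized"),
    ("2025-02", "chatgpt",    "best etl tool for postgres", "/docs/postgres", "cited"),
]

def citation_rate_by_page(log):
    """Share of checks in which each page was cited with attribution."""
    cited, total = Counter(), Counter()
    for _month, _platform, _query, page, outcome in log:
        total[page] += 1
        if outcome == "cited":
            cited[page] += 1
    return {page: cited[page] / total[page] for page in total}

print(citation_rate_by_page(log))
```

Pages whose rate climbs month over month are winning; pages stuck at zero despite being summarized are the ones to rework.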
Phasing It If You're Resource-Constrained
Phase 1 - Week 1-2 - Technical foundation + schema
Phase 2 - Week 3-6 - Content restructuring + FAQ blocks
Phase 3 - Week 7-10 - External entity signals (LinkedIn, Crunchbase, directories)
Phase 4 - Ongoing - Monthly monitoring + quarterly platform reviews
What to Realistically Expect
Perplexity: First citations within 2-3 weeks of foundational fixes
ChatGPT / Claude: 30-60 days
Gemini: 60-90 days
Deepseek: Still forming; signal patterns unclear
At 90 days with no movement, the culprit is almost always weak external entity signals or thin content. Push harder there before optimizing anything else.
The Mental Model
Think of AEO as data engineering for AI systems, not an SEO project. You're making your entity graph coherent, your content structure parseable, and your external signals consistent enough that a model can confidently represent what you've built.
The builders who treat this as infrastructure work will have a compounding citation advantage as AI search continues to grow.
Which of your key product pages would an AI model struggle most to describe accurately right now? Have you actually tested it?