DEV Community

Cover image for How to Conduct a Technical SEO Site Audit
Tsotne Bukiya
Tsotne Bukiya

Posted on • Originally published at hotpress.ai

How to Conduct a Technical SEO Site Audit

How to Conduct a Technical SEO Site Audit

Your content might be excellent. Your backlinks might be strong. But if Google can't crawl, render, or index your pages properly, none of it matters.

Knowing how to conduct a technical SEO site audit gives you the answer. It strips away the marketing layer and examines the infrastructure underneath — can search engines, and now AI crawlers, actually access and understand your site?

72% — of websites fail at least one critical technical SEO factor (Semrush 2025 State of Search)

Most founders skip this work. They publish articles, build links, and wonder why rankings stall. The answer is usually buried in their technical foundation — broken canonicals, slow pages, blocked resources, or orphaned URLs that no crawler ever finds.

Here's how we conduct a technical SEO site audit — step by step — before touching a single piece of content.

What You'll Find When You Conduct a Technical SEO Site Audit

Think of it as a diagnostic scan for your site's relationship with Google. Not a compliance checkbox. A growth diagnostic.

You'll uncover three categories of problems:

  • Access issues — pages Google can't crawl or chooses not to index
  • Performance issues — pages that load too slowly or fail Core Web Vitals
  • Structure issues — poor internal linking, missing schema, duplicate content

The order matters. Fix access first. A fast page that Google can't find is worthless. A slow page that Google indexes will at least show up — you can improve speed later.

If you've never run a technical SEO audit before, expect 15-30 issues on a typical site. Don't panic. Most have straightforward fixes, and you only need to prioritize the ones blocking crawling or indexation.

Before You Start: Tools and Setup

You don't need expensive enterprise software. Here's the minimum toolkit:

  • Screaming Frog (free up to 500 URLs) — your primary crawling tool
  • Google Search Console — the source of truth for indexation and Core Web Vitals
  • PageSpeed Insights — for page-level performance diagnostics
  • Chrome DevTools — for real-time rendering and network inspection

For sites under 500 pages — and most SaaS marketing sites fall here — the free tier covers everything. Larger sites benefit from Screaming Frog's paid license ($259/year) or one of the best SEO tools we've reviewed.

Set up a spreadsheet to track findings. Columns: issue, severity (critical/medium/low), affected URLs, fix. You'll thank yourself during prioritization.

Step 1: Run a Full Site Crawl

The first step when you conduct a technical SEO site audit is crawling your entire site with Screaming Frog. Point it at your homepage and let it follow every internal link.

What you're looking for:

  • HTTP status codes — anything returning 4xx or 5xx needs attention. A handful of 404s won't kill you, but 50+ broken internal links will.
  • Redirect chains — pages that redirect to pages that redirect again. Each hop bleeds PageRank and adds latency.
  • Orphan pages — URLs in your sitemap that no internal link points to. If you can't reach them from navigation, Google probably deprioritizes them too.

Export the crawl results and filter by status code. Fix all 5xx errors immediately — those indicate server failures. Then tackle 4xx errors that have inbound links pointing to them.

Compare the crawl against your XML sitemap. Your sitemap should contain every page you want indexed and nothing you don't. Pages returning non-200 status codes in your sitemap need to be removed.

Step 2: Audit Indexation

Open Google Search Console and check the Pages report. This tells you exactly how many pages Google has indexed versus excluded.

Common indexation problems:

  • "Discovered — currently not indexed" — Google found the page but didn't think it was worth indexing. Usually means thin content or low internal link equity.
  • "Crawled — currently not indexed" — Google read the page and decided against indexing it. The content might be duplicate or low-value.
  • Unintended noindex tags — CMS defaults, staging environments leaking into production, or overzealous robots.txt rules.

Check your robots.txt file manually. Visit yoursite.com/robots.txt and verify it's not blocking important paths. We've seen startups accidentally block their entire /blog/ directory after a staging misconfiguration.

The gap between "pages you published" and "pages Google indexed" is your technical debt. Close that gap before publishing anything new.
HotPress Engineering

Canonical tags matter here too. Every page should have a self-referencing canonical, and duplicate content should point canonicals to the preferred version. Run a crawl filter for pages with missing or mismatched canonical tags.

Step 3: Measure Core Web Vitals

Core Web Vitals are Google's user experience metrics. Three numbers define whether your pages pass:

< 2.5s — Largest Contentful Paint (LCP) — how fast main content loads (Google Web Vitals)
< 200ms — Interaction to Next Paint (INP) — how fast pages respond to clicks (Google Web Vitals)
< 0.1 — Cumulative Layout Shift (CLS) — how much the page jumps around (Google Web Vitals)

INP replaced First Input Delay (FID) in March 2024. If your audit tools still reference FID, they're outdated.

Only 33% of websites pass all three Core Web Vitals thresholds. Fixing yours puts you ahead of two-thirds of the web.

Check field data in Search Console under Core Web Vitals. For page-level diagnostics, run individual URLs through PageSpeed Insights. Focus on pages with the most organic traffic first — improving CWV on a page getting 5,000 monthly visits matters more than fixing a page getting 50.

Don't chase a perfect 100 on PageSpeed Insights. A score of 80+ with passing CWV thresholds is good enough. The last 20 points often require engineering effort that delivers zero measurable ranking benefit.

Step 4: Audit Site Architecture and Internal Links

Site architecture determines how PageRank flows through your site. Pages buried 4+ clicks from the homepage get crawled less frequently and rank worse.

Check your click depth in Screaming Frog's Crawl Depth report. Every important page should be reachable within 3 clicks of the homepage.

Problems to hunt for:

  • Pages with zero internal links pointing to them — orphans, invisible to crawlers following link paths
  • Hub pages with too few outbound links — your blog index, category pages, and pillar pages should link generously to related content
  • Broken internal links — any href pointing to a 404 wastes both PageRank and user trust

If you're running a content-heavy site, topic clusters are the cleanest architecture pattern. One pillar page links to 8-15 supporting articles, each linking back. Google follows the structure and understands the topical authority you're building.

Step 5: Check Mobile and HTTPS

Google uses mobile-first indexing. The mobile version of your site is what gets crawled and ranked. Desktop is secondary.

Test your key pages with Chrome DevTools device emulation:

  • Text too small to read without zooming
  • Buttons or links too close together (tap target size)
  • Horizontal scrolling on any viewport
  • Content hidden behind JavaScript that mobile crawlers can't execute

Mobile accounts for roughly 60% of global web traffic. A 1-second delay on mobile drops conversions by 20%. These aren't edge cases — they're your default audience.

HTTPS is non-negotiable. Over 95% of page-one results use HTTPS. If you're still on HTTP, that's your highest-priority fix. Mixed content warnings — HTTPS pages loading HTTP resources — also need attention.

Run Google's Mobile-Friendly Test on your top 5 landing pages. If any fail, fix those before worrying about anything else in this audit — mobile usability issues affect every visitor, not just search bots.

Step 6: Test AI Crawler Access

This is the audit step most guides skip entirely. In 2026, your site isn't just competing for Google rankings — it's competing for mentions in AI-generated answers.

Check your robots.txt for these user agents:

  • GPTBot (OpenAI / ChatGPT)
  • ClaudeBot (Anthropic)
  • PerplexityBot (Perplexity)
  • Google-Extended (Gemini)

Blocking them means your content won't appear in AI overviews or answer engines. For most sites, allowing AI crawlers is the right call — it's free distribution you're leaving on the table otherwise.

Structured Data Matters More Now
Schema markup (FAQ, HowTo, Article) helps both Google and AI systems understand your content. Adding structured data to key pages improves your chances of appearing in featured snippets and AI-generated answers. See our SEO audit checklist for the full schema implementation guide.

Verify your pages render correctly without JavaScript too. AI crawlers and Google's secondary rendering service don't always execute JS. If your content depends on client-side rendering, consider server-side rendering or static generation.

What to Fix First

By now you know how to conduct a technical SEO site audit. You'll finish with a list of 15-50 issues. Don't try to fix everything at once.

Prioritize by impact:

  1. Critical — anything blocking crawling or indexation (robots.txt blocks, noindex tags, 5xx errors, missing sitemaps)
  2. High — Core Web Vitals failures on high-traffic pages, broken internal links, HTTPS issues
  3. Medium — redirect chains, missing schema, duplicate content, orphan pages
  4. Low — image compression, minor CLS shifts, meta description length

Fix critical issues first. You might see ranking movement within days of resolving crawl blocks. High-priority fixes typically show results within 2-4 weeks after Google recrawls.

A single technical audit won't protect you forever. Run this process quarterly. Sites break — CMS updates, new deploys, and plugin changes can quietly reintroduce every issue you fixed.
HotPress Team

Schedule a recurring quarterly audit. Many of the common SEO mistakes we see come from teams who audited once and never checked again. A 30-minute check every three months catches regressions before they cost traffic.

If you're a startup and this feels overwhelming, start with steps 1-3. Crawl access, indexation, and Core Web Vitals cover the issues responsible for 80% of technical ranking problems. Our guide on SEO for startups covers where technical audits fit into your broader SEO content strategy.

Want to see this in action? Start with a free site scan — from site scan to published article in one workflow.

Top comments (0)