I got tired of calling 7 different APIs just to understand a single website. So I built one that does everything in a single call.
Historically, getting a complete picture of a website required a "Frankenstein" stack of integrations. You would need one API for tech stack detection (like Wappalyzer), another for SEO metrics, a third for email/contact scraping, a fourth for WHOIS data, and yet another to clean the HTML into Markdown for your LLMs.
Managing seven different API keys, seven different billing cycles, and seven different JSON formats is a developer's nightmare. It introduces latency, increases costs, and creates multiple points of failure.
Enter the WebSight API. Hosted on the Apify platform, this tool collapses those seven layers into a single, high-performance API call. In this tutorial, we will explore how to use the WebSight API to extract tech stacks, SEO audits, contacts, and AI-ready content in one go.
The Problem: The Seven-API Burden
Before we dive into the solution, let's look at what developers typically have to juggle to get comprehensive website intelligence:
- Tech Stack Detection: Identifying if a site uses React, Shopify, or WordPress.
- SEO Audit: Checking meta tags, H1 headers, and internal link structures.
- Contact Extraction: Finding emails, LinkedIn profiles, and Twitter handles.
- Structured Data: Parsing JSON-LD or Microdata for products or events.
- AI Content Scoring: Determining the readability and "human-ness" of the content.
- Domain Intel: Checking SSL status, WHOIS records, and server location.
- Clean Content Extraction: Converting messy HTML into clean Markdown for GPT-4 or Claude.
Using separate services for these tasks means your application has to wait for seven different round-trips to various servers. WebSight API solves this by performing all these checks server-side in a single execution.
Introducing WebSight API
The WebSight API is a comprehensive website analysis api designed for speed and depth. By calling a single endpoint, you receive a structured JSON object containing everything from the site's underlying framework to the social media profiles of its founders.
Endpoint: https://george-the-developer--websight-api.apify.actor/analyze?url=URL
The 7 Modules of WebSight
- Tech Stack Detection API: Identifies over 5,000 technologies, including CMS, UI frameworks, analytics, and payment processors.
- SEO Audit API: Provides a real-time health check on title tags, descriptions, canonicals, and image alt texts.
- Contact & Social Intel: Scrapes the DOM and headers for emails, phone numbers, and links to platforms like X, LinkedIn, and Instagram.
- Structured Data Parser: Extracts and flattens Schema.org data, making it easy to read product prices or review scores.
- AI Content Metrics: Scores the content for reading ease and provides a summary of the page's intent.
- Infrastructure & Security: Returns SSL details, IP addresses, and hosting providers.
- Markdown Engine: Delivers the main content of the page in clean Markdown, stripped of ads and navigation menus—perfect for RAG (Retrieval-Augmented Generation) applications.
Implementation: JavaScript and Python
Integrating WebSight into your workflow is straightforward. Below are examples of how to call the API using the two most popular languages for web automation.
JavaScript (Fetch API)
const analyzeWebsite = async (targetUrl) => {
const apiEndpoint = \`https://george-the-developer--websight-api.apify.actor/analyze?url=\${encodeURIComponent(targetUrl)}\`;
try {
const response = await fetch(apiEndpoint);
if (!response.ok) throw new Error('Network response was not ok');
const data = await response.json();
console.log("Tech Stack:", data.tech_stack);
console.log("SEO Health:", data.seo.score);
console.log("Main Content (Markdown):", data.content.markdown.substring(0, 100) + "...");
} catch (error) {
console.error("Error analyzing website:", error);
}
};
analyzeWebsite('https://stripe.com');
Python (Requests)
import requests
import json
def get_website_intel(target_url):
api_url = f"https://george-the-developer--websight-api.apify.actor/analyze?url={target_url}"
response = requests.get(api_url)
if response.status_code == 200:
data = response.json()
# Accessing specific modules
contacts = data.get('contacts', {})
tech = data.get('tech_stack', [])
print(f"Emails Found: {contacts.get('emails')}")
print(f"Technologies: {', '.join(tech[:5])}")
return data
else:
print(f"Failed to retrieve data: {response.status_code}")
intel = get_website_intel('https://stripe.com')
Realistic Output: Analyzing Stripe.com
When you point the WebSight API at a sophisticated site like Stripe.com, you get a glimpse of its true power. Instead of a wall of HTML, you get a clean, actionable website intelligence api response.
JSON Output Fragment (Stripe.com)
{
"url": "https://stripe.com",
"status": 200,
"metadata": {
"title": "Stripe | Financial Infrastructure for the Internet",
"description": "Stripe is a suite of APIs powering online payment processing and commerce solutions for internet businesses of all sizes."
},
"tech_stack": [
"React",
"Next.js",
"Stripe Payments",
"Amazon Web Services",
"Google Analytics",
"Marketo",
"HSTS"
],
"seo": {
"score": 94,
"h1_count": 1,
"missing_alt_tags": 0,
"is_mobile_friendly": true,
"sitemap_found": true
},
"contacts": {
"emails": ["support@stripe.com", "sales@stripe.com"],
"social": {
"twitter": "https://twitter.com/stripe",
"linkedin": "https://linkedin.com/company/stripe",
"github": "https://github.com/stripe"
}
},
"infrastructure": {
"ip": "3.233.126.24",
"provider": "Amazon Data Services",
"ssl_expiry": "2025-12-01T00:00:00Z"
},
"content": {
"word_count": 1240,
"readability_score": "College Level",
"markdown": "# Financial infrastructure for the internet..."
}
}
Key Use Cases
1. Competitive Intelligence
Monitor your competitors' tech stacks. Did they just switch from Shopify to Next.js? Are they using a new A/B testing tool? WebSight API allows you to track these changes at scale without manual inspection.
2. Hyper-Targeted Lead Generation
For B2B companies, the "Tech Stack Detection API" is a goldmine. If you sell a React performance monitoring tool, you can use WebSight to find every company in a specific niche that is using React and has a low SEO score, then reach out to them via the extracted emails.
3. SEO Auditing at Scale
Instead of running a manual audit tool for every new client, agencies can programmatically generate a health report in seconds. By checking meta tags, H1s, and site speed indicators in one call, you can provide instant value during discovery calls.
4. Giving AI Agents "Eyes"
If you are building an AI agent that needs to "browse" the web, providing it with raw HTML is expensive and noisy. WebSight API provides the AI with clean Markdown and structured metadata. This reduces token usage and improves the accuracy of the AI's summaries and actions.
Conclusion
The era of fragmented web scraping is over. By consolidating tech detection, SEO auditing, contact extraction, and content cleaning into a single website analysis api, the WebSight API allows developers to focus on building features rather than managing infrastructure.
Whether you are building the next big SaaS or a simple automation script, the ability to get 360-degree website intelligence with one API call is a game-changer for your development velocity.
Ready to try it? Head over to the WebSight API on Apify and start analyzing.
Top comments (0)