Last year I needed full domain intelligence for a lead qualification tool. It sounded straightforward: get WHOIS data, check DNS records, verify SSL, detect the tech stack.
Here's what I ended up wiring together:
- WhoisXML API — WHOIS/RDAP data
- SecurityTrails — DNS records
- SSL Labs — SSL analysis (free but 60s+ per query)
- BuiltWith — tech stack detection
Four APIs. Four API keys. Four SDKs. Four completely different response formats.
And the worst part? Each one returned a different JSON structure. WHOIS data came as nested objects, DNS as arrays, SSL Labs had its own grading system, BuiltWith returned a tree of technology categories. My "simple" lead enrichment function ballooned to 400 lines of normalization code — and that was before I added any actual business logic.
The fragmentation problem
This isn't unique to my use case. If you're building:
- A sales tool that qualifies leads by their domain
- A security dashboard that monitors certificate expiration
- An SEO platform that analyzes competitor tech stacks
- A compliance system that validates domain configurations
...you're stitching together the same 3-5 APIs. Everyone is solving the same integration problem independently.
And when one of those APIs changes their response format (WhoisXML did this twice in 6 months) or goes down at 2am (SecurityTrails had a 4-hour outage last quarter), your entire pipeline breaks — and debugging means figuring out WHICH of the four integrations failed.
The cognitive load alone is exhausting. You're not thinking about your product. You're thinking about API rate limits, retries, schema diffs, and which client library is lagging behind the upstream changes.
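To make that concrete: every one of those four integrations ends up wrapped in roughly the same retry glue. Here's a minimal sketch of what that looks like — the function name, attempt count, and error handling are my own assumptions, not any provider's SDK:

```javascript
// Hypothetical sketch of per-provider retry glue. In real code you'd
// also back off between attempts (e.g. sleep 2**i seconds) and treat
// rate-limit errors (429) differently from hard failures.
function withRetry(fn, attempts = 3) {
  let lastError;
  for (let i = 0; i < attempts; i++) {
    try {
      return fn(); // success: return immediately
    } catch (err) {
      lastError = err; // remember the failure and try again
    }
  }
  throw lastError; // all attempts exhausted
}
```

Now multiply that by four providers, each with its own idea of what a retryable error looks like.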
What the normalization code actually looks like
Here's a simplified version of what I ended up writing just to unify the WHOIS and DNS responses:
function normalizeDomainData(whoisRaw, dnsRaw, sslRaw) {
  // WhoisXML returns registrar under WhoisRecord.registrarData.registrarName
  // SecurityTrails returns DNS as { type: "A", values: [...] }[]
  // SSL Labs returns a chain of endpoints with grading per IP
  const registrar =
    whoisRaw?.WhoisRecord?.registrarData?.registrarName
    ?? whoisRaw?.registrar // format changed in v2
    ?? null;

  const aRecords = (dnsRaw?.records ?? [])
    .filter(r => r.type === "A")
    .flatMap(r => (r.values ?? []).map(v => v.ip));

  const sslGrade = sslRaw?.endpoints?.[0]?.grade ?? "unknown";

  return { registrar, aRecords, sslGrade };
}
This is just three fields. Now multiply that by every field you need from every API, add error handling, add field-level null checks, add the logic to re-fetch when one provider fails... and you start to see why 400 lines isn't an exaggeration.
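And that's before failure isolation. Since one flaky provider shouldn't take down the whole enrichment — and you need to know which one failed — each normalizer ends up wrapped in something like this (a sketch; the `safeNormalize` name and result shape are mine, not from any of the APIs above):

```javascript
// Wrap a single provider's normalizer so a bad payload produces a
// labeled error instead of crashing the whole pipeline.
function safeNormalize(label, fn, raw) {
  try {
    return { ok: true, data: fn(raw) };
  } catch (err) {
    // The label answers "WHICH integration failed?" at 2am.
    return { ok: false, error: `${label}: ${err.message}` };
  }
}
```

Call it once per provider, collect the results, and you have a partial-but-debuggable response instead of an unhandled exception.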
What a unified response actually looks like
Here's what I wanted to be able to call from day one:
curl https://api.domainlens.com/v1/domain/stripe.com \
-H "Authorization: Bearer dk_live_xxx"
And get back:
{
  "domain": "stripe.com",
  "whois": {
    "registrar": "MarkMonitor Inc.",
    "organization": "Stripe, Inc.",
    "created": "2010-01-12",
    "expires": "2030-01-12",
    "nameservers": ["ns1.p16.dynect.net", "..."],
    "dnssec": true
  },
  "dns": {
    "a": ["185.166.143.38"],
    "mx": [{"priority": 1, "host": "aspmx.l.google.com"}],
    "txt": ["v=spf1 include:_spf.google.com ..."],
    "mail_provider": "Google Workspace",
    "spf": true,
    "dmarc": true
  },
  "ssl": {
    "issuer": "Let's Encrypt",
    "valid_from": "2025-11-15",
    "valid_to": "2026-02-13",
    "days_remaining": 42,
    "protocol": "TLSv1.3"
  },
  "tech": {
    "frameworks": ["React", "Next.js"],
    "cdn": ["Cloudflare"],
    "analytics": ["Segment"],
    "hosting": "AWS"
  },
  "reputation": {
    "score": 94,
    "factors": {
      "domain_age": "high",
      "ssl_valid": true,
      "email_security": "strong",
      "dns_config": "optimal"
    }
  }
}
One consistent schema. No field-level guessing. No "did this provider change their format again?"
Real-world use cases
Lead qualification
Your sales team gets an inbound signup from acme.io. Before anyone picks up the phone:
GET /v1/domain/acme.io
Now you know: they use React + Next.js (technical team), they host on AWS (budget for infrastructure), their domain is 8 years old (established company), and they run Google Workspace (active organization). That's a qualified lead — no manual research, no LinkedIn stalking.
The signal that matters most here isn't any single field — it's the combination. WHOIS age + tech stack + DNS config tells a coherent story. With separate APIs you'd have to join that data yourself.
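That join is just a scoring function over one object. Here's a hypothetical version — the field names follow the example schema earlier in the post, but the weights and thresholds are made up for illustration:

```javascript
// Hypothetical lead score over the unified response. Weights are
// arbitrary; tune them against your own conversion data.
function scoreLead(d) {
  let score = 0;
  const createdYear = Number(d.whois.created.slice(0, 4));
  const age = new Date().getFullYear() - createdYear;
  if (age >= 5) score += 25;                                   // established domain
  if ((d.tech.frameworks ?? []).length > 0) score += 25;       // technical team
  if (d.tech.hosting === "AWS") score += 20;                   // infrastructure budget
  if (d.dns.mail_provider === "Google Workspace") score += 15; // active organization
  if (d.dns.spf && d.dns.dmarc) score += 15;                   // well-run IT
  return score;
}
```

With four separate APIs, the same function would need four normalized inputs before it could run at all.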
Security monitoring
You manage 200+ client domains. Instead of checking SSL expiration in one tool, DNS in another, and WHOIS in a third:
POST /v1/domains/bulk
["client1.com", "client2.com", ..., "client200.com"]
Then filter in one pass:
domains
  .filter(d =>
    d.ssl.days_remaining < 30 ||
    d.dns.dmarc === false ||
    daysBefore(d.whois.expires) < 90
  )
  .forEach(sendAlert);
One script, one cron job, complete coverage across all three risk vectors.
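The snippet above leans on a `daysBefore` helper. One way it could be implemented — a sketch that treats ISO date strings as UTC and floors partial days, so results go negative once the date has passed:

```javascript
// Days from `now` until an ISO date string ("2030-01-12").
// Negative means the date is already in the past.
function daysBefore(isoDate, now = new Date()) {
  const msPerDay = 24 * 60 * 60 * 1000;
  return Math.floor((new Date(isoDate) - now) / msPerDay);
}
```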
Competitive analysis
Your SEO platform needs to show users what technology their competitors use:
GET /v1/domain/competitor.com
Pull tech.frameworks, tech.cdn, tech.analytics directly — no scraping, no browser automation, 200+ technology fingerprints detected from HTTP headers and HTML analysis. And because it's in the same response, you can cross-reference with DNS config and domain age without a second call.
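A typical view to build on top of that is a stack diff between two competitors. A minimal sketch, assuming the `tech` object from the unified schema above (the `diffTech` function and its result shape are hypothetical):

```javascript
// Compare two domains' detected stacks: what's unique to each,
// and what they share.
function diffTech(a, b) {
  const flatten = t => [
    ...(t.frameworks ?? []),
    ...(t.cdn ?? []),
    ...(t.analytics ?? []),
  ];
  const setA = new Set(flatten(a));
  const setB = new Set(flatten(b));
  return {
    onlyA: [...setA].filter(x => !setB.has(x)),
    onlyB: [...setB].filter(x => !setA.has(x)),
    shared: [...setA].filter(x => setB.has(x)),
  };
}
```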
The normalization tax is real
Every team building on top of fragmented APIs pays the same tax:
- Initial integration time — understanding each API's quirks, auth flows, pagination
- Ongoing maintenance — each provider ships breaking changes on their own schedule
- Incident debugging — when your pipeline breaks, which provider is down?
- Response stitching — writing and maintaining the glue code that joins N schemas into one
That last one is the invisible cost. It's not a one-time task. Every time you add a new field, you touch the normalization layer. Every time a provider updates their schema, you touch it again. It's technical debt that compounds quietly.
When NOT to use this
If you only need raw WHOIS data and nothing else, a dedicated WHOIS API will likely be cheaper at high volume. If you need historical DNS data — what did this domain resolve to 6 months ago? — you'll still need SecurityTrails or a similar provider with historical records.
A unified API like this is optimized for: "I need complete, current intelligence about a domain in one call." If your use case is narrower than that, a specialized API might serve you better. The consolidation benefit only kicks in when you need data from multiple of these categories at once.
I'm building DomainLens to consolidate all of this into one call. Still in development — if you've hit the same wall, join the waitlist to get early access when it launches.
How are you currently handling domain intelligence in your projects? I'd love to hear if others have hit the same fragmentation wall — or found a better approach.