DEV Community

Алексей Спинов

Wayback Machine API — Get Historical Snapshots of Any Website

The Internet Archive's CDX API lists every capture the Wayback Machine has saved for a given URL.

The API

https://web.archive.org/cdx/search/cdx?url=apple.com&output=json&limit=20

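Returns one row per capture: URL key, timestamp, original URL, MIME type, status code, digest, and length (the size of the stored record in bytes). With output=json, the first row is a header listing the field names.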
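A minimal Python sketch of calling the endpoint above and turning the JSON rows into dictionaries. The function names (`build_cdx_url`, `rows_to_dicts`, `fetch_captures`) are made up for illustration, and the code assumes the first JSON row is a header of field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(url, limit=20):
    """Build the same query URL shown in the article (hypothetical helper)."""
    params = {"url": url, "output": "json", "limit": limit}
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def rows_to_dicts(rows):
    """Assumes the first row is a header; every other row is one capture."""
    if not rows:
        return []
    header, *captures = rows
    return [dict(zip(header, row)) for row in captures]


def fetch_captures(url, limit=20, timeout=30):
    """Query the CDX API over the network and return a list of capture dicts."""
    with urlopen(build_cdx_url(url, limit), timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    # Assumption: a query with no matches may return an empty body.
    return rows_to_dicts(json.loads(body)) if body.strip() else []
```

Calling `fetch_captures("apple.com")` should give up to 20 dictionaries, each keyed by the field names in the header row.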

What You Can Do

  • See how a site looked 10 years ago — useful for brand research
  • Track competitor changes — when did they redesign?
  • Find deleted content — pages that are now 404
  • Verify claims — what did a company say before they edited it?
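Two of these ideas can be sketched in a few lines of Python once the captures are loaded as dictionaries. This is an illustration under assumptions, not a definitive method: the helper names are hypothetical, the captures are assumed to be sorted oldest first, and the `digest`, `statuscode`, and `timestamp` values are assumed to be strings, as in the JSON output.

```python
def content_changes(captures):
    """Keep each capture whose digest differs from the one before it.

    A new digest suggests the stored content changed between crawls.
    """
    changes, previous = [], None
    for capture in captures:
        if capture["digest"] != previous:
            changes.append(capture)
            previous = capture["digest"]
    return changes


def last_successful_capture(captures):
    """Return the most recent capture with status 200, or None.

    Useful for a page that now returns 404: this is the last good copy.
    Fixed-width timestamps compare correctly as plain strings.
    """
    ok = [c for c in captures if c["statuscode"] == "200"]
    return max(ok, key=lambda c: c["timestamp"]) if ok else None
```

A digest change only says that the bytes differ; whether that is a redesign or a rotated banner still takes a look at the snapshots themselves.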

Archive URL Format

https://web.archive.org/web/20250101120000/https://example.com

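Timestamp format: YYYYMMDDHHmmss (year, month, day, hour, minute, second). If no capture exists at that exact time, the Wayback Machine redirects to the closest one.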
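To make the format concrete, here is a small Python sketch that converts between the 14-digit timestamp and a `datetime`, and builds a snapshot URL in the shape shown above. The helper names are hypothetical, and treating the timestamp as UTC is an assumption.

```python
from datetime import datetime, timezone

WAYBACK_PREFIX = "https://web.archive.org/web"
TIMESTAMP_FORMAT = "%Y%m%d%H%M%S"  # YYYYMMDDHHmmss


def parse_timestamp(ts):
    """Turn a 14-digit timestamp into a datetime (assumed to be UTC)."""
    return datetime.strptime(ts, TIMESTAMP_FORMAT).replace(tzinfo=timezone.utc)


def format_timestamp(dt):
    """Turn a datetime into a 14-digit timestamp; naive values are used as-is."""
    if dt.tzinfo is not None:
        dt = dt.astimezone(timezone.utc)
    return dt.strftime(TIMESTAMP_FORMAT)


def snapshot_url(ts, original):
    """Build an archive URL: prefix, then timestamp, then the original URL."""
    return f"{WAYBACK_PREFIX}/{ts}/{original}"
```

Feeding a `timestamp` and `original` value from a CDX row into `snapshot_url` gives the link to that capture.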

Collapse by Date

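Use collapse=timestamp:8 to get one snapshot per day instead of every crawl. The :8 compares only the first 8 digits of the timestamp (YYYYMMDD), so consecutive captures from the same day collapse into one row.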
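A short Python sketch of the same idea: one helper adds the parameter to the query URL, and a second one imitates the collapsing locally so the effect is easy to see. Both names are hypothetical, and the local version is only an approximation of what the server does with adjacent rows.

```python
from itertools import groupby
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def daily_cdx_url(url, limit=20):
    """Build a CDX query that asks for at most one capture per day."""
    params = {
        "url": url,
        "output": "json",
        "limit": limit,
        "collapse": "timestamp:8",
    }
    # safe=":" keeps the colon readable instead of encoding it as %3A.
    return f"{CDX_ENDPOINT}?{urlencode(params, safe=':')}"


def collapse_by_prefix(timestamps, digits=8):
    """Keep the first of each run of adjacent timestamps sharing a prefix."""
    return [next(run) for _, run in groupby(timestamps, key=lambda ts: ts[:digits])]
```

Changing `digits` to 6 in the local helper groups by month (YYYYMM) in the same way.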

Apple.com Example

My scraper found 4,817 snapshots for apple.com across 6 sitemap partitions.

I built a Wayback Machine Scraper on Apify. To find it, search for "knotless_cadence wayback".
