The Internet Archive's CDX API lets you find every saved version of any website.
The API
https://web.archive.org/cdx/search/cdx?url=apple.com&output=json&limit=20
Returns: timestamp, original URL, status code, MIME type, file size, digest.
What You Can Do
- See how a site looked 10 years ago — useful for brand research
- Track competitor changes — when did they redesign?
- Find deleted content — pages that are now 404
- Verify claims — what did a company say before they edited it?
Archive URL Format
https://web.archive.org/web/20250101120000/https://example.com
Timestamp format: YYYYMMDDHHmmss
Collapse by Date
Use collapse=timestamp:8 to get one snapshot per day instead of every crawl.
Apple.com Example
My scraper found 4,817 snapshots for apple.com across 6 sitemap partitions.
I built a Wayback Machine Scraper on Apify — search knotless_cadence wayback.
Top comments (0)