DEV Community

Ghazanfar Uruj
Ghazanfar Uruj

Posted on

I scanned the 15 biggest news sites for trackers. 11 got an F.

I built a small terminal tool called leakwatch that loads any website in a real browser, clicks through the cookie-consent wall, and shows you every tracker, data broker, fingerprinting trick, and session recorder watching you โ€” rolled up to the companies they actually report to.

Then I pointed it at the 15 biggest news sites.

leakwatch demo

The results

# Site Leakage Trackers Brokers Fingerprinting
1 foxnews.com ๐Ÿ”ด 100 (F) 116 11 yes
2 usatoday.com ๐Ÿ”ด 100 (F) 116 11 yes
3 forbes.com ๐Ÿ”ด 100 (F) 113 11 yes
4 thesun.co.uk ๐Ÿ”ด 100 (F) 113 14 yes
5 businessinsider.com ๐Ÿ”ด 100 (F) 109 19 yes
6 cnn.com ๐Ÿ”ด 100 (F) 108 13 yes
7 buzzfeed.com ๐Ÿ”ด 100 (F) 96 15 yes
8 theguardian.com ๐Ÿ”ด 100 (F) 81 12 yes
9 nbcnews.com ๐Ÿ”ด 100 (F) 58 5 yes
10 cnbc.com ๐Ÿ”ด 100 (F) 48 5 yes

11 of 15 scored a flat F. Business Insider leaks to 19 data brokers. Fox News and USA Today each load 116 trackers. Every single red site fingerprints your browser.

(Reuters and Bloomberg sit behind hard paywalls / bot-walls โ€” leakwatch flags those as "under-measured" rather than pretending they're clean.)

How it works

leakwatch runs a two-phase scan:

  1. It loads the page as a fresh anonymous visitor and records every request, cookie, storage write, and fingerprinting call.
  2. It defeats the cookie-consent wall โ€” the ~dozen consent frameworks (OneTrust, Sourcepoint, Cookiebot, Quantcast, Didomiโ€ฆ) by their language-independent IDs โ€” and records the tracker surge that only fires after you "accept."

That second phase is the interesting part: plenty of sites fire trackers before you ever consent, and many more pile on the moment you click the button.

It records only the tracking surface โ€” network metadata, cookies, storage keys, fingerprinting call counts, and response headers. It never downloads or stores page content.

Try it on your own sites

pip install leakwatch
playwright install chromium
leakwatch nytimes.com
Enter fullscreen mode Exit fullscreen mode

You get a live, colour-coded terminal dashboard: a verdict line, a streaming request feed, a company rollup, and panels for fingerprinting, cookies/storage, and a security-headers grade.

Other modes:

leakwatch batch sites.txt --format markdown    # the leaderboard above
leakwatch diff your-site.com -b baseline.json  # CI gate: fail on new trackers
leakwatch example.com --json                   # machine-readable output
Enter fullscreen mode Exit fullscreen mode

Why I built it

Most privacy tools tell you a site is "bad" without the receipts, and security recon usually means digging through devtools by hand. leakwatch does both in one pass and makes the result legible: who's watching you here, how badly, and what data leaves the page.

Open source (MIT), built with Python ยท Textual ยท Playwright.

Scan your own favourite sites and see who they sell you out to. ๐Ÿ”ฆ

Top comments (0)