DEV Community

# webscraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
AI-Powered Web Scraper

AI-Powered Web Scraper

Comments
4 min read
I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.

I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.

Comments
3 min read
Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide

Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide

1
Comments
5 min read
Scraping Dynamic Web Pages Without Selectors Using AI Vision (TypeScript/JavaScript Tutorial)

Scraping Dynamic Web Pages Without Selectors Using AI Vision (TypeScript/JavaScript Tutorial)

1
Comments
2 min read
Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level

Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level

Comments
7 min read
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Comments
4 min read
FULL SSRF + EXFILTRACION EN CRAWLEE

FULL SSRF + EXFILTRACION EN CRAWLEE

Comments
12 min read
What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked

What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked

Comments
3 min read
Sample dataset analysis: a 100-row snapshot of Bazaraki

Sample dataset analysis: a 100-row snapshot of Bazaraki

Comments
3 min read
Comparing approaches to extracting Hacker News Who Is Hiring data

Comparing approaches to extracting Hacker News Who Is Hiring data

Comments
3 min read
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight

Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight

Comments
3 min read
What I learned scraping ClinicalTrials.gov: schema, gotchas and the tooling that worked

What I learned scraping ClinicalTrials.gov: schema, gotchas and the tooling that worked

Comments
3 min read
How I Built a Real Chinese Product Review Aggregator (and Why English Reviews Are Broken)

How I Built a Real Chinese Product Review Aggregator (and Why English Reviews Are Broken)

Comments
1 min read
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run

Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run

2
Comments 1
8 min read
What I learned scraping Website Contact: schema, gotchas and the tooling that worked

What I learned scraping Website Contact: schema, gotchas and the tooling that worked

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.