DEV Community

# webscraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Synthesio charges $36K+/year for Chinese platform coverage. I built one for $0.045/mention.

Synthesio charges $36K+/year for Chinese platform coverage. I built one for $0.045/mention.

Comments
8 min read
A Self-Hosted Web Content Extraction API

A Self-Hosted Web Content Extraction API

9
Comments 1
5 min read
How to Scrape JS-Rendered E-Commerce Pages Without Getting Blocked (2026 Guide)

How to Scrape JS-Rendered E-Commerce Pages Without Getting Blocked (2026 Guide)

Comments
2 min read
XCrawl vs Puppeteer vs Playwright: Which Web Scraping Tool Saves You More Time in 2026?

XCrawl vs Puppeteer vs Playwright: Which Web Scraping Tool Saves You More Time in 2026?

Comments
2 min read
Stop Building Fragile Scrapers — Build Actors Instead

Stop Building Fragile Scrapers — Build Actors Instead

Comments
4 min read
How Modern Anti-Bot Systems Detect Automation Before HTML Loads

How Modern Anti-Bot Systems Detect Automation Before HTML Loads

Comments
3 min read
Bulk Downloading Amazon Product Images via API and MCP: A Complete Developer Guide

Bulk Downloading Amazon Product Images via API and MCP: A Complete Developer Guide

5
Comments
4 min read
Why my Reddit scraper went from 92% to 61% success rate in 30 days (and the one-line fix)

Why my Reddit scraper went from 92% to 61% success rate in 30 days (and the one-line fix)

Comments
4 min read
AI-Powered Web Scraper

AI-Powered Web Scraper

Comments
4 min read
I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.

I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.

Comments
3 min read
Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide

Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide

1
Comments
5 min read
Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level

Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level

Comments
7 min read
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Comments
4 min read
FULL SSRF + EXFILTRACION EN CRAWLEE

FULL SSRF + EXFILTRACION EN CRAWLEE

Comments
12 min read
What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked

What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.