DEV Community

# webscraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Building a Lightweight Media Downloader with Modern Web Techniques (Pinterest Case Study)

Building a Lightweight Media Downloader with Modern Web Techniques (Pinterest Case Study)

Comments
3 min read
Track brand mentions across China's top 5 social platforms in one API call — $0.045 per mention

Track brand mentions across China's top 5 social platforms in one API call — $0.045 per mention

Comments
10 min read
Why JSON Schema Validation Isn't Enough for Apify Actors

Why JSON Schema Validation Isn't Enough for Apify Actors

1
Comments
18 min read
Why your scraper plateaus at 5-6 concurrent Chrome instances (and the shared-cookie trap nobody names)

Why your scraper plateaus at 5-6 concurrent Chrome instances (and the shared-cookie trap nobody names)

Comments
4 min read
I Built an API That Lets AI Agents See the Web Like Humans Do

I Built an API That Lets AI Agents See the Web Like Humans Do

Comments
3 min read
Facebook scrambles author names with Flexbox order — here's the 5-line diagnostic that proves it isn't custom fonts

Facebook scrambles author names with Flexbox order — here's the 5-line diagnostic that proves it isn't custom fonts

Comments
5 min read
5 Apify webhook patterns that turn one-off scrapers into reliable data pipelines

5 Apify webhook patterns that turn one-off scrapers into reliable data pipelines

1
Comments
5 min read
The Hidden Problem Behind Technical SEO Crawlers: URL Explosion

The Hidden Problem Behind Technical SEO Crawlers: URL Explosion

1
Comments 1
1 min read
QuickCommerce API

QuickCommerce API

1
Comments
1 min read
BeautifulSoup and Requests for Web Scraping With Python: When Simple Still Works

BeautifulSoup and Requests for Web Scraping With Python: When Simple Still Works

Comments
4 min read
Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode

Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode

1
Comments 2
1 min read
NYTimes वीडियो स्ट्रीमिंग का विश्लेषण: HLS और FFmpeg के साथ एक हाई-परफॉर्मेंस एक्सट्रैक्शन इंजन का निर्माण

NYTimes वीडियो स्ट्रीमिंग का विश्लेषण: HLS और FFmpeg के साथ एक हाई-परफॉर्मेंस एक्सट्रैक्शन इंजन का निर्माण

Comments
1 min read
Puppeteer networkidle is not a scraping strategy

Puppeteer networkidle is not a scraping strategy

2
Comments 2
5 min read
Why Playwright Gets You Blocked Even With Proxies

Why Playwright Gets You Blocked Even With Proxies

Comments
4 min read
Korea's #1 Real Estate Platform Has No Official API — So I Built a Scraper. Then Got Blocked.

Korea's #1 Real Estate Platform Has No Official API — So I Built a Scraper. Then Got Blocked.

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.