DEV Community

# webscraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction

The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction

1
Comments 1
5 min read
From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker

From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker

2
Comments
5 min read
How to Scrape Pinduoduo (拼多多) for Product Data: A Complete Guide

How to Scrape Pinduoduo (拼多多) for Product Data: A Complete Guide

Comments
5 min read
I Benchmarked 6 LLMs to Automate My Job Board for $0.35/Month

I Benchmarked 6 LLMs to Automate My Job Board for $0.35/Month

5
Comments 2
6 min read
Scraping Chinese E-commerce Sites: Challenges and Solutions

Scraping Chinese E-commerce Sites: Challenges and Solutions

2
Comments 1
6 min read
How I Built an AI-Driven Job Automation Engine: My Hardest Engineering Lessons

How I Built an AI-Driven Job Automation Engine: My Hardest Engineering Lessons

5
Comments 1
3 min read
Calendar Feeds: Where It All Started

Calendar Feeds: Where It All Started

2
Comments
4 min read
Your AI Agent Doesn't Need Firecrawl Anymore

Your AI Agent Doesn't Need Firecrawl Anymore

Comments 2
6 min read
The End of APIs: Why Vision Agents Are the Future of Scraping

The End of APIs: Why Vision Agents Are the Future of Scraping

1
Comments 1
2 min read
Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools

Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools

2
Comments
5 min read
Automating Catalog Sync: Designing Resilient Scrapers for Dynamic Marketplaces

Automating Catalog Sync: Designing Resilient Scrapers for Dynamic Marketplaces

2
Comments
5 min read
Super Simple Web Scraping in Java (Jsoup)

Super Simple Web Scraping in Java (Jsoup)

1
Comments
2 min read
Data Quality at Scale: Validating JSONL Output with Pydantic

Data Quality at Scale: Validating JSONL Output with Pydantic

1
Comments 1
4 min read
Building a 'Living' Market Intelligence Dashboard with Python and Streamlit

Building a 'Living' Market Intelligence Dashboard with Python and Streamlit

1
Comments 2
5 min read
Hardcoded Selectors vs. AI Prompts: A Resilience Benchmark on Etsy

Hardcoded Selectors vs. AI Prompts: A Resilience Benchmark on Etsy

Comments 1
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.