DEV Community
#webscraping
Posts
How I Built a Real Chinese Product Review Aggregator (and Why English Reviews Are Broken)
wuledan
May 15
#showdev #sideprojects #webdev #webscraping
1 min read

Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run
Alex Spinov
May 28
#webscraping #python #ai #dataengineering
2 reactions, 1 comment
8 min read

How I scraped Welcome to the Jungle Jobs and what the dataset actually looks like
Can Yılmaz
May 15
#webscraping #apify #jobs #tutorial
4 min read

What I learned scraping Website Contact: schema, gotchas and the tooling that worked
Can Yılmaz
May 15
#webscraping #apify #leadgen #tutorial
3 min read

Sample dataset analysis: a 100-row snapshot of Sitemap
Can Yılmaz
May 15
#webscraping #apify #data #dataengineering
3 min read

The German Web Scraping Market: €190M and Growing
James
May 13
#webscraping #germany #automation #business
4 min read

DSGVO-Compliant Web Scraping: What German Businesses Need to Know
James
May 13
#dsgvo #webscraping #gdpr #germany
4 min read

Automating Web Intelligence with Python: A Practical Guide
James
May 13
#python #webscraping #automation #tutorial
4 min read

I Built a Web Scraper API That Handles JS Rendering, CAPTCHAs, and Proxies
Charles
May 14
#webdev #webscraping #javascript
2 min read

xcrawl-scraper v1.0.1 — Node.js SDK for Web Scraping
Charles
May 14
#showdev #javascript #webscraping #opensource
1 reaction
1 min read

Giving n8n AI Workflows Fresh Web Data Without Babysitting Scrapers
Anakin
May 27
#n8n #ai #webscraping #automation
2 comments
5 min read

Raw HTML is where LLM context goes to die
Massi
May 13
#ai #webdev #llm #webscraping
1 reaction
5 min read

Scraping Chinese Social Platforms for LLM Training Data: A Practical Multi-Source Pipeline (Python, 2026)
Sami
May 12
#python #webscraping #china #ai
7 min read

What to do when websites change and your spider doesn't know
John Rooney for Extract by Zyte
May 11
#webscraping #zyte #data #programming
1 reaction
6 min read

Web Scraping in 2024: Whats Legal, Whats Not, and What Works
James
May 9
#webscraping #legal #gdpr #ai
6 min read