Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
Scraping
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
I spent 3 days scraping a site until I tried LLMs for data extraction
zhongqiyue
zhongqiyue
zhongqiyue
Follow
Jun 5
I spent 3 days scraping a site until I tried LLMs for data extraction
#
webdev
#
python
#
ai
#
scraping
Comments
Add Comment
6 min read
My web scraping nightmare ended when I let an LLM read the HTML
zhongqiyue
zhongqiyue
zhongqiyue
Follow
Jun 5
My web scraping nightmare ended when I let an LLM read the HTML
#
webdev
#
python
#
ai
#
scraping
Comments
Add Comment
5 min read
I Thought I Knew Web Scraping — Until I Hit JavaScript
zhongqiyue
zhongqiyue
zhongqiyue
Follow
Jun 5
I Thought I Knew Web Scraping — Until I Hit JavaScript
#
webdev
#
python
#
scraping
#
ai
Comments
Add Comment
4 min read
Why I Gave Up on Regex and Started Using AI for Web Scraping
zhongqiyue
zhongqiyue
zhongqiyue
Follow
Jun 5
Why I Gave Up on Regex and Started Using AI for Web Scraping
#
webdev
#
python
#
ai
#
scraping
Comments
Add Comment
5 min read
I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked
zhongqiyue
zhongqiyue
zhongqiyue
Follow
Jun 5
I Spent a Weekend Fighting Flaky Scrapers — Here’s What Finally Worked
#
webdev
#
python
#
ai
#
scraping
Comments
Add Comment
5 min read
Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas
AlterLab
AlterLab
AlterLab
Follow
Jun 5
Advanced Headless Browser Anti-Bot Techniques: TLS & Canvas
#
antibot
#
headlessbrowsers
#
api
#
scraping
Comments
Add Comment
6 min read
Optimizing Chunking and Data Extraction for Zero-Hallucination RAG
AlterLab
AlterLab
AlterLab
Follow
May 28
Optimizing Chunking and Data Extraction for Zero-Hallucination RAG
#
rag
#
llm
#
datapipelines
#
scraping
Comments
Add Comment
4 min read
Track YC Demo Day Companies in Real Time (with code)
NexGenData
NexGenData
NexGenData
Follow
May 22
Track YC Demo Day Companies in Real Time (with code)
#
ycombinator
#
demoday
#
scraping
#
preseed
Comments
Add Comment
5 min read
Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources
Caspar Bannink
Caspar Bannink
Caspar Bannink
Follow
May 14
Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources
#
python
#
scraping
#
architecture
#
realestate
Comments
Add Comment
4 min read
Web Scraping is a Contract
Gani Mendoza
Gani Mendoza
Gani Mendoza
Follow
Jun 1
Web Scraping is a Contract
#
go
#
web
#
scraping
4
 reactions
Comments
Add Comment
8 min read
How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)
qcrao
qcrao
qcrao
Follow
May 10
How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)
#
youtube
#
scraping
#
sideprojects
#
indie
Comments
Add Comment
4 min read
API or browser agent? We picked yes.
Ava Bagherzadeh
Ava Bagherzadeh
Ava Bagherzadeh
Follow
May 6
API or browser agent? We picked yes.
#
browserautomation
#
ai
#
scraping
#
webdev
Comments
Add Comment
7 min read
When web scraping breaks: using AI to extract messy data
zhongqiyue
zhongqiyue
zhongqiyue
Follow
May 29
When web scraping breaks: using AI to extract messy data
#
webdev
#
python
#
ai
#
scraping
Comments
Add Comment
5 min read
ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers
Romeo Mihalcea
Romeo Mihalcea
Romeo Mihalcea
Follow
May 5
ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers
#
webdev
#
security
#
scraping
#
ai
Comments
Add Comment
8 min read
I Tested 15 LLMs for Web Scraping and Built Heuristics Instead
Rohith
Rohith
Rohith
Follow
May 6
I Tested 15 LLMs for Web Scraping and Built Heuristics Instead
#
webdev
#
ai
#
scraping
#
javascript
Comments
Add Comment
3 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account