Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
webscraping
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
AI-Powered Web Scraper
zahidkhan-xen
zahidkhan-xen
zahidkhan-xen
Follow
May 17
AI-Powered Web Scraper
#
showdev
#
ai
#
automation
#
webscraping
Comments
Add Comment
4 min read
I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.
sab0tajue
sab0tajue
sab0tajue
Follow
May 16
I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.
#
analytics
#
data
#
sideprojects
#
webscraping
Comments
Add Comment
3 min read
Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide
Alex Chen
Alex Chen
Alex Chen
Follow
May 15
Scraping 1000 Pages in 10 Seconds: Python Async HTTP Guide
#
performance
#
python
#
tutorial
#
webscraping
1
 reaction
Comments
Add Comment
5 min read
Scraping Dynamic Web Pages Without Selectors Using AI Vision (TypeScript/JavaScript Tutorial)
Paras Tejpal
Paras Tejpal
Paras Tejpal
Follow
Jun 19
Scraping Dynamic Web Pages Without Selectors Using AI Vision (TypeScript/JavaScript Tutorial)
#
webscraping
#
javascript
#
ai
#
node
1
 reaction
Comments
Add Comment
2 min read
Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level
Tinyfishie
Tinyfishie
Tinyfishie
Follow
May 15
Why Web Agents Fail on Protected Sites — And How to Fix It at the Infrastructure Level
#
agents
#
infrastructure
#
security
#
webscraping
Comments
Add Comment
7 min read
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python
kazutaka kobayashi
kazutaka kobayashi
kazutaka kobayashi
Follow
May 20
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python
#
python
#
webscraping
#
dataengineering
#
economics
Comments
Add Comment
4 min read
FULL SSRF + EXFILTRACION EN CRAWLEE
arturo melgarejo
arturo melgarejo
arturo melgarejo
Follow
May 15
FULL SSRF + EXFILTRACION EN CRAWLEE
#
cybersecurity
#
security
#
spanish
#
webscraping
Comments
Add Comment
12 min read
What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked
#
webscraping
#
apify
#
data
#
tutorial
Comments
Add Comment
3 min read
Sample dataset analysis: a 100-row snapshot of Bazaraki
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Sample dataset analysis: a 100-row snapshot of Bazaraki
#
webscraping
#
apify
#
realestate
#
dataengineering
Comments
Add Comment
3 min read
Comparing approaches to extracting Hacker News Who Is Hiring data
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Comparing approaches to extracting Hacker News Who Is Hiring data
#
webscraping
#
apify
#
jobs
#
dataengineering
Comments
Add Comment
3 min read
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight
#
webscraping
#
apify
#
socialmedia
#
dataengineering
Comments
Add Comment
3 min read
What I learned scraping ClinicalTrials.gov: schema, gotchas and the tooling that worked
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
What I learned scraping ClinicalTrials.gov: schema, gotchas and the tooling that worked
#
webscraping
#
apify
#
opendata
#
tutorial
Comments
Add Comment
3 min read
How I Built a Real Chinese Product Review Aggregator (and Why English Reviews Are Broken)
wuledan
wuledan
wuledan
Follow
May 15
How I Built a Real Chinese Product Review Aggregator (and Why English Reviews Are Broken)
#
showdev
#
sideprojects
#
webdev
#
webscraping
Comments
Add Comment
1 min read
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run
Alex Spinov
Alex Spinov
Alex Spinov
Follow
May 28
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run
#
webscraping
#
python
#
ai
#
dataengineering
2
 reactions
Comments
1
 comment
8 min read
What I learned scraping Website Contact: schema, gotchas and the tooling that worked
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
What I learned scraping Website Contact: schema, gotchas and the tooling that worked
#
webscraping
#
apify
#
leadgen
#
tutorial
Comments
Add Comment
3 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account