Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python
kazutaka kobayashi
kazutaka kobayashi
kazutaka kobayashi
Follow
May 20
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python
#
python
#
webscraping
#
dataengineering
#
economics
Comments
Add Comment
4 min read
Exodus Point Data Engineering Interview Questions: Full DE Prep Guide
Gowtham Potureddi
Gowtham Potureddi
Gowtham Potureddi
Follow
May 15
Exodus Point Data Engineering Interview Questions: Full DE Prep Guide
#
python
#
sql
#
interview
#
dataengineering
Comments
Add Comment
20 min read
Sample dataset analysis: a 100-row snapshot of Bazaraki
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Sample dataset analysis: a 100-row snapshot of Bazaraki
#
webscraping
#
apify
#
realestate
#
dataengineering
Comments
Add Comment
3 min read
Comparing approaches to extracting Hacker News Who Is Hiring data
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Comparing approaches to extracting Hacker News Who Is Hiring data
#
webscraping
#
apify
#
jobs
#
dataengineering
Comments
Add Comment
3 min read
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight
#
webscraping
#
apify
#
socialmedia
#
dataengineering
Comments
Add Comment
3 min read
Differences Between Snowflake Editions and Secure Connectivity with AWS
Aki
Aki
Aki
Follow
for
AWS Community Builders
May 15
Differences Between Snowflake Editions and Secure Connectivity with AWS
#
aws
#
snowflake
#
dataengineering
3
reactions
Comments
Add Comment
8 min read
Bâtir un Système de Maintenance Prédictive : De l’IoT Industriel à l’Analyse Vectorielle 🏭🤖
Serge Mbela
Serge Mbela
Serge Mbela
Follow
May 15
Bâtir un Système de Maintenance Prédictive : De l’IoT Industriel à l’Analyse Vectorielle 🏭🤖
#
architecture
#
dataengineering
#
iot
#
machinelearning
Comments
Add Comment
3 min read
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run
Alex Spinov
Alex Spinov
Alex Spinov
Follow
May 28
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run
#
webscraping
#
python
#
ai
#
dataengineering
2
reactions
Comments
1
comment
8 min read
Sample dataset analysis: a 100-row snapshot of Sitemap
Can Yılmaz
Can Yılmaz
Can Yılmaz
Follow
May 15
Sample dataset analysis: a 100-row snapshot of Sitemap
#
webscraping
#
apify
#
data
#
dataengineering
Comments
Add Comment
3 min read
Why a single timestamp breaks real-time aggregation
Vitalii Buhaiov
Vitalii Buhaiov
Vitalii Buhaiov
Follow
for
MarketTrace
May 14
Why a single timestamp breaks real-time aggregation
#
distributedsystems
#
dataengineering
#
webdev
#
cryptocurrency
Comments
Add Comment
7 min read
ETL vs. ELT: Which Approach Should You Use and Why?
Gathuru_M
Gathuru_M
Gathuru_M
Follow
May 14
ETL vs. ELT: Which Approach Should You Use and Why?
#
architecture
#
beginners
#
data
#
dataengineering
1
reaction
Comments
Add Comment
2 min read
FOCUS 1.2 Migration: What Breaks When You Move Off CUR
Khushi Dubey
Khushi Dubey
Khushi Dubey
Follow
May 14
FOCUS 1.2 Migration: What Breaks When You Move Off CUR
#
analytics
#
aws
#
cloud
#
dataengineering
Comments
Add Comment
5 min read
Leakage in ML Pipelines: How to build a bulletproof preprocessing architecture
Pasquale Molinaro
Pasquale Molinaro
Pasquale Molinaro
Follow
May 14
Leakage in ML Pipelines: How to build a bulletproof preprocessing architecture
#
ai
#
computerscience
#
dataengineering
#
machinelearning
Comments
Add Comment
6 min read
Python and How Python Is Used In The Data Analytics Space. A Beginner's Guide.
Joseous Ng'ash
Joseous Ng'ash
Joseous Ng'ash
Follow
May 15
Python and How Python Is Used In The Data Analytics Space. A Beginner's Guide.
#
python
#
datascience
#
analytics
#
dataengineering
Comments
Add Comment
5 min read
Data Integrity in AI-Powered Content Pipelines: Practical Approaches
Mustafa ERBAY
Mustafa ERBAY
Mustafa ERBAY
Follow
May 14
Data Integrity in AI-Powered Content Pipelines: Practical Approaches
#
ai
#
dataintegrity
#
pipeline
#
dataengineering
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account