DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Bypassing Scraper Latency: Building a Real-Time Economic Indicator (REI) Tracker with Python

Comments
4 min read
Exodus Point Data Engineering Interview Questions: Full DE Prep Guide

Exodus Point Data Engineering Interview Questions: Full DE Prep Guide

Comments
20 min read
Sample dataset analysis: a 100-row snapshot of Bazaraki

Sample dataset analysis: a 100-row snapshot of Bazaraki

Comments
3 min read
Comparing approaches to extracting Hacker News Who Is Hiring data

Comparing approaches to extracting Hacker News Who Is Hiring data

Comments
3 min read
Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight

Building a Letterboxd Film & Review data pipeline: from raw scrape to first insight

Comments
3 min read
Differences Between Snowflake Editions and Secure Connectivity with AWS

Differences Between Snowflake Editions and Secure Connectivity with AWS

3
Comments
8 min read
Bâtir un Système de Maintenance Prédictive : De l’IoT Industriel à l’Analyse Vectorielle 🏭🤖

Bâtir un Système de Maintenance Prédictive : De l’IoT Industriel à l’Analyse Vectorielle 🏭🤖

Comments
3 min read
Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run

Feeding Raw HTML to Your LLM Is a Token Tax. I Measured It on 10 Real Pages — Median 7.4 , and It Hits Every Scheduled Run

2
Comments 1
8 min read
Sample dataset analysis: a 100-row snapshot of Sitemap

Sample dataset analysis: a 100-row snapshot of Sitemap

Comments
3 min read
Why a single timestamp breaks real-time aggregation

Why a single timestamp breaks real-time aggregation

Comments
7 min read
ETL vs. ELT: Which Approach Should You Use and Why?

ETL vs. ELT: Which Approach Should You Use and Why?

1
Comments
2 min read
FOCUS 1.2 Migration: What Breaks When You Move Off CUR

FOCUS 1.2 Migration: What Breaks When You Move Off CUR

Comments
5 min read
Leakage in ML Pipelines: How to build a bulletproof preprocessing architecture

Leakage in ML Pipelines: How to build a bulletproof preprocessing architecture

Comments
6 min read
Python and How Python Is Used In The Data Analytics Space. A Beginner's Guide.

Python and How Python Is Used In The Data Analytics Space. A Beginner's Guide.

Comments
5 min read
Data Integrity in AI-Powered Content Pipelines: Practical Approaches

Data Integrity in AI-Powered Content Pipelines: Practical Approaches

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.