DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
dbt snapshots: moving from merges to native history

dbt snapshots: moving from merges to native history

1
Comments
5 min read
Apache Parquet File Anatomy: Row Groups, Column Chunks, Pages, and Metadata Explained 🧱📦

Apache Parquet File Anatomy: Row Groups, Column Chunks, Pages, and Metadata Explained 🧱📦

Comments
8 min read
Turing Completeness in Reactivity

Turing Completeness in Reactivity

5
Comments
4 min read
A Beginner's Guide to SQL Joins and Window Functions

A Beginner's Guide to SQL Joins and Window Functions

1
Comments
6 min read
ETL VS ELT: WHICH ONE SHOULD YOU USE AND WHY?

ETL VS ELT: WHICH ONE SHOULD YOU USE AND WHY?

3
Comments
5 min read
dbt docs

dbt docs

1
Comments
7 min read
Data Engineering Interview Prep (2026): What Actually Matters (SQL, Pipelines, System Design)

Prioritizes clear thinking under pressure

Data Engineering Interview Prep (2026): What Actually Matters (SQL, Pipelines, System Design)

78
Comments 12
8 min read
AWS Lake Formation: Why Your Data Lake Permissions Are Probably a Mess (And How to Fix That)

AWS Lake Formation: Why Your Data Lake Permissions Are Probably a Mess (And How to Fix That)

Comments
3 min read
How We Generate AI Network Digests for MegaETH at MiniBlocks.io

How We Generate AI Network Digests for MegaETH at MiniBlocks.io

1
Comments
8 min read
Understanding Vector Pipelines: From Config Files to Data Flow

Understanding Vector Pipelines: From Config Files to Data Flow

2
Comments
3 min read
Stop Losing Your Medical Records: Build a Multimodal Health RAG with LlamaIndex & Qdrant 🩺

Stop Losing Your Medical Records: Build a Multimodal Health RAG with LlamaIndex & Qdrant 🩺

1
Comments
4 min read
From Scrape to Feed: Building a Google Merchant Center CSV from Zappos Data

From Scrape to Feed: Building a Google Merchant Center CSV from Zappos Data

Comments
4 min read
Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know

Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know

1
Comments 1
6 min read
How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C

How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C

Comments
2 min read
How Google Maps Predicts Traffic in Real Time: Live Data and ETA Explained

How Google Maps Predicts Traffic in Real Time: Live Data and ETA Explained

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.