DEV Community

# dataengineering

Posts

What Spark Interviews Actually Test (Based on 189 Real Interview Reports)

7 min read
AWS Lake Formation: TBAC vs NBAC — The Permission Model Decision That Will Define Your Data Lake

4 min read
DuckDB in the Wild: What 6 Minutes of Benchmarking Across 4 Machines Taught Me About Real-World Performance

5 min read
ETL vs ELT: Understanding the Two Pillars of Modern Data Engineering

16 min read
AI-Powered Digital Transformation Strategies

3 min read
Apache Data Lakehouse Weekly: April 9–15, 2026

7 min read
Trying Out Snowflake's Adaptive Warehouse — Auto-Scaling Compute Without Manual Sizing

10 min read
Backpressure in document pipelines is an architecture problem first

2 min read
Why mixed document packs make extraction pipelines harder to trust

2 min read
Why Cursor AI Won't Replace Data Engineers (And How to Actually Use It)

2 min read
Time-Series Databases (InfluxDB/TimescaleDB)

8 min read
PostgreSQL to Snowflake: A Hands-On CDC Streaming Guide

13 min read
ETL vs ELT: Which One Should You Use and Why?

6 min read
TaskFlow API vs Traditional Operators in Apache Airflow

3 min read
What is Apache Arrow? Erasing the Serialization Tax

3 min read