DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Building a PySpark and AWS Glue ETL Pipeline for Search Keyword Revenue Analysis

Building a PySpark and AWS Glue ETL Pipeline for Search Keyword Revenue Analysis

Comments
1 min read
Automating Data Workflows with Apache Airflow

Automating Data Workflows with Apache Airflow

1
Comments
6 min read
What Makes Apache Hudi a Game-Changer for Data Engineering

What Makes Apache Hudi a Game-Changer for Data Engineering

Comments
3 min read
Understanding Apache Kafka: A Beginner's Guide to Real-time Data Streaming

Understanding Apache Kafka: A Beginner's Guide to Real-time Data Streaming

Comments 2
4 min read
Shortcut & Mirroring

Shortcut & Mirroring

Comments
1 min read
Why PostgreSQL EXPLAIN ANALYZE Can Mislead You — and What to Use Instead

Why PostgreSQL EXPLAIN ANALYZE Can Mislead You — and What to Use Instead

Comments
6 min read
The Metadata Structure of Modern Table Formats

The Metadata Structure of Modern Table Formats

Comments
7 min read
Top 5 Data Engineering Tools for 2026: Why Python and SQL remain Kings

Top 5 Data Engineering Tools for 2026: Why Python and SQL remain Kings

Comments
2 min read
How I Built a Scalable Data Engineering Blog with Next.js & Supabase

How I Built a Scalable Data Engineering Blog with Next.js & Supabase

Comments
1 min read
Netflix Intelligent Lakehouse Solves Iceberg Maintenance — You Can Easily Too

Netflix Intelligent Lakehouse Solves Iceberg Maintenance — You Can Easily Too

1
Comments
8 min read
Load PostgreSQL into Apache Iceberg with Sling

Load PostgreSQL into Apache Iceberg with Sling

2
Comments
8 min read
Trial by Fire: From Garbage Excel to Relational Graph with Python and Pandas

Trial by Fire: From Garbage Excel to Relational Graph with Python and Pandas

Comments
4 min read
One Practical SQL Trigger Example You Can Actually Use

One Practical SQL Trigger Example You Can Actually Use

Comments
6 min read
Beating 250,000 Mental Comparisons: A Cross-Domain Engineer's Entity Resolution Case Study

Beating 250,000 Mental Comparisons: A Cross-Domain Engineer's Entity Resolution Case Study

1
Comments
10 min read
The Journey from Scattered Data to an Apache Iceberg Lakehouse with Governed Agentic Analytics

The Journey from Scattered Data to an Apache Iceberg Lakehouse with Governed Agentic Analytics

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.