DEV Community

# dataengineering

Posts

International SaaS Nightmare: Timezone Edge Cases (And How to Solve Them Once and For All)

Comments
2 min read
📘 Foundation Phase Completed - Starting Phase 2 of My Journey

Comments
3 min read
Parallelism in Python for Data Engineering: The Secret of I/O-Bound Tasks

Comments 1
6 min read
Skip the Database: Building Analytics Dashboards Directly from S3 Files

17
Comments 3
9 min read
From smog to streams: how data engineering helps us breathe easier.

1
Comments 1
4 min read
Apache Iceberg dev list digest (Sept 8–12, 2025)

1
Comments
5 min read
The Ultimate Guide to Open Table Formats: Iceberg, Delta Lake, Hudi, Paimon, and DuckLake

2
Comments
16 min read
How to Pass the AWS Certified Data Engineer – Associate (DEA-C01) Exam 2025

Comments 1
5 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
7 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
4 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
10 min read
🦆 DuckDB: The SQLite of Analytics You Didn’t Know You Needed

Comments
3 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

1
Comments
6 min read
Real-Time Fraud Detection Using Apache Flink

1
Comments
1 min read
Personal Picks: Data Product News (August 20, 2025)

Comments
6 min read
Apache Polaris Dev List Digest (Sept 15–19, 2025)

Comments
4 min read
Building a Reddit Sentiment Pipeline using Python, PostgreSQL, VADER, Airflow, Grafana, Prometheus and StatsD

1
Comments 1
5 min read
Is Prompt Engineering Just Hype for Now?

7
Comments
3 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
7 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

1
Comments
4 min read
Apache Polaris Dev Mailing List — Weekly Digest (Aug 11–17, 2025)

Comments
4 min read
How ETL Pipelines Power Modern Data Analytics

Comments
1 min read
Polyglot Data Engineering: Python + Go in the Same Pipeline

3
Comments 2
2 min read
🚀 Why You Should Pick Auto Loader Over Structured Streaming in Azure Databricks (The Funny Truth)

Comments
2 min read
Real-Time CDC with Debezium and Kafka for Sharded PostgreSQL Integration

1
Comments
9 min read