DEV Community

# dataengineering

Posts

The Silent Bug That Exposed All Tenant Data in Databricks Unity Catalog

Comments
6 min read
Debugging a Multi-Container Airflow Pipeline: Kafka Network Isolation and the YAML Indentation Trap

Comments
7 min read
The Great Data Debate: Should You Build Your Warehouse Top-Down or Bottom-Up?

Comments
6 min read
Data Lake Architecture for Data Engineering Interviews

Comments
44 min read
Building a Dockerized Cryptocurrency ETL Pipeline with Apache Airflow

1
Comments
4 min read
Python 101

Comments
7 min read
A Beginners guide to Real-time Data Streaming with Apache Kafka

2
Comments
6 min read
Building a Real-Time Kafka + Cassandra Pipeline

1
Comments
6 min read
Beyond the SELECT: Mastering Advanced SQL for Surgical BI

Comments
2 min read
Managed Iceberg: Optimizing a Modern Lakehouse

Comments
16 min read
ClickHouse Duplicates: Clean Your Results vs. Clean Your Storage

3
Comments
3 min read
Airflow Version Upgrade for Enterprises: A Practical Blueprint for AWS, Snowflake, dbt, and Fintech Data Platforms

Comments
9 min read
Streamlining ETL Pipelines with Docker and Docker Compose in Data Engineering

Comments
3 min read
Why PDF-Style RAG Fails on Structured Enterprise Data

Comments 1
1 min read