DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Apache Fluss: Architecting the Streaming-First Persistent Data Stack

Apache Fluss: Architecting the Streaming-First Persistent Data Stack

Comments
3 min read
How a Self-Documenting Semantic Layer Reduces Data Team Toil

How a Self-Documenting Semantic Layer Reduces Data Team Toil

Comments 1
4 min read
Using Apache Iceberg with Python and MPP Query Engines

Using Apache Iceberg with Python and MPP Query Engines

Comments
7 min read
Understanding Apache Kafka: A Beginner-Friendly Guide

Understanding Apache Kafka: A Beginner-Friendly Guide

Comments
2 min read
Stop Naming Your Healthcare Columns Wrong — ISO-11179 Explained

Stop Naming Your Healthcare Columns Wrong — ISO-11179 Explained

Comments 2
6 min read
Aligning Timeouts in Distributed Orchestration: Why Equal Airflow and Spark Limits Lead to Race Conditions

Aligning Timeouts in Distributed Orchestration: Why Equal Airflow and Spark Limits Lead to Race Conditions

Comments
3 min read
The Missing Organizing Principle of Microsoft Fabric: Medallion Architecture Explained :gem:

The Missing Organizing Principle of Microsoft Fabric: Medallion Architecture Explained :gem:

Comments
2 min read
Slowly Changing Dimensions Explained: How Data Warehouses Keep History Accurate

Slowly Changing Dimensions Explained: How Data Warehouses Keep History Accurate

1
Comments
12 min read
Headless BI: How a Universal Semantic Layer Replaces Tool-Specific Models

Headless BI: How a Universal Semantic Layer Replaces Tool-Specific Models

Comments
3 min read
Data Virtualization and the Semantic Layer: Query Without Copying

Data Virtualization and the Semantic Layer: Query Without Copying

Comments 1
4 min read
What Are Lakehouse Catalogs? The Role of Catalogs in Apache Iceberg

What Are Lakehouse Catalogs? The Role of Catalogs in Apache Iceberg

Comments
7 min read
Partition Evolution: Change Your Partitioning Without Rewriting Data

Partition Evolution: Change Your Partitioning Without Rewriting Data

Comments
7 min read
Performance and Apache Iceberg's Metadata

Performance and Apache Iceberg's Metadata

1
Comments
7 min read
Organizing the Use Cases of AWS Step Functions and Glue Workflow for ETL Processing with AWS Glue Jobs

Organizing the Use Cases of AWS Step Functions and Glue Workflow for ETL Processing with AWS Glue Jobs

4
Comments
7 min read
The Feature Store: Consistency and Latency Are Both Non-Negotiable

The Feature Store: Consistency and Latency Are Both Non-Negotiable

3
Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.