DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Prompt Engineering Patterns: From Zero-Shot to Chain-of-Thought Reasoning

Prompt Engineering Patterns: From Zero-Shot to Chain-of-Thought Reasoning

1
Comments
14 min read
Introduction to the Confluent REST Proxy

Introduction to the Confluent REST Proxy

2
Comments
4 min read
Why We Need Schema Registry in Kafka

Why We Need Schema Registry in Kafka

2
Comments
17 min read
While We're Measuring Developer Productivity, Won't Someone Think of the Data Engineers?

While We're Measuring Developer Productivity, Won't Someone Think of the Data Engineers?

Comments
9 min read
Azure Synapse Analytics

Azure Synapse Analytics

Comments
5 min read
Debugging Windows Race Conditions in Dagster

Debugging Windows Race Conditions in Dagster

Comments
3 min read
Design Patterns for Data Engineers: Cleaner ETL with the Builder Pattern.

Design Patterns for Data Engineers: Cleaner ETL with the Builder Pattern.

3
Comments
2 min read
Positional Encodings and Context Window Engineering: Why Token Order Matters

Positional Encodings and Context Window Engineering: Why Token Order Matters

3
Comments
12 min read
Trying Out Dagster for Data Orchestration

Trying Out Dagster for Data Orchestration

4
Comments
9 min read
🔐 Understanding Governance in Microsoft Fabric

🔐 Understanding Governance in Microsoft Fabric

1
Comments
3 min read
6 Different Data Formats Commonly Used in Data Analytics

6 Different Data Formats Commonly Used in Data Analytics

Comments
3 min read
Part 1: Snowflake's Autonomous Future

Part 1: Snowflake's Autonomous Future

Comments
8 min read
Scaling Customer Analytics: Designing ML Pipelines for Millions of Users

Scaling Customer Analytics: Designing ML Pipelines for Millions of Users

Comments
7 min read
Apache Dev Mail Digest: Iceberg & Polaris (Nov 12–17, 2025)

Apache Dev Mail Digest: Iceberg & Polaris (Nov 12–17, 2025)

Comments
4 min read
A Developer’s Guide to Apache Kafka: From Basics to Architecture in One Read

A Developer’s Guide to Apache Kafka: From Basics to Architecture in One Read

1
Comments
5 min read
How to Get Filtered Amazon Reviews into a Pandas DataFrame in Under 50 Lines of Python

How to Get Filtered Amazon Reviews into a Pandas DataFrame in Under 50 Lines of Python

Comments
3 min read
Comparing CsvPath and SodaCL

Comparing CsvPath and SodaCL

Comments
4 min read
Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

1
Comments
14 min read
Star vs. Snowflake Schema

Star vs. Snowflake Schema

Comments
4 min read
Data Engineer — Người Kiến Tạo “Dòng Chảy Dữ Liệu” Trong Kỷ Nguyên Số

Data Engineer — Người Kiến Tạo “Dòng Chảy Dữ Liệu” Trong Kỷ Nguyên Số

Comments
2 min read
Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Comments
5 min read
Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine

Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine

Comments
4 min read
From Pandas to Upstream Control: The Evolution PyData Needs Next

From Pandas to Upstream Control: The Evolution PyData Needs Next

Comments
6 min read
Building Reliable Legal AI: Never Missing a Supreme Court Case

Building Reliable Legal AI: Never Missing a Supreme Court Case

2
Comments
26 min read
Statistics Day 2: Correlation Isn’t Causation — Here’s Why It Matters!

Statistics Day 2: Correlation Isn’t Causation — Here’s Why It Matters!

5
Comments
4 min read
loading...