DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

2
Comments
7 min read
Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

1
Comments
6 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?

Why Parquet Is Everywhere - And What Makes It Actually Fast?

3
Comments
3 min read
Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

4
Comments
4 min read
Big Data Processing (Hadoop, Spark)

Big Data Processing (Hadoop, Spark)

2
Comments
5 min read
Building a clean Energy Data Pipeline for Africa( from raw CSVs to MongoDB)

Building a clean Energy Data Pipeline for Africa( from raw CSVs to MongoDB)

Comments
1 min read
Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

5
Comments
14 min read
From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

Comments
7 min read
Data in the Cloud: Understanding 6 Common Data Formats in Analytics

Data in the Cloud: Understanding 6 Common Data Formats in Analytics

Comments
3 min read
A real-world example of CsvPath schemas

A real-world example of CsvPath schemas

Comments
5 min read
Guia arquitetônico de ponta para a construção de uma plataforma de dados

Guia arquitetônico de ponta para a construção de uma plataforma de dados

Comments
6 min read
Inside the Edge: How Real-Time Data Pipelines Power Connected Devices

Inside the Edge: How Real-Time Data Pipelines Power Connected Devices

1
Comments
3 min read
Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Comments
5 min read
Python For Data Engineering

Python For Data Engineering

Comments
3 min read
Picking the Right Data Format for Your Workflow

Picking the Right Data Format for Your Workflow

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.