DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Realtime Data Streaming Platform: Building a Unified Monitoring Stack

Realtime Data Streaming Platform: Building a Unified Monitoring Stack

3
Comments
8 min read
The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

2
Comments
7 min read
Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

4
Comments
6 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?

Why Parquet Is Everywhere - And What Makes It Actually Fast?

2
Comments
3 min read
🎓 Building a Smart LMS Assistant: RAG System with Pinecone for Multi-Source Learning Data

🎓 Building a Smart LMS Assistant: RAG System with Pinecone for Multi-Source Learning Data

Comments
3 min read
Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

4
Comments
4 min read
Big Data Processing (Hadoop, Spark)

Big Data Processing (Hadoop, Spark)

2
Comments
5 min read
Building a clean Energy Data Pipeline for Africa( from raw CSVs to MongoDB)

Building a clean Energy Data Pipeline for Africa( from raw CSVs to MongoDB)

Comments
1 min read
Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach

5
Comments
14 min read
From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

Comments
7 min read
Data in the Cloud: Understanding 6 Common Data Formats in Analytics

Data in the Cloud: Understanding 6 Common Data Formats in Analytics

Comments
3 min read
A real-world example of CsvPath schemas

A real-world example of CsvPath schemas

Comments
5 min read
Guia arquitetônico de ponta para a construção de uma plataforma de dados

Guia arquitetônico de ponta para a construção de uma plataforma de dados

Comments
6 min read
Inside the Edge: How Real-Time Data Pipelines Power Connected Devices

Inside the Edge: How Real-Time Data Pipelines Power Connected Devices

1
Comments
3 min read
Python For Data Engineering

Python For Data Engineering

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.