DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Introduction to Apache Airflow

Introduction to Apache Airflow

1
Comments
4 min read
Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK

Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK

2
Comments
8 min read
From Postgres to Iceberg

From Postgres to Iceberg

1
Comments
11 min read
Real-Time Cryptocurrency Data Pipeline

Real-Time Cryptocurrency Data Pipeline

Comments
12 min read
Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

2
Comments
10 min read
Personal Picks: Data Product News (October 1, 2025)

Personal Picks: Data Product News (October 1, 2025)

Comments
7 min read
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

6
Comments
7 min read
SQL: is there a better way to code this?

SQL: is there a better way to code this?

Comments 1
1 min read
Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC

Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC

Comments
5 min read
How to Convert Excel to CSV in Python using Spire.XLS for Python

How to Convert Excel to CSV in Python using Spire.XLS for Python

Comments
4 min read
Building a Sales Database in PostgreSQL — Schema, Data & JOIN Examples

Building a Sales Database in PostgreSQL — Schema, Data & JOIN Examples

4
Comments
6 min read
Building Self-Healing, Reliable Data Pipelines That Think

Building Self-Healing, Reliable Data Pipelines That Think

Comments 1
4 min read
Interesting links - September 2025

Interesting links - September 2025

Comments
13 min read
Beyond the Browser: Crafting a Robust Web Scraping Pipeline for Dynamic Sports Data

Beyond the Browser: Crafting a Robust Web Scraping Pipeline for Dynamic Sports Data

Comments 1
3 min read
Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.