DEV Community

# dataengineering

Posts

From Glue to Horizon: Our Real Journey Building an Iceberg Lakehouse on Snowflake

Comments
10 min read
From Transactions to Insights: How OLTP and OLAP Work Together in Modern Data Pipelines

2
Comments
4 min read
Query CSV, Excel, Parquet, and Arrow files in the Browser with DuckDB-Wasm + Next.js 🦆✨

Comments 1
4 min read
Fraud Detection and Recommendation Are the Same Pipeline. Most Teams Build Two.

Comments
3 min read
PySpark to Pandas/scikit-learn: A Practical Migration Guide for Data Engineers Learning ML

Comments
7 min read
FINAL in ClickHouse Isn’t as Expensive as It Used to Be

3
Comments
4 min read
🚀 DB Explorer 3.0.1 — The AI‑First SQL Editor You’ll Want to Try

Comments
1 min read
My first data pipeline

Comments
1 min read
ETL vs ELT: Which One Should You Use and Why?

1
Comments
6 min read
ETL vs ELT: Which One Should You Use and Why?

1
Comments
4 min read
A Beginner’s Guide to Apache Kafka: The Engine of Real-Time Data

2
Comments
11 min read
Extract Transform Load vs Extract Load Transform (ETL vs ELT)

Comments
5 min read
Entity Resolution at Scale: Matching Products Across Amazon, Reddit, and RTINGS

Comments
4 min read
Apache Data Lakehouse Weekly: April 3–9, 2026

Comments
7 min read
Airflow vs Prefect vs Dagster: Picking the Right Orchestrator in 2026

Comments
6 min read