DEV Community

# dataengineering

Posts

🐝 Why Hive Exists - And Why Its Complexity Is Actually Necessary

2
Comments
3 min read
🚀 Day 17 of My Python Learning Journey

Comments
1 min read
JOIN the data analytics race: Apache Doris vs. ClickHouse, Databricks, and Snowflake

Comments 1
6 min read
A Beginner’s Journey with PostgreSQL

2
Comments
3 min read
The "Shift-Left" Imperative: Implementing Data Contracts in CI/CD Pipeline

Comments
4 min read
Break Through Data Silos: Practices of Multi-cloud Observability Integration Based on Object Storage Service (OSS)

Comments
12 min read
Tutorial: Intro to Apache Iceberg with Apache Polaris and Apache Spark

2
Comments 4
20 min read
Comprehensive Guide: kwargs vs XCom in Python & Airflow

Comments
4 min read
Precise Data Extraction: Pattern-Based Partitioning for Structured Extraction

1
Comments
3 min read
Apache Gravitino 1.0.0 — From Metadata Management to Contextual Engineering

1
Comments
7 min read
Chinese DBA's Story: Hu Zhonghao - The Journey of Becoming a DBA for Domestic Distributed Databases

Comments 1
7 min read
Apache Kafka in Data engineering

6
Comments 1
1 min read
🧭System Design Roadmap for Data Engineers

4
Comments
3 min read
Orchestrating and Observing Data Pipelines with Airflow, PostgreSQL, and Polar

2
Comments
3 min read
Polars vs. Pandas: Why Your Next ETL Pipeline Should Run on Rust (Part 1/5)

1
Comments
2 min read