DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
A Deep Dive into Apache Spark Architecture

A Deep Dive into Apache Spark Architecture

1
Comments
4 min read
Auto-Detecting CSV Schemas for Lightning-Fast ClickHouse Ingestion with Parquet

Auto-Detecting CSV Schemas for Lightning-Fast ClickHouse Ingestion with Parquet

7
Comments
5 min read
# Data Ingestion & Vector Store #llmszoomcamp

# Data Ingestion & Vector Store #llmszoomcamp

Comments
2 min read
Database Fundamentals

Database Fundamentals

Comments
3 min read
Distributed Media Inferencing with Kafka

Distributed Media Inferencing with Kafka

Comments 1
5 min read
🧑‍💻 Apache Kafka CLI – Detailed Course

🧑‍💻 Apache Kafka CLI – Detailed Course

Comments
2 min read
🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)

🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)

5
Comments
5 min read
From 8 Minutes to 40 Seconds: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout

From 8 Minutes to 40 Seconds: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout

Comments
5 min read
Create a Microsoft Fabric Lakehouse

Create a Microsoft Fabric Lakehouse

5
Comments
6 min read
Core Concepts of Kafka

Core Concepts of Kafka

Comments
8 min read
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks.

From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks.

Comments
9 min read
Apache Kafka: ZooKeeper vs. KRaft — A Complete Comparison of Approaches

Apache Kafka: ZooKeeper vs. KRaft — A Complete Comparison of Approaches

Comments
6 min read
Introduction to Apache Airflow

Introduction to Apache Airflow

1
Comments
4 min read
Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK

Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK

2
Comments
8 min read
From Postgres to Iceberg

From Postgres to Iceberg

1
Comments
11 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.