Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
A Deep Dive into Apache Spark Architecture
Aryan Dandale
Aryan Dandale
Aryan Dandale
Follow
Oct 27 '25
A Deep Dive into Apache Spark Architecture
#
datascience
#
dataengineering
#
architecture
#
performance
1
 reaction
Comments
Add Comment
4 min read
Auto-Detecting CSV Schemas for Lightning-Fast ClickHouse Ingestion with Parquet
DarkEdges
DarkEdges
DarkEdges
Follow
Nov 7 '25
Auto-Detecting CSV Schemas for Lightning-Fast ClickHouse Ingestion with Parquet
#
node
#
dataengineering
#
database
#
automation
7
 reactions
Comments
Add Comment
5 min read
# Data Ingestion & Vector Store #llmszoomcamp
Abdelrahman Adnan
Abdelrahman Adnan
Abdelrahman Adnan
Follow
Oct 4 '25
# Data Ingestion & Vector Store #llmszoomcamp
#
python
#
dataengineering
#
rag
#
llm
Comments
Add Comment
2 min read
Database Fundamentals
Gilbert korir
Gilbert korir
Gilbert korir
Follow
Oct 4 '25
Database Fundamentals
#
postgressql
#
sql
#
nosql
#
dataengineering
Comments
Add Comment
3 min read
Distributed Media Inferencing with Kafka
Jayash Tripathy
Jayash Tripathy
Jayash Tripathy
Follow
Nov 6 '25
Distributed Media Inferencing with Kafka
#
ai
#
dataengineering
#
architecture
#
python
Comments
1
 comment
5 min read
🧑‍💻 Apache Kafka CLI – Detailed Course
nk sk
nk sk
nk sk
Follow
Oct 3 '25
🧑‍💻 Apache Kafka CLI – Detailed Course
#
cli
#
dataengineering
#
tutorial
Comments
Add Comment
2 min read
🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)
John Wakaba
John Wakaba
John Wakaba
Follow
Nov 4 '25
🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)
#
dataengineering
#
mongodb
#
python
#
automation
5
 reactions
Comments
Add Comment
5 min read
From 8 Minutes to 40 Seconds: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout
Byron Hsieh
Byron Hsieh
Byron Hsieh
Follow
Nov 6 '25
From 8 Minutes to 40 Seconds: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout
#
git
#
devops
#
dataengineering
#
azure
Comments
Add Comment
5 min read
Create a Microsoft Fabric Lakehouse
lotanna obianefo
lotanna obianefo
lotanna obianefo
Follow
Oct 2 '25
Create a Microsoft Fabric Lakehouse
#
database
#
dataengineering
#
cloudcomputing
#
datascience
5
 reactions
Comments
Add Comment
6 min read
Core Concepts of Kafka
Farhan Khan
Farhan Khan
Farhan Khan
Follow
Oct 2 '25
Core Concepts of Kafka
#
architecture
#
beginners
#
dataengineering
Comments
Add Comment
8 min read
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks.
elisha lukalia
elisha lukalia
elisha lukalia
Follow
Oct 2 '25
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks.
#
dataengineering
#
automation
#
tutorial
#
sql
Comments
Add Comment
9 min read
Apache Kafka: ZooKeeper vs. KRaft — A Complete Comparison of Approaches
Leanid Herasimau
Leanid Herasimau
Leanid Herasimau
Follow
Oct 3 '25
Apache Kafka: ZooKeeper vs. KRaft — A Complete Comparison of Approaches
#
architecture
#
backend
#
dataengineering
Comments
Add Comment
6 min read
Introduction to Apache Airflow
John Kioko
John Kioko
John Kioko
Follow
Oct 6 '25
Introduction to Apache Airflow
#
dataengineering
#
beginners
#
learning
#
python
1
 reaction
Comments
Add Comment
4 min read
Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK
André Paris
André Paris
André Paris
Follow
Oct 14 '25
Building a Production-Ready Data Lake: PostgreSQL to S3 with AWS DMS, Glue, and Athena using CDK
#
aws
#
dataengineering
#
typescript
#
postgres
2
 reactions
Comments
Add Comment
8 min read
From Postgres to Iceberg
Brian Misachi
Brian Misachi
Brian Misachi
Follow
Nov 5 '25
From Postgres to Iceberg
#
database
#
postgres
#
dataengineering
1
 reaction
Comments
Add Comment
11 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account