Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Google's LEGO tribute đź§©
Jigyasa Grover
Jigyasa Grover
Jigyasa Grover
Follow
for
Google Developer Experts
Jan 9
Google's LEGO tribute đź§©
#
computerscience
#
dataengineering
#
google
#
systemdesign
27
 reactions
Comments
8
 comments
1 min read
The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts
Mwenda Harun Mbaabu
Mwenda Harun Mbaabu
Mwenda Harun Mbaabu
Follow
May 21 '25
The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts
#
dataengineering
#
bash
#
datascience
#
python
85
 reactions
Comments
4
 comments
4 min read
When Small Parquet Files Become a Big Problem (and How I Ended Up Writing a Compactor in PyArrow)
Olga Braginskaya
Olga Braginskaya
Olga Braginskaya
Follow
May 9 '25
When Small Parquet Files Become a Big Problem (and How I Ended Up Writing a Compactor in PyArrow)
#
dataengineering
#
python
#
data
#
tutorial
17
 reactions
Comments
2
 comments
5 min read
Join Data from Anywhere: The Streaming SQL Engine That Bridges Databases, APIs, and Files
Theodore P.
Theodore P.
Theodore P.
Follow
Dec 16 '25
Join Data from Anywhere: The Streaming SQL Engine That Bridges Databases, APIs, and Files
#
python
#
database
#
dataengineering
#
sql
8
 reactions
Comments
1
 comment
17 min read
Why 71,000 Data Engineers Read My Article: What I Learned About Technical Writing
Pradeep Kalluri
Pradeep Kalluri
Pradeep Kalluri
Follow
Dec 8 '25
Why 71,000 Data Engineers Read My Article: What I Learned About Technical Writing
#
dataengineering
#
writing
#
programming
#
career
4
 reactions
Comments
1
 comment
6 min read
Real-Time is an SLA, Not an Architecture: When You Actually Need Kafka (And When You Don't)
Vinicius Fagundes
Vinicius Fagundes
Vinicius Fagundes
Follow
Jan 11
Real-Time is an SLA, Not an Architecture: When You Actually Need Kafka (And When You Don't)
#
discuss
#
architecture
#
dataengineering
#
career
1
 reaction
Comments
Add Comment
10 min read
🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)
John Wakaba
John Wakaba
John Wakaba
Follow
Nov 4 '25
🌍 Automating Africa’s Energy Data Collection Using Python, Playwright(+Why Playwright ?), and MongoDB (2000–2024)
#
dataengineering
#
mongodb
#
python
#
automation
5
 reactions
Comments
Add Comment
5 min read
S3-Native Kafka Alternatives: What's Actually Different
Alexander Alten
Alexander Alten
Alexander Alten
Follow
Jan 2
S3-Native Kafka Alternatives: What's Actually Different
#
kafka
#
dataengineering
#
opensource
#
devops
Comments
Add Comment
3 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?
Mohamed Hussain S
Mohamed Hussain S
Mohamed Hussain S
Follow
Nov 15 '25
Why Parquet Is Everywhere - And What Makes It Actually Fast?
#
dataengineering
#
parquet
#
bigdata
#
dataarchitecture
2
 reactions
Comments
Add Comment
3 min read
RIP Amazon Data Firehose Change Data Capture
Paul SANTUS
Paul SANTUS
Paul SANTUS
Follow
for
AWS Community Builders
Oct 2 '25
RIP Amazon Data Firehose Change Data Capture
#
dataengineering
#
aws
#
cloud
#
firehose
7
 reactions
Comments
3
 comments
4 min read
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025
Anuj Bolewar
Anuj Bolewar
Anuj Bolewar
Follow
Oct 17 '25
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025
#
machinelearning
#
deeplearning
#
dataengineering
#
pca
1
 reaction
Comments
Add Comment
4 min read
Writes, 3 ways: Postgres, Apache Kafka® and Apache Iceberg™
Celeste Horgan
Celeste Horgan
Celeste Horgan
Follow
Nov 6 '25
Writes, 3 ways: Postgres, Apache Kafka® and Apache Iceberg™
#
postgres
#
dataengineering
#
learning
#
database
1
 reaction
Comments
Add Comment
10 min read
From smog to streams: how data engineering helps us breathe easier.
Oliver Samuel
Oliver Samuel
Oliver Samuel
Follow
Oct 20 '25
From smog to streams: how data engineering helps us breathe easier.
#
architecture
#
dataengineering
#
opensource
1
 reaction
Comments
1
 comment
4 min read
Data Quality at Scale: Why Your Pipeline Needs More Than Green Checkmarks
Pradeep Kalluri
Pradeep Kalluri
Pradeep Kalluri
Follow
Nov 24 '25
Data Quality at Scale: Why Your Pipeline Needs More Than Green Checkmarks
#
dataengineering
#
dataquality
#
bigdata
#
python
Comments
Add Comment
8 min read
The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction
Robert N. Gutierrez
Robert N. Gutierrez
Robert N. Gutierrez
Follow
Feb 14
The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction
#
webscraping
#
dataengineering
#
devops
#
dataextraction
1
 reaction
Comments
1
 comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account