Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
End-to-End Real-Time Data Engineering on Databricks Using Spark Structured Streaming and Delta Lake
Nithyalakshmi Kamalakkannan
Nithyalakshmi Kamalakkannan
Nithyalakshmi Kamalakkannan
Follow
Jan 2
End-to-End Real-Time Data Engineering on Databricks Using Spark Structured Streaming and Delta Lake
#
dataengineering
#
handson
#
realtimeproject
#
spark
1
 reaction
Comments
Add Comment
1 min read
Building Bulletproof Data Pipelines: Orchestration, Testing, and Monitoring (Part 3 of 3)
Karthikeyan Rajasekaran
Karthikeyan Rajasekaran
Karthikeyan Rajasekaran
Follow
Jan 2
Building Bulletproof Data Pipelines: Orchestration, Testing, and Monitoring (Part 3 of 3)
#
dataengineering
#
dagster
#
dataquality
#
testing
1
 reaction
Comments
Add Comment
9 min read
LogInSight: A Lightweight CloudWatch Log Analytics Tool for Faster Debugging and Real-Time Insights
Sanjay_Balaji
Sanjay_Balaji
Sanjay_Balaji
Follow
Nov 29 '25
LogInSight: A Lightweight CloudWatch Log Analytics Tool for Faster Debugging and Real-Time Insights
#
fastapi
#
python
#
aws
#
dataengineering
2
 reactions
Comments
Add Comment
3 min read
The Proxy Economy: Residential, Datacenter, and ISP Rotation
Lalit Mishra
Lalit Mishra
Lalit Mishra
Follow
Jan 1
The Proxy Economy: Residential, Datacenter, and ISP Rotation
#
architecture
#
dataengineering
#
networking
Comments
2
 comments
5 min read
The Database Query That Could Cost a Company Millions(And Why Data Engineers Exist)
Thanh Truong
Thanh Truong
Thanh Truong
Follow
Jan 1
The Database Query That Could Cost a Company Millions(And Why Data Engineers Exist)
#
technology
#
dataengineering
#
techhistory
Comments
Add Comment
5 min read
RAG Evaluation Metrics: Measuring What Actually Matters
Vinicius Fagundes
Vinicius Fagundes
Vinicius Fagundes
Follow
Dec 21 '25
RAG Evaluation Metrics: Measuring What Actually Matters
#
llm
#
rag
#
dataengineering
#
ai
1
 reaction
Comments
Add Comment
10 min read
Why Data SLAs Fail — and How to Enforce Them with a Unified Reliability Framework
Baharath Bathula
Baharath Bathula
Baharath Bathula
Follow
Jan 1
Why Data SLAs Fail — and How to Enforce Them with a Unified Reliability Framework
#
dataengineering
#
aws
#
machinelearning
#
analytics
Comments
Add Comment
2 min read
When code-gen suggests deprecated Pandas APIs — a subtle drift that broke a pipeline
Gabriel
Gabriel
Gabriel
Follow
Jan 1
When code-gen suggests deprecated Pandas APIs — a subtle drift that broke a pipeline
#
dataengineering
#
devops
#
python
#
ai
Comments
Add Comment
3 min read
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
Eliana Lam
Eliana Lam
Eliana Lam
Follow
Nov 29 '25
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
#
analytics
#
dataengineering
#
architecture
#
opensource
Comments
Add Comment
4 min read
Building a Scalable Community Health Worker Analytics Platform: My Journey with dbt and Snowflake
Amos Augo
Amos Augo
Amos Augo
Follow
Nov 26 '25
Building a Scalable Community Health Worker Analytics Platform: My Journey with dbt and Snowflake
#
dbt
#
snowflake
#
dataengineering
Comments
Add Comment
4 min read
When an AI Suggests DataFrame.append: Missing Pandas Deprecations in Generated Code
Gabriel
Gabriel
Gabriel
Follow
Dec 31 '25
When an AI Suggests DataFrame.append: Missing Pandas Deprecations in Generated Code
#
codequality
#
dataengineering
#
llm
#
python
Comments
1
 comment
3 min read
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 31 '25
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
#
database
#
dataengineering
#
iceberg
#
apachehudi
Comments
Add Comment
18 min read
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 30 '25
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
#
architecture
#
aws
#
dataengineering
Comments
Add Comment
29 min read
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
Sandeep
Sandeep
Sandeep
Follow
Dec 9 '25
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
#
dataengineering
#
python
#
spark
#
bigdata
Comments
Add Comment
2 min read
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
Jubin Soni
Jubin Soni
Jubin Soni
Follow
Dec 30 '25
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
#
aws
#
serverless
#
stepfunctions
#
dataengineering
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account