Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
bigdata
Follow
Hide
Posts
Left menu
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Why Parquet Is Everywhere - And What Makes It Actually Fast?
Mohamed Hussain S
Mohamed Hussain S
Mohamed Hussain S
Follow
Nov 15 '25
Why Parquet Is Everywhere - And What Makes It Actually Fast?
#
dataengineering
#
parquet
#
bigdata
#
dataarchitecture
2
 reactions
Comments
Add Comment
3 min read
Data Quality at Scale: Why Your Pipeline Needs More Than Green Checkmarks
Pradeep Kalluri
Pradeep Kalluri
Pradeep Kalluri
Follow
Nov 24 '25
Data Quality at Scale: Why Your Pipeline Needs More Than Green Checkmarks
#
dataengineering
#
dataquality
#
bigdata
#
python
Comments
Add Comment
8 min read
Code for a Better Planet: Hacking UN SDGs 7-12 with Big Data
Vitali Sorenko
Vitali Sorenko
Vitali Sorenko
Follow
Oct 14 '25
Code for a Better Planet: Hacking UN SDGs 7-12 with Big Data
#
bigdata
#
datascience
#
sustainability
4
 reactions
Comments
Add Comment
7 min read
Drips to Data Streams: Hacking Water Scarcity with IoT & Big Data
Laetitia Perraut
Laetitia Perraut
Laetitia Perraut
Follow
Oct 13 '25
Drips to Data Streams: Hacking Water Scarcity with IoT & Big Data
#
iot
#
bigdata
#
sustainability
Comments
Add Comment
6 min read
đ„ Day 5: Introduction to DataFrames - The Most Importantce of Spark API
Sandeep
Sandeep
Sandeep
Follow
Dec 5 '25
đ„ Day 5: Introduction to DataFrames - The Most Importantce of Spark API
#
dataengineering
#
python
#
spark
#
bigdata
Comments
Add Comment
2 min read
Beyond Tagging: A Blueprint for Real-Time Cost Attribution in Data Platforms
Mahendran
Mahendran
Mahendran
Follow
Dec 22 '25
Beyond Tagging: A Blueprint for Real-Time Cost Attribution in Data Platforms
#
dataengineering
#
finops
#
bigdata
#
costoptimization
Comments
Add Comment
9 min read
From Raw to Refined: Data Pipeline Architecture at Scale
Pradeep Kalluri
Pradeep Kalluri
Pradeep Kalluri
Follow
Nov 22 '25
From Raw to Refined: Data Pipeline Architecture at Scale
#
dataengineering
#
bigdata
#
python
#
dataquality
Comments
Add Comment
12 min read
Fueling Climate Action with Code: A Dev's Guide to First, Second, and Third-Party Data
Bob Cars(on)
Bob Cars(on)
Bob Cars(on)
Follow
Oct 13 '25
Fueling Climate Action with Code: A Dev's Guide to First, Second, and Third-Party Data
#
data
#
climatechange
#
bigdata
Comments
Add Comment
7 min read
Day 9: Spark SQL Deep Dive - Temp Views, Query Execution & Optimization Tips for Data Engineers
Sandeep
Sandeep
Sandeep
Follow
Dec 9 '25
Day 9: Spark SQL Deep Dive - Temp Views, Query Execution & Optimization Tips for Data Engineers
#
python
#
dataengineering
#
spark
#
bigdata
Comments
Add Comment
2 min read
Day 12: UDF vs Pandas UDF
Sandeep
Sandeep
Sandeep
Follow
Dec 11 '25
Day 12: UDF vs Pandas UDF
#
python
#
dataengineering
#
spark
#
bigdata
Comments
Add Comment
2 min read
10x Query Performance Improvement: The Design and Implementation of the New Unique Key
Apache Doris
Apache Doris
Apache Doris
Follow
Nov 20 '25
10x Query Performance Improvement: The Design and Implementation of the New Unique Key
#
bigdata
#
olap
#
database
#
apachedoris
Comments
Add Comment
30 min read
Blockchain Analytics: Exploring Ethereum Data with BigQuery, RAG, and AI
Gary Zavaleta
Gary Zavaleta
Gary Zavaleta
Follow
Oct 23 '25
Blockchain Analytics: Exploring Ethereum Data with BigQuery, RAG, and AI
#
programming
#
ai
#
rag
#
bigdata
1
 reaction
Comments
1
 comment
1 min read
Overview of Real-Time Data Synchronization from PostgreSQL to VeloDB
Apache Doris
Apache Doris
Apache Doris
Follow
Dec 17 '25
Overview of Real-Time Data Synchronization from PostgreSQL to VeloDB
#
bigdata
#
postgressql
#
apachedoris
#
database
Comments
Add Comment
6 min read
Spark & Scala Cache Lessons from ETL Project
krlz
krlz
krlz
Follow
Sep 3 '25
Spark & Scala Cache Lessons from ETL Project
#
scala
#
programming
#
bigdata
#
spark
2
 reactions
Comments
1
 comment
3 min read
How to build real-time user-facing analytics with Kafka + Flink + Doris
Apache Doris
Apache Doris
Apache Doris
Follow
Oct 28 '25
How to build real-time user-facing analytics with Kafka + Flink + Doris
#
bigdata
#
olap
#
kafka
#
doris
4
 reactions
Comments
Add Comment
9 min read
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account