DEV Community
#bigdata
Posts
The future of Data Engineering in Databricks - From Pipelines to Intent
Arjun Krishna
Mar 3
#dataengineering #databricks #ai #bigdata
1 reaction
2 min read
From TB-Scale MongoDB to Doris: 5 Critical Challenges and Fixes with Apache SeaTunnel
Apache SeaTunnel
Feb 27
#apacheseatunnel #mongodb #opensource #bigdata
2 reactions
9 min read
(I) An Overview of Data Warehouses and Data Lakes
Apache SeaTunnel
Feb 27
#database #opensource #datascience #bigdata
2 reactions
4 min read
How to Size a Spark Cluster. And How Not To.
Arjun Krishna
Mar 1
#spark #dataengineering #distributedsystems #bigdata
1 reaction
6 min read
build-my-own-datalake: Improve metadata with caching
kination
Feb 28
#bigdata #metadata #rust #buildmyownx
3 reactions
19 min read
Part 1 | A Scheduler Is More Than Just a “Timer”
Chen Debra
Feb 5
#apachedolphinscheduler #opensource #programming #bigdata
4 min read
Scaling Relationship Discovery Across 100,000+ Fields Without Breaking Compute
Hello Arisyn
Feb 23
#scalablesystems #dataengineering #distributedsystems #bigdata
1 reaction
1 comment
2 min read
How to Implement Data Modelling in Power BI
Gathuru_M
Feb 2
#dataengineering #datascience #bigdata #beginners
2 reactions
2 min read
Designing a Cross-Cloud Data Plane with Apache Iceberg
Andrew Kalik
Jan 26
#dataengineering #gcp #aws #bigdata
2 reactions
5 min read
Arisyn: Rebuilding Data Relationship Discovery as Infrastructure
Hello Arisyn
Feb 9
#dataengineering #dataarchitecture #ai #bigdata
1 reaction
1 comment
3 min read
How I Built a Big Data Survival Guide - Because My Semester Was Not Surviving Me
Nitish
Feb 27
#datascience #opensource #learning #bigdata
2 reactions
1 comment
3 min read
Fuzzy-match millions of rows in Databricks (2026)
Siyana Hristova
Feb 25
#datascience #dataengineering #databricks #bigdata
8 reactions
5 min read
Overview of Real-Time Data Synchronization from PostgreSQL to VeloDB
Apache Doris
Jan 20
#postgres #bigdata #database #doris
5 min read
Bigtable vs BigQuery: What’s the difference? (2026 Guide)
Tech Croc
Feb 2
#bigdata #googlecloud #programming #algorithms
5 reactions
4 min read
Apache Iceberg Explained: From Data Lakes to Metadata, Snapshots, and Real-World Usage
Mohamed Hussain S
Jan 21
#datalake #apacheiceberg #dataengineering #bigdata
2 reactions
4 min read