DEV Community

# dataengineering

Posts

What is Apache Iceberg? The Table Format Revolution

4 min read
What is Apache Parquet? Columns, Encoding, and Performance

4 min read
Integrating OIC with Databricks using AWS S3 (External Tables Approach)

2 min read
ETL vs ELT: Which One Should You Use and Why?

4 min read
The Risk Map: Architecting an Obsolescence-Immune Data Foundation

3 min read
Your Database Workflow Is Broken (And It’s Not Your Fault)

2 min read
Agentic Analytics on the Apache Lakehouse

4 min read
Welcome to the World of SQL

5 min read
"Beating 250,000 Mental Comparisons: A Cross-Domain Engineer's Entity Resolution Case Study"
10 min read
Why We Open-Sourced Our Database Query Layer

4 min read
State of Data Engineering 2026: Why Data Teams Spend 60% of Their Time Firefighting

3 min read
Why I Built AnomalyArmor

3 min read
How Real Data Engineering Powers AI Customer Intelligence

3 min read
Practical SQL Concepts

4 min read
Building a Hantavirus Misinformation Detector: Challenges of NLP in Low-Data Health Domains

3 min read