DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction

The Waterfall Pattern: A Tiered Strategy for Reliable Data Extraction

1
Comments 1
5 min read
Schemas and Data Modelling in Power BI

Schemas and Data Modelling in Power BI

2
Comments
7 min read
AWS Data Engineer Associate (DEA-C01): What Each Domain Actually Tests (From Someone Who Just Passed)

AWS Data Engineer Associate (DEA-C01): What Each Domain Actually Tests (From Someone Who Just Passed)

Comments
2 min read
A 2026 Introduction to Apache Iceberg

A 2026 Introduction to Apache Iceberg

Comments
6 min read
Apache Data Lakehouse Weekly: February 4-11, 2026

Apache Data Lakehouse Weekly: February 4-11, 2026

Comments
6 min read
Machine Learning Starts With a WHERE Clause

Machine Learning Starts With a WHERE Clause

1
Comments
2 min read
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

2
Comments
6 min read
Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Comments
4 min read
Architecting and Operating Geospatial Workflows with Dagster

Architecting and Operating Geospatial Workflows with Dagster

Comments
8 min read
Arquitetura de Alta Performance: O "Sob o CapĂ´" da Modern Data Stack

Arquitetura de Alta Performance: O "Sob o CapĂ´" da Modern Data Stack

Comments
5 min read
Part 3: Testing, Documentation & Deployment 🚀

Part 3: Testing, Documentation & Deployment 🚀

Comments
5 min read
Part 2: dbt Project Structure & Building Models 📁

Part 2: dbt Project Structure & Building Models 📁

Comments
4 min read
# Module 4 Summary - Analytics Engineering with dbt

# Module 4 Summary - Analytics Engineering with dbt

Comments
2 min read
Stop Wrestling With Apache POI—Meet Sheetz, the One-Liner Excel Library for Java

Stop Wrestling With Apache POI—Meet Sheetz, the One-Liner Excel Library for Java

2
Comments
5 min read
Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools

Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools

3
Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.