DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I built a feature store in pure Python to finally understand the point-in-time join

I built a feature store in pure Python to finally understand the point-in-time join

Comments
6 min read
Data Lineage Is a Vanity Metric Without Business Context

Data Lineage Is a Vanity Metric Without Business Context

Comments
6 min read
I Built a Bloom Filter from Scratch in Pure Python and Finally Understood How Databases Skip Reading Data

I Built a Bloom Filter from Scratch in Pure Python and Finally Understood How Databases Skip Reading Data

Comments
6 min read
# go-intake: Go-Native Streaming Data Ingestion Toolkit

# go-intake: Go-Native Streaming Data Ingestion Toolkit

Comments
3 min read
QN : Ingest Data with Dataflows Gen2 in Microsoft Fabric

QN : Ingest Data with Dataflows Gen2 in Microsoft Fabric

Comments
2 min read
From Data Quality Checks to Analytics-Ready Parquet with Python

From Data Quality Checks to Analytics-Ready Parquet with Python

1
Comments
8 min read
Your ETL Pipeline Wasn't Built for AI — Here's How to Fix It in 2026

Your ETL Pipeline Wasn't Built for AI — Here's How to Fix It in 2026

Comments
7 min read
Apache Kafka Explained: A Practical Beginner Guide for Data Engineers

Apache Kafka Explained: A Practical Beginner Guide for Data Engineers

3
Comments
9 min read
QN : Ingest and transform data in a lakehouse

QN : Ingest and transform data in a lakehouse

Comments
2 min read
Honest Memory: What Production Accuracy Data Actually Shows About AI Agent Memory

Honest Memory: What Production Accuracy Data Actually Shows About AI Agent Memory

Comments
4 min read
Building on Brazilian public data: a developer's field guide (CNPJ, CEP, Congress, BACEN)

Building on Brazilian public data: a developer's field guide (CNPJ, CEP, Congress, BACEN)

Comments
2 min read
I let an AI agent set up my entire Kafka platform. Here's what actually happened.

I let an AI agent set up my entire Kafka platform. Here's what actually happened.

Comments
4 min read
Advanced Kubernetes Patterns for Data Engineers

Advanced Kubernetes Patterns for Data Engineers

5
Comments
1 min read
LINUX FUNDAMENTALS FOR DATA ENGINEERING

LINUX FUNDAMENTALS FOR DATA ENGINEERING

Comments
5 min read
Extract data from Databases into DuckLake

Extract data from Databases into DuckLake

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.