DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Backpressure in document pipelines is an architecture problem first

Backpressure in document pipelines is an architecture problem first

Comments
2 min read
Why mixed document packs make extraction pipelines harder to trust

Why mixed document packs make extraction pipelines harder to trust

Comments
2 min read
Why Cursor AI Won't Replace Data Engineers (And How to Actually Use It)

Why Cursor AI Won't Replace Data Engineers (And How to Actually Use It)

3
Comments
2 min read
Why Real-Time Analytics Eventually Changes Your Database Architecture

Why Real-Time Analytics Eventually Changes Your Database Architecture

3
Comments
4 min read
Time-Series Databases (InfluxDB/TimescaleDB)

Time-Series Databases (InfluxDB/TimescaleDB)

1
Comments
8 min read
🧞‍♂️Transform unstructured PDFs Job Offers into a dataset w. gemma4:2b

Gemma 4 Challenge: Build With Gemma 4 Submission

🧞‍♂️Transform unstructured PDFs Job Offers into a dataset w. gemma4:2b

9
Comments 3
4 min read
PostgreSQL to Snowflake: A Hands-On CDC Streaming Guide

PostgreSQL to Snowflake: A Hands-On CDC Streaming Guide

Comments
13 min read
ETL vs ELT: Which One Should You Use and Why?

ETL vs ELT: Which One Should You Use and Why?

1
Comments
6 min read
TaskFlow API vs Traditional Operators in Apache Airflow

TaskFlow API vs Traditional Operators in Apache Airflow

3
Comments
3 min read
Your real-time analytics might not be real-time

Your real-time analytics might not be real-time

Comments
1 min read
What is Apache Arrow? Erasing the Serialization Tax

What is Apache Arrow? Erasing the Serialization Tax

Comments
3 min read
ETL vs ELT: Which One Should You Use and Why?

ETL vs ELT: Which One Should You Use and Why?

Comments
5 min read
Mastering SQL Fundamentals: From Data Definition to Data Transformation

Mastering SQL Fundamentals: From Data Definition to Data Transformation

Comments
3 min read
How Apache Kafka Powers Real-Time Data Pipelines

How Apache Kafka Powers Real-Time Data Pipelines

1
Comments 1
3 min read
Assembling the Apache Lakehouse: The Modular Architecture

Assembling the Apache Lakehouse: The Modular Architecture

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.