DEV Community

# dataengineering

Posts

From Postgres to Iceberg

1
Comments
11 min read
isql

Comments
1 min read
The data lakehouse evolution

Comments
11 min read
Digital Vaults: Unlocking the Future of Data Archiving with Cutting-Edge Databases

Comments
2 min read
OSPP Project Outcome: Supporting Flink Engine CDC Source Schema Evolution

Comments
6 min read
Designing a Cost-Efficient Parallel Data Pipeline on AWS Using Lambda and SQS

2
Comments
6 min read
Interesting links - October 2025

Comments
15 min read
Data Cataloguing in AWS

Comments
5 min read
From Dashboards to Decisions: Building Scalable Self-Service BI for Real Impact

Comments
2 min read
Real-World Strategies for Scaling AI in Large Organizations

Comments
3 min read
Clean Code in ETL: How Python, Go, and SQL Each Teach You to Think Differently

2
Comments
3 min read
Medallion Architecture On AWS

Comments
4 min read
🧠Understanding 6 Common Data Formats in Cloud Data Analytics

1
Comments
3 min read
Introducing ReelTrust: What if data engineering could solve our AI deepfakes problem?

Comments
5 min read
Who Needs Real-Time Streaming? Use Cases & Architecture Across Industries

Comments
8 min read
From “I want automation” to “It runs”: 15 decisions for lead enrichment that actually execute

Comments
3 min read
Daft vs Ray Data: A Comprehensive Comparison for Multimodal Data Processing

10
Comments
6 min read
End-to-End Data Workflow: Kestra, Redshift, and dbt Integration

4
Comments
9 min read
What Does a Data Engineer Do? Understanding the Role and Career Path

1
Comments
6 min read
Auto-Detecting CSV Schemas for Lightning-Fast ClickHouse Ingestion with Parquet

7
Comments
5 min read
Smart Invoice Analyzer — How I Automated Invoice Processing & Predicted Sales Using Machine Learning

3
Comments
2 min read
How I Streamed Live Binance L2 Order Book Data on AWS for ~$10/Month

5
Comments
14 min read
Stop Waiting for the Cloud: Building a Hybrid SQL+Python Data Pipeline Locally with DuckDB

Comments
5 min read
Decommissioning the Dinosaur: A 4-Phase Playbook for Migrating Your Legacy Data Warehouse to Databricks

Comments
4 min read
The Data Analytics Lifecycle

Comments
3 min read