DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
greatCircleDistance in ClickHouse: Avoiding Full Table Scans

greatCircleDistance in ClickHouse: Avoiding Full Table Scans

3
Comments
2 min read
ETL vs ELT: Which One Should You Use and Why?

ETL vs ELT: Which One Should You Use and Why?

4
Comments
4 min read
My first Python project: Excel to SQL pipeline (feedback welcome)

My first Python project: Excel to SQL pipeline (feedback welcome)

1
Comments
1 min read
# Building a Streaming Session Analytics Pipeline with Kafka, Postgres, and dbt

# Building a Streaming Session Analytics Pipeline with Kafka, Postgres, and dbt

7
Comments
6 min read
Snowflake Cost Optimization Starts With Workload Design, Not Auto-Suspend

Snowflake Cost Optimization Starts With Workload Design, Not Auto-Suspend

Comments
5 min read
Materialization strategies: how Bruin and dbt turn SELECT queries into tables

Materialization strategies: how Bruin and dbt turn SELECT queries into tables

Comments
11 min read
How Bruin turns a SELECT query into 9 different materialization strategies across 14 databases

How Bruin turns a SELECT query into 9 different materialization strategies across 14 databases

Comments
10 min read
Data quality testing: how Bruin and dbt take different paths to the same goal

Data quality testing: how Bruin and dbt take different paths to the same goal

1
Comments
7 min read
SQLite, Go/Postgres, & Petabytes: Database Patterns for Builders

SQLite, Go/Postgres, & Petabytes: Database Patterns for Builders

2
Comments
4 min read
Creating a Cross-platform Data Sandbox that makes Working with Data Easier and Faster

Creating a Cross-platform Data Sandbox that makes Working with Data Easier and Faster

1
Comments
3 min read
Data Engineering Interviews Are Broken (Here's Proof)

Data Engineering Interviews Are Broken (Here's Proof)

Comments
6 min read
Fixing 168K Failed FHIR Conversions with Parallel AI Agents and Git Worktrees

Fixing 168K Failed FHIR Conversions with Parallel AI Agents and Git Worktrees

1
Comments
10 min read
Data Pipelines Explained Simply (and How to Build Them with Python)

Data Pipelines Explained Simply (and How to Build Them with Python)

2
Comments
2 min read
How to Build a Secure Azure Data Platform with Terraform & Data Factory (Step-by-Step)

How to Build a Secure Azure Data Platform with Terraform & Data Factory (Step-by-Step)

1
Comments
3 min read
How to Add a Data Quality Gate to Your Airflow Pipeline in 5 Minutes

How to Add a Data Quality Gate to Your Airflow Pipeline in 5 Minutes

Comments 1
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.