DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know

Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know

1
Comments 1
6 min read
How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C

How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C

Comments
2 min read
How Google Maps Predicts Traffic in Real Time: Live Data and ETA Explained

How Google Maps Predicts Traffic in Real Time: Live Data and ETA Explained

Comments
3 min read
How to Use Dremio with Claude Code: Connect, Query, and Build Data Apps

How to Use Dremio with Claude Code: Connect, Query, and Build Data Apps

Comments
13 min read
How to Connect Power BI to a SQL (PostgreSQL) Database and Build a Unified Dashboard

How to Connect Power BI to a SQL (PostgreSQL) Database and Build a Unified Dashboard

2
Comments
4 min read
Improving Data Ingestion Throughput with a Queue-Based Pipeline: Python + Duckdb

Improving Data Ingestion Throughput with a Queue-Based Pipeline: Python + Duckdb

Comments
3 min read
How I cut Python JSON memory overhead from 1.9GB to ~0MB (11x Speedup)

How I cut Python JSON memory overhead from 1.9GB to ~0MB (11x Speedup)

Comments
2 min read
Database Branch Testing: How Isolated Environments Improve QA Confidence

Database Branch Testing: How Isolated Environments Improve QA Confidence

1
Comments
11 min read
Boosting Lightweight ETL on AWS Lambda & Glue Python Shell with DuckDB and Apache Arrow Dataset

Boosting Lightweight ETL on AWS Lambda & Glue Python Shell with DuckDB and Apache Arrow Dataset

6
Comments
9 min read
Part 4 | Why State Machines Power Reliable Scheduling Systems

Part 4 | Why State Machines Power Reliable Scheduling Systems

Comments
6 min read
Building a Production-Ready Serverless App on Google Cloud (Part 2: The Data Contract)

Building a Production-Ready Serverless App on Google Cloud (Part 2: The Data Contract)

7
Comments
4 min read
Quantified Self: Building a Production-Grade ETL Pipeline for 10+ Wearables

Quantified Self: Building a Production-Grade ETL Pipeline for 10+ Wearables

2
Comments
4 min read
Our Data Extraction Pipeline Worked Perfectly… Until Month 6

Our Data Extraction Pipeline Worked Perfectly… Until Month 6

1
Comments
2 min read
Share of Shelf Analysis: How to Scrape Zappos Search Results

Share of Shelf Analysis: How to Scrape Zappos Search Results

1
Comments
4 min read
Iterator Patterns: How to Process Millions of Records Without Running Out of Memory

Iterator Patterns: How to Process Millions of Records Without Running Out of Memory

1
Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.