DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Building a Production-Ready Serverless App on Google Cloud (Part 1: Architecture)

Building a Production-Ready Serverless App on Google Cloud (Part 1: Architecture)

14
Comments
5 min read
Data Pipeline Architecture: From Messy CSVs to Clean Database

Data Pipeline Architecture: From Messy CSVs to Clean Database

Comments
5 min read
Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector

Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector

6
Comments
6 min read
Epistemic Control Systems: Anchoring on Kafka

Epistemic Control Systems: Anchoring on Kafka

Comments
4 min read
Building an Incremental Zoho Desk to BigQuery Pipeline: Lessons from the Trenches

Building an Incremental Zoho Desk to BigQuery Pipeline: Lessons from the Trenches

1
Comments
7 min read
Stop Manually Entering Medical Data: How to Automate PDF Lab Reports with LayoutParser & OCR

Stop Manually Entering Medical Data: How to Automate PDF Lab Reports with LayoutParser & OCR

1
Comments
3 min read
Shopify Automation: How I Managed an 80,000-Product Catalog with Python & Pandas

Shopify Automation: How I Managed an 80,000-Product Catalog with Python & Pandas

Comments
3 min read
Synthetic Data and the Privacy Problem: Beyond Alice and Bob

Synthetic Data and the Privacy Problem: Beyond Alice and Bob

1
Comments
10 min read
how i use cursor and ai agents to write dbt tests and documentation

how i use cursor and ai agents to write dbt tests and documentation

1
Comments
2 min read
dbt + OpenLineage #1: Why dbt-ol Is a Post-Processor (Not a Plugin) — and Why It Matters

dbt + OpenLineage #1: Why dbt-ol Is a Post-Processor (Not a Plugin) — and Why It Matters

Comments
7 min read
PardoX 0.3.1: The GPU Awakening and the Conquest of the Universal Backend

PardoX 0.3.1: The GPU Awakening and the Conquest of the Universal Backend

1
Comments
19 min read
Feed Rescue: Converting Raw Ulta Scrapes into Google Merchant Center XML

Feed Rescue: Converting Raw Ulta Scrapes into Google Merchant Center XML

1
Comments
5 min read
the future of data engineering workflows with ai

the future of data engineering workflows with ai

1
Comments
2 min read
100 Spark Scenario Based Interview Questions and Answers

100 Spark Scenario Based Interview Questions and Answers

1
Comments 1
24 min read
ETL Pipeline: The 6-Phase Pattern That Cuts Debugging From Hours to Minutes

ETL Pipeline: The 6-Phase Pattern That Cuts Debugging From Hours to Minutes

1
Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.