DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Your Ruby CSV Import Ran Successfully — Your Data May Still Be Wrong

Your Ruby CSV Import Ran Successfully — Your Data May Still Be Wrong

1
Comments
2 min read
Cross-Cloud Pipeline with ADF & STS: Architecture, Troubleshooting & Costs

Cross-Cloud Pipeline with ADF & STS: Architecture, Troubleshooting & Costs

Comments
17 min read
Real-time streaming pipeline with Apache Flink 2.0, Kafka and Iceberg

Real-time streaming pipeline with Apache Flink 2.0, Kafka and Iceberg

Comments
7 min read
Linux Fundamentals Every Beginner Data Engineer Should Know

Linux Fundamentals Every Beginner Data Engineer Should Know

Comments
4 min read
Open Tables, Shared Truth: Architecting a Multi-Engine Lakehouse

Open Tables, Shared Truth: Architecting a Multi-Engine Lakehouse

Comments
4 min read
Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector

Lightweight ETL on AWS Lambda Using DuckDB and Snowflake Connector

6
Comments
6 min read
Getting Started with GoldenPipe: Clean Data in Your Python Backend

Getting Started with GoldenPipe: Clean Data in Your Python Backend

Comments
6 min read
GoldenMatch vs. Splink vs. Dedupe vs. RecordLinkage: A Practical Comparison

GoldenMatch vs. Splink vs. Dedupe vs. RecordLinkage: A Practical Comparison

Comments
8 min read
The Cloud Is Just Someone Else's Computer. Sometimes That Computer Gets Hit by a Drone.

The Cloud Is Just Someone Else's Computer. Sometimes That Computer Gets Hit by a Drone.

Comments
5 min read
Flink + AI: Building Real-Time Decision Systems (Not Just Data Pipelines)

Flink + AI: Building Real-Time Decision Systems (Not Just Data Pipelines)

Comments
2 min read
Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained

Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained

Comments
4 min read
Setting Up Your Databricks Account (Free Trial + First Look at the UI)

Setting Up Your Databricks Account (Free Trial + First Look at the UI)

Comments
4 min read
How Linux is Used in Real-World Data Engineering

How Linux is Used in Real-World Data Engineering

1
Comments
3 min read
Master Your Wellness: Building a Health Knowledge Graph with LLMs and Neo4j 🧬

Master Your Wellness: Building a Health Knowledge Graph with LLMs and Neo4j 🧬

Comments
3 min read
Air Quality & Data Engineering Platform

Air Quality & Data Engineering Platform

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.