DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Machine Learning Starts With a WHERE Clause

Machine Learning Starts With a WHERE Clause

1
Comments 1
2 min read
Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Comments
21 min read
Understanding Git: How it tracks, pushes and pulls code on Ubuntu

Understanding Git: How it tracks, pushes and pulls code on Ubuntu

2
Comments
3 min read
Getting started with Git and GitHub.

Getting started with Git and GitHub.

3
Comments
3 min read
How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)

How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)

Comments
2 min read
Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It

Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It

5
Comments 1
2 min read
Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

1
Comments
4 min read
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

7
Comments
6 min read
Ask Our AI Experts: An AMA With Our Tech Leads

Ask Our AI Experts: An AMA With Our Tech Leads

Comments
3 min read
11 Compaction Optimizations for Iceberg Data Lakes

11 Compaction Optimizations for Iceberg Data Lakes

1
Comments
25 min read
Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

1
Comments
4 min read
Schemas and Data Modelling in Power BI

Schemas and Data Modelling in Power BI

3
Comments
7 min read
Configuring Gravitino Iceberg REST Catalog Server

Configuring Gravitino Iceberg REST Catalog Server

3
Comments 1
6 min read
Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking

Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking

5
Comments 1
2 min read
Analytics Engineering

Analytics Engineering

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.