DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

Comments
7 min read
JSONL is a seriously weird format!

JSONL is a seriously weird format!

Comments
2 min read
Stop Wrestling With Apache POI—Meet Sheetz, the One-Liner Excel Library for Java

Stop Wrestling With Apache POI—Meet Sheetz, the One-Liner Excel Library for Java

3
Comments
5 min read
SQL on Kafka Data Does Not Require a Streaming Engine

SQL on Kafka Data Does Not Require a Streaming Engine

Comments
4 min read
Arquitetura de Alta Performance: O "Sob o Capô" da Modern Data Stack

Arquitetura de Alta Performance: O "Sob o Capô" da Modern Data Stack

Comments
5 min read
Reverse-Engineering Unknown Databases at Scale with Arisyn

Reverse-Engineering Unknown Databases at Scale with Arisyn

5
Comments 1
2 min read
Machine Learning Starts With a WHERE Clause

Machine Learning Starts With a WHERE Clause

1
Comments 1
2 min read
Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Engineer’s Diary: Leaving Windows Behind and Building the ETL Engine I Always Dreamed Of, PardoX v0.1

Comments
21 min read
Understanding Git: How it tracks, pushes and pulls code on Ubuntu

Understanding Git: How it tracks, pushes and pulls code on Ubuntu

2
Comments
3 min read
Part 2: dbt Project Structure & Building Models 📁

Part 2: dbt Project Structure & Building Models 📁

1
Comments
4 min read
Getting started with Git and GitHub.

Getting started with Git and GitHub.

3
Comments
3 min read
How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)

How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)

Comments
2 min read
Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It

Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It

5
Comments 1
2 min read
Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

Building a MedAdvantage RAF Engine with dbt & PostgreSQL (Step-by-Step Guide)

1
Comments
4 min read
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

7
Comments
6 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.