DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Data Engineering Uncovered: What It Is and Why It Matters

Data Engineering Uncovered: What It Is and Why It Matters

2
Comments 1
3 min read
The Three Phases of Data Pipelines

The Three Phases of Data Pipelines

Comments
4 min read
Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti

Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti

Comments
6 min read
Why Columnar Storage Makes Analytics Faster

Why Columnar Storage Makes Analytics Faster

1
Comments
1 min read
Are Wide Tables Fast or Slow?

Are Wide Tables Fast or Slow?

5
Comments
4 min read
HOW TO GIT IT

HOW TO GIT IT

Comments
3 min read
Tableau + Databricks at Scale: A Technical Guide for Managing 10,000+ Databases

Tableau + Databricks at Scale: A Technical Guide for Managing 10,000+ Databases

Comments
5 min read
How to Set Up GPG Keys for an Existing GitHub Account (Step-by-Step)

How to Set Up GPG Keys for an Existing GitHub Account (Step-by-Step)

Comments
2 min read
Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk

Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk

1
Comments
3 min read
An Introduction to Git: Concepts, Commands, and Workflows

An Introduction to Git: Concepts, Commands, and Workflows

Comments
4 min read
Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026

Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026

Comments
8 min read
Learning Git & GitHub as a Data Engineering Student at LuxDevHQ

Learning Git & GitHub as a Data Engineering Student at LuxDevHQ

Comments
3 min read
Apache Gravitino Introduction

Apache Gravitino Introduction

Comments
5 min read
Why Data Engineers Are Becoming Agent Engineers

Why Data Engineers Are Becoming Agent Engineers

Comments
3 min read
Tired of ETL Bottlenecks? Build a Logical Data Warehouse with SPL

Tired of ETL Bottlenecks? Build a Logical Data Warehouse with SPL

5
Comments
11 min read
Dev List Digest for Apache Iceberg, Parquet, Polaris and Arrow: January 6–14, 2026

Dev List Digest for Apache Iceberg, Parquet, Polaris and Arrow: January 6–14, 2026

Comments
4 min read
Building a Near Real-Time Analytics Pipeline with AWS Zero-ETL

Building a Near Real-Time Analytics Pipeline with AWS Zero-ETL

Comments
4 min read
Geospatial Data Orchestration: Why Modern GIS Pipelines Require an Asset-Based Approach

Geospatial Data Orchestration: Why Modern GIS Pipelines Require an Asset-Based Approach

Comments
7 min read
Exploring the Potential of AWS Glue Python Shell as a Long-Running Batch Execution Environment

Exploring the Potential of AWS Glue Python Shell as a Long-Running Batch Execution Environment

3
Comments
7 min read
We're Manufacturing Dashboards & Data Nobody Uses (And the Data Proves It)

We're Manufacturing Dashboards & Data Nobody Uses (And the Data Proves It)

Comments
4 min read
Conference Notes: How ML Powers LINE Services

Conference Notes: How ML Powers LINE Services

Comments
5 min read
Modern Data Integration at Scale with Microsoft Fabric Connectors

Modern Data Integration at Scale with Microsoft Fabric Connectors

Comments
3 min read
LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

LINE Developer Meetup 13 (Part 1): Conference Notes from 2020/09/18

Comments
7 min read
JSONL is a seriously weird format!

JSONL is a seriously weird format!

Comments
2 min read
How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

Comments
7 min read
loading...