DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
LET'S GIT IT—A Beginner's Guide to Version Control.

LET'S GIT IT—A Beginner's Guide to Version Control.

4
Comments 1
3 min read
Day 13: Window Functions in PySpark

Day 13: Window Functions in PySpark

Comments
2 min read
Introduction to Version Control with Git and GitHub

Introduction to Version Control with Git and GitHub

1
Comments 2
3 min read
Is CsvPath an easy or hard language?

Is CsvPath an easy or hard language?

Comments
16 min read
Day 17: Building a Real ETL Pipeline in Spark Using Bronze-Silver-Gold Architecture

Day 17: Building a Real ETL Pipeline in Spark Using Bronze-Silver-Gold Architecture

Comments
1 min read
Understanding Salesforce Data 360 Objects: The Core of the Unified Customer Profile

Understanding Salesforce Data 360 Objects: The Core of the Unified Customer Profile

Comments
3 min read
Apache Gravitino Introduction

Apache Gravitino Introduction

2
Comments
5 min read
S3-Native Kafka Alternatives: What's Actually Different

S3-Native Kafka Alternatives: What's Actually Different

Comments
3 min read
Day 12: UDF vs Pandas UDF

Day 12: UDF vs Pandas UDF

Comments
2 min read
The Data Engineers Descent Into Datetime Hell

The Data Engineers Descent Into Datetime Hell

1
Comments
5 min read
Day 11: Choosing the Right File Format in Spark

Day 11: Choosing the Right File Format in Spark

Comments
2 min read
Navigating the Future: Key Data Engineering Trends for 2024 and Beyond

Navigating the Future: Key Data Engineering Trends for 2024 and Beyond

Comments
6 min read
How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

Comments
7 min read
Day 7: Mastering Joins, Unions, and GroupBy in PySpark - The Core ETL Operations

Day 7: Mastering Joins, Unions, and GroupBy in PySpark - The Core ETL Operations

Comments
2 min read
The evolution of the Modern Data Stack: From RDBMS to the LakeHouse

The evolution of the Modern Data Stack: From RDBMS to the LakeHouse

1
Comments
11 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.