DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Big Data Processing - Case Study 2 (Hadoop) 04:26

Big Data Processing - Case Study 2 (Hadoop)

Comments
1 min read
Big Data Processing - Case Study 2 (Spark) 01:52

Big Data Processing - Case Study 2 (Spark)

Comments
1 min read
Big Data Processing - Case Study 1 (Hadoop) 02:01

Big Data Processing - Case Study 1 (Hadoop)

Comments
1 min read
Free Datasets for Practicing Data Engineering Skills: A 2025 Guide

Free Datasets for Practicing Data Engineering Skills: A 2025 Guide

3
Comments
3 min read
The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts

The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts

73
Comments 4
4 min read
Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL

Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL

1
Comments
4 min read
Why do AWS dashboards keep breaking — and is there a better way?

Why do AWS dashboards keep breaking — and is there a better way?

Comments 1
1 min read
Complete Beginner's Guide: Building a Weather ETL Pipeline with PySpark

Complete Beginner's Guide: Building a Weather ETL Pipeline with PySpark

2
Comments 1
5 min read
Event Sourcing as a creative tool for engineers

Event Sourcing as a creative tool for engineers

1
Comments
5 min read
The Underrated Soft Skills That Make Great Data Engineers

The Underrated Soft Skills That Make Great Data Engineers

2
Comments 2
2 min read
MongoDB Relationships - Embedded vs Referenced | Tutorial 2025

MongoDB Relationships - Embedded vs Referenced | Tutorial 2025

7
Comments 1
4 min read
How PostgreSQL logical decoding and plugins work

How PostgreSQL logical decoding and plugins work

1
Comments
6 min read
Why Denormalizing in ClickHouse will come back to bite you

Why Denormalizing in ClickHouse will come back to bite you

Comments
3 min read
Ultimate guide to creating a pipeline(Apache Airflow)

Ultimate guide to creating a pipeline(Apache Airflow)

11
Comments
5 min read
Extracting Data from an API using Python (requests)

Extracting Data from an API using Python (requests)

Comments
4 min read
Unlocking Business Potential with Big Data Analytics Services

Unlocking Business Potential with Big Data Analytics Services

Comments
3 min read
A Practical Guide to MLOps on AWS: Transforming Raw Data into AI-Ready Datasets with AWS Glue (Phase 02)

A Practical Guide to MLOps on AWS: Transforming Raw Data into AI-Ready Datasets with AWS Glue (Phase 02)

1
Comments 2
8 min read
Personal Picks: Data Product News (April 16, 2025)

Personal Picks: Data Product News (April 16, 2025)

Comments
8 min read
Find the Superset from the Relationship Table — From SQL to SPL #19

Find the Superset from the Relationship Table — From SQL to SPL #19

1
Comments 1
1 min read
Building a Gold (XAUUSD) Trend Tracker with Python and SQLite

Building a Gold (XAUUSD) Trend Tracker with Python and SQLite

2
Comments 1
4 min read
Big data analytics process

Big data analytics process

1
Comments
1 min read
Python 101:The Ultimate Beginner’s Guide

Python 101:The Ultimate Beginner’s Guide

1
Comments
4 min read
Common Mistakes to Avoid in Data Engineering Job Interviews (And How to Nail Them)

Common Mistakes to Avoid in Data Engineering Job Interviews (And How to Nail Them)

Comments
3 min read
Is Big Data Dying?

Is Big Data Dying?

1
Comments
7 min read
Supercharging Databricks Asset Bundles

Supercharging Databricks Asset Bundles

1
Comments
4 min read
loading...