DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Check Out 3 Awesome Open Source Tabular Data Wrangling Apps

Check Out 3 Awesome Open Source Tabular Data Wrangling Apps

1
Comments
3 min read
15 Core Concepts of Data Engineering

15 Core Concepts of Data Engineering

Comments
9 min read
Building a Food Price & Inflation Analysis Pipeline in Kenya (2006–2024)

Building a Food Price & Inflation Analysis Pipeline in Kenya (2006–2024)

Comments
3 min read
Understanding Data Warehousing for Retail Analytics: A Comprehensive Guide

Understanding Data Warehousing for Retail Analytics: A Comprehensive Guide

1
Comments
3 min read
Building a Modern Data Warehouse in SQL Server with Medallion Architecture

Building a Modern Data Warehouse in SQL Server with Medallion Architecture

Comments
11 min read
Benefits of OLAP and OLTP in Data Management.

Benefits of OLAP and OLTP in Data Management.

Comments
2 min read
Why We Built Confidence Scoring Into Our Date Parser (And Why Every API Should)

Why We Built Confidence Scoring Into Our Date Parser (And Why Every API Should)

Comments 1
3 min read
🧭 Data Mesh vs Data Fabric (Part 1) – Rethinking How We Scale Data

🧭 Data Mesh vs Data Fabric (Part 1) – Rethinking How We Scale Data

Comments
2 min read
Is your Vector Database Really Fast?

Is your Vector Database Really Fast?

Comments
9 min read
Kubernetes in Depth - Storage, Security, and Advanced Features

Kubernetes in Depth - Storage, Security, and Advanced Features

1
Comments
6 min read
Building a Real-Time Data Pipeline using Binance Websocket API, PySpark, Kafka and Grafana

Building a Real-Time Data Pipeline using Binance Websocket API, PySpark, Kafka and Grafana

3
Comments 1
9 min read
Building a Resilient Exception Strategy with Apache Beam and DLQ

Building a Resilient Exception Strategy with Apache Beam and DLQ

Comments
3 min read
Classes in Python, a beginner's pov

Classes in Python, a beginner's pov

1
Comments
2 min read
Why we use Apache Airflow for Data Engineering

Why we use Apache Airflow for Data Engineering

Comments
2 min read
Building ML Infrastructure in TypeScript - Part 1: The Vision

Building ML Infrastructure in TypeScript - Part 1: The Vision

5
Comments
3 min read
Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Comments
8 min read
🧱 OLTP vs OLAP: When Transaction Meets Analytics

🧱 OLTP vs OLAP: When Transaction Meets Analytics

Comments
2 min read
Virtual Private Database (VPD) | DBMS_RLS | fine-grained access control (FGAC) | mrcaption49

Virtual Private Database (VPD) | DBMS_RLS | fine-grained access control (FGAC) | mrcaption49

5
Comments
5 min read
DBMS_SCHEDULER with Practical example | mrcaption49

DBMS_SCHEDULER with Practical example | mrcaption49

5
Comments
4 min read
Building a News Sentiment Analysis Pipeline with Apache Airflow and Snowflake

Building a News Sentiment Analysis Pipeline with Apache Airflow and Snowflake

11
Comments
3 min read
SQL CASE Statements: The Order Matters!

SQL CASE Statements: The Order Matters!

Comments
2 min read
Why Data Cleaning is 80% of Data Science

Why Data Cleaning is 80% of Data Science

Comments
2 min read
Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

1
Comments
8 min read
🛒 Real-Life Data Lakehouse Use Case: Revolutionizing Retail Analytics

🛒 Real-Life Data Lakehouse Use Case: Revolutionizing Retail Analytics

Comments
2 min read
Build a Lightweight Serverless ETL Pipeline to Iceberg Tables with AWS Lambda Athena

Build a Lightweight Serverless ETL Pipeline to Iceberg Tables with AWS Lambda Athena

1
Comments
4 min read
loading...