DEV Community

# dataengineering

Posts

Why We Built Confidence Scoring Into Our Date Parser (And Why Every API Should)

Comments 1
3 min read
Understanding PRAGMA SERIALLY_REUSABLE in Oracle PL/SQL: A Complete Guide

5
Comments
3 min read
Is your Vector Database Really Fast?

Comments
9 min read
Kubernetes in Depth - Storage, Security, and Advanced Features

1
Comments
6 min read
Building a Resilient Exception Strategy with Apache Beam and DLQ

Comments
3 min read
Classes in Python, a beginner's pov

1
Comments
2 min read
Well-formed, Valid, Canonical, and Correct

1
Comments
4 min read
Why we use Apache Airflow for Data Engineering

Comments
2 min read
Building ML Infrastructure in TypeScript - Part 1: The Vision

5
Comments
3 min read
Building a News Sentiment Analysis Pipeline with Apache Airflow and Snowflake

11
Comments
3 min read
SQL CASE Statements: The Order Matters!

Comments
2 min read
Why Data Cleaning is 80% of Data Science

Comments
2 min read
Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

1
Comments
8 min read
Tableau Sales Dashboard Performance (Updated for 2025)

1
Comments
4 min read
Build a Lightweight Serverless ETL Pipeline to Iceberg Tables with AWS Lambda Athena

2
Comments
4 min read
Big Data Fundamentals: data pipeline with python

Comments
6 min read
Big Data Fundamentals: data pipeline tutorial

Comments
6 min read
🧊 Snowflake RBAC 101 – Episode 3: Ongoing Access Management

Comments
1 min read
Data Science vs Business Analytics

Comments
1 min read
Big Data Fundamentals: data pipeline example

Comments
6 min read
Big Data Fundamentals: data pipeline

Comments
6 min read
What Is Change Data Capture (CDC) and How It Works on Google Cloud

Comments
2 min read
💾 Parquet or Avro? CSV or JSON?

Comments
1 min read
Reading CSVs with varying column counts that pandas cannot read using DuckDB

1
Comments
3 min read
Working with Apache to automate collection of Weather data for Kenya’s major Agricultural Areas

Comments
5 min read