DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
From Theory to Evidence: Validating Shannon Entropy for Data Quality at Scale

From Theory to Evidence: Validating Shannon Entropy for Data Quality at Scale

Comments
7 min read
Rejected but Not Defeated: Continuing My Journey Toward AWS Community Builders

Rejected but Not Defeated: Continuing My Journey Toward AWS Community Builders

1
Comments
1 min read
What is Snowflake? A Beginner's Guide to the Cloud Data Warehouse Everyone's Talking About

What is Snowflake? A Beginner's Guide to the Cloud Data Warehouse Everyone's Talking About

1
Comments
4 min read
Kafka Explained

Kafka Explained

2
Comments
2 min read
Building a Real-Time Market Anomaly Dashboard with Vue3, Element Plus, and Java

Building a Real-Time Market Anomaly Dashboard with Vue3, Element Plus, and Java

1
Comments
3 min read
#Connecting Power BI to SQL Databases: A Complete Guide

#Connecting Power BI to SQL Databases: A Complete Guide

2
Comments
4 min read
100 Spark Scenario Based Interview Questions and Answers

100 Spark Scenario Based Interview Questions and Answers

Comments
24 min read
Otimizando Escrita e Performance em Ambientes com Delta Lake, MinIO e Spark

Otimizando Escrita e Performance em Ambientes com Delta Lake, MinIO e Spark

1
Comments
3 min read
Quantified Self: Syncing Whoop and Garmin Metrics with InfluxDB and Grafana

Quantified Self: Syncing Whoop and Garmin Metrics with InfluxDB and Grafana

1
Comments
3 min read
Amazon S3 Files: from Kafka to S3 via NFS

Amazon S3 Files: from Kafka to S3 via NFS

3
Comments 1
11 min read
16GB of RAM, 12GB of JSON, and One Very Loud Fan

16GB of RAM, 12GB of JSON, and One Very Loud Fan

Comments
5 min read
How Apache Iceberg's Metadata Architecture Enables ACID at Scale

How Apache Iceberg's Metadata Architecture Enables ACID at Scale

Comments
7 min read
Data Quality at Scale: Building Trust in Airline Schedule Data Pipelines

Data Quality at Scale: Building Trust in Airline Schedule Data Pipelines

Comments
7 min read
The Modern Data Stack Has a Coherence Problem

The Modern Data Stack Has a Coherence Problem

1
Comments
7 min read
How I built a MilliSeconds data quality firewall on Cloudflare Workers

How I built a MilliSeconds data quality firewall on Cloudflare Workers

2
Comments 1
7 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.