DEV Community

# dataengineering

Posts

Designing a Layered YouTube Analytics Pipeline with AWS Bedrock (Architecture Overview)

Comments
2 min read
Understanding schemas and data modelling in Power BI

2
Comments
4 min read
5 Database Design Mistakes I Keep Seeing (And How to Catch Them Early)

1
Comments
7 min read
Data Relationship Analysis at Scale with Arisyn

5
Comments 1
3 min read
Scaling Fuzzy Matching: From Local Scripts to Production Pipelines

7
Comments
5 min read
Offloading Statistical Computations to BigQuery: Efficient EDA with Python and Seaborn

1
Comments
2 min read
Why Most Data Projects Fail Before the First Model Is Built

5
Comments
2 min read
AI Data Engineer Skills Deep-Dive: Entry-Level Reality + Senior Differentiators (Follow-up to Part 1)

Comments
4 min read
Data Relationship Intelligence Is Infrastructure — Not a Feature

5
Comments 1
1 min read
How DiDi Scaled to Hundreds of Petabytes with Apache Ozone

Comments
4 min read
XLTable: Bringing the OLAP Experience Back to Excel on Modern Data Warehouses

Comments
4 min read
Stop Bad Data From Breaking Your Pipelines — A Python Data Quality Framework

Comments
3 min read
How to Implement Data Modelling in Power BI

2
Comments
2 min read
The Power of Generic Reading in PySpark: A Unified Approach to Data

1
Comments
3 min read
AI Data Engineer vs Data Engineer: What Actually Changed? (50+ Job Analysis)

Comments
4 min read