DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How Tables and indexes stored on Disk

How Tables and indexes stored on Disk

Comments
2 min read
"Day 38 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -17)

"Day 38 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -17)

1
Comments
3 min read
4 numeric distribution metrics to track in Snowflake (and how to track them)

4 numeric distribution metrics to track in Snowflake (and how to track them)

Comments
9 min read
Learn Python

Learn Python

Comments
1 min read
10 Reasons to Make Apache Iceberg and Dremio Part of your Data Lakehouse Strategy

10 Reasons to Make Apache Iceberg and Dremio Part of your Data Lakehouse Strategy

Comments
9 min read
Exploring Feature Stores: Personal Insights and Notes on Hopsworks pt.2

Exploring Feature Stores: Personal Insights and Notes on Hopsworks pt.2

1
Comments
1 min read
A deep dive into the concept and world of Apache Iceberg Catalogs

A deep dive into the concept and world of Apache Iceberg Catalogs

3
Comments
8 min read
Build a Real-time Materialized View from Postgres Changes using Confluent’s ksqlDB

Build a Real-time Materialized View from Postgres Changes using Confluent’s ksqlDB

Comments
11 min read
"Day 35 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -14)

"Day 35 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -14)

1
Comments
2 min read
The Role of Ontologies in Data Management

The Role of Ontologies in Data Management

1
Comments
6 min read
"Day 33 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -12)

"Day 33 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -12)

1
Comments
2 min read
Testing and documenting DBT models

Testing and documenting DBT models

3
Comments
3 min read
Introduction to dbt

Introduction to dbt

2
Comments
5 min read
Understanding Logistic Regression

Understanding Logistic Regression

Comments
5 min read
The Future of AI and other stories

The Future of AI and other stories

Comments
2 min read
Incremental loading in dlt

Incremental loading in dlt

3
Comments
2 min read
Since When Did APIs Become Databases?

Since When Did APIs Become Databases?

Comments
4 min read
Normalizing data with dlt

Normalizing data with dlt

3
Comments
3 min read
Extracting data with dlt

Extracting data with dlt

8
Comments
7 min read
Day 28 of My Learning Journey: Setting Sail into Data Excellence Today's Focus: Mathematics for Data Analysis (Stats Day -7)

Day 28 of My Learning Journey: Setting Sail into Data Excellence Today's Focus: Mathematics for Data Analysis (Stats Day -7)

1
Comments
1 min read
Big Data is dead & other stories

Big Data is dead & other stories

Comments
2 min read
Day 27 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -6)

Day 27 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -6)

1
Comments
1 min read
What to think about when designing, building, managing and operating data systems.

What to think about when designing, building, managing and operating data systems.

1
Comments
8 min read
BigQuery best practices

BigQuery best practices

4
Comments
2 min read
Partitioning and Clustering on BigQuery

Partitioning and Clustering on BigQuery

3
Comments 1
3 min read
The Mythical Data Team

The Mythical Data Team

3
Comments
6 min read
Benchmarking Python Processing Engines: Who’s the Fastest?

Benchmarking Python Processing Engines: Who’s the Fastest?

3
Comments
4 min read
Day 26 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -5)

Day 26 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -5)

1
Comments
1 min read
SQL should be your default choice & other stories

SQL should be your default choice & other stories

Comments
2 min read
"Day 25 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -4)

"Day 25 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Statistics Day -4)

1
Comments
2 min read
Decisiones informadas basadas en los datos 🎬 Serie: ⚡ Cloud Superpower ⚡ 2.06

Decisiones informadas basadas en los datos 🎬 Serie: ⚡ Cloud Superpower ⚡ 2.06

Comments
2 min read
Winter Data Meetup 2024

Winter Data Meetup 2024

Comments
2 min read
"Day 23 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Maths for Data Analysis (Stats Day -2)

"Day 23 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Maths for Data Analysis (Stats Day -2)

1
Comments
1 min read
Deciphering Standardization and Normalization: Understanding Feature Scaling Techniques

Deciphering Standardization and Normalization: Understanding Feature Scaling Techniques

Comments
3 min read
PySpark & Apache Spark - Overview

PySpark & Apache Spark - Overview

Comments
3 min read
Using data for predictive analytics

Using data for predictive analytics

Comments
6 min read
🦿🛴Smarcity garbage reporting automation w/ ollama

🦿🛴Smarcity garbage reporting automation w/ ollama

3
Comments 4
3 min read
Day 22 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day 1)

Day 22 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day 1)

1
Comments
2 min read
"Day 20 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Excel for Data Analysis (Excel Day 19) 📊🚀

"Day 20 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Excel for Data Analysis (Excel Day 19) 📊🚀

1
Comments
4 min read
AWS Kinesis - Stream Storage Layer

AWS Kinesis - Stream Storage Layer

2
Comments
3 min read
Google Cloud to Big Query with Mage

Google Cloud to Big Query with Mage

2
Comments
2 min read
"Day 18 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Excel for Data Analysis (Excel Day 17) 📊🚀

"Day 18 of My Learning Journey: Setting Sail into Data Excellence! ⛵️ Today's Focus: Excel for Data Analysis (Excel Day 17) 📊🚀

1
Comments
3 min read
Migrando geometries con DMS

Migrando geometries con DMS

Comments
1 min read
Embracing the Future of Data Management: Why Choose Lakehouse, Iceberg, and Dremio?

Embracing the Future of Data Management: Why Choose Lakehouse, Iceberg, and Dremio?

Comments
7 min read
"Day 17: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

"Day 17: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

1
Comments
2 min read
Transform your R Dataframes: Styles, 🎨 Colors, and 😎 Emojis

Transform your R Dataframes: Styles, 🎨 Colors, and 😎 Emojis

2
Comments
9 min read
Modern Data Engineering RoadMap - 2024

Modern Data Engineering RoadMap - 2024

97
Comments 3
3 min read
"Day 16: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

"Day 16: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

1
Comments
2 min read
"Day 15: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

"Day 15: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

1
Comments
2 min read
Introducción a los Data Lakes

Introducción a los Data Lakes

3
Comments
3 min read
Hands-on Guide to Enable Compute Nodes for Data Lake Analytics in Apache Doris

Hands-on Guide to Enable Compute Nodes for Data Lake Analytics in Apache Doris

1
Comments
4 min read
Exploring Feature Stores: Personal Insights and Notes on Hopsworks

Exploring Feature Stores: Personal Insights and Notes on Hopsworks

2
Comments
1 min read
Data Engineering Saga part 2

Data Engineering Saga part 2

2
Comments
3 min read
Data Engineering Saga

Data Engineering Saga

1
Comments
2 min read
Data Engineering Saga

Data Engineering Saga

1
Comments
2 min read
Data Engineering Saga

Data Engineering Saga

Comments
2 min read
Data Evolution - Databases to Data Lakehouse

Data Evolution - Databases to Data Lakehouse

4
Comments
4 min read
"Day 14: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

"Day 14: Excel Essentials Unveiled - Sharing Today's Insights on My Learning Adventure! 📊🚀 #ExcelSkills #LearningJourney"

1
Comments
2 min read
How to build an Anomaly Detector using BigQuery

How to build an Anomaly Detector using BigQuery

4
Comments
12 min read
How NASCAR delivers realtime racing data to millions of fans around the world

How NASCAR delivers realtime racing data to millions of fans around the world

19
Comments
2 min read
loading...