DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How to Convert Dates in One Group into An Interval

How to Convert Dates in One Group into An Interval

10
Comments
2 min read
Building a project in DBT

Building a project in DBT

Comments
5 min read
Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

Comments
12 min read
Final project part 1

Final project part 1

2
Comments
2 min read
Desentrañando el Proceso ETL: La Columna Vertebral de la Ciencia de Datos

Desentrañando el Proceso ETL: La Columna Vertebral de la Ciencia de Datos

Comments
2 min read
Hands-On Guide: Implementing Debezium for PostgreSQL to Kafka Integration

Hands-On Guide: Implementing Debezium for PostgreSQL to Kafka Integration

3
Comments 5
6 min read
DBT (Data Build Tool)

DBT (Data Build Tool)

2
Comments 1
4 min read
My Data Engineering Library

My Data Engineering Library

Comments
2 min read
Shipping Data in Real Time Debezium : Part 1

Shipping Data in Real Time Debezium : Part 1

4
Comments 3
2 min read
XGBoost Training Speed: A Comparative Analysis

XGBoost Training Speed: A Comparative Analysis

Comments
2 min read
Loops and Vectorization in Python

Loops and Vectorization in Python

Comments
1 min read
Embarking on the Data Odyssey: A Deep Dive into Data Engineering for Tech Enthusiasts

Embarking on the Data Odyssey: A Deep Dive into Data Engineering for Tech Enthusiasts

Comments
3 min read
Apache Doris 2.1.0: TPC-DS, Parallel Adaptive Scan, Local Shuffle, Arrow Flight-based HTTP Data API

Apache Doris 2.1.0: TPC-DS, Parallel Adaptive Scan, Local Shuffle, Arrow Flight-based HTTP Data API

Comments
29 min read
Production and CI/CD in dbt

Production and CI/CD in dbt

2
Comments
3 min read
My Experience with Apache Airflow

My Experience with Apache Airflow

8
Comments
3 min read
"Day 42 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -21)

"Day 42 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -21)

1
Comments
1 min read
Different file formats, a benchmark doing basic operations

Different file formats, a benchmark doing basic operations

10
Comments 2
9 min read
5 reasons Dremio is the ideal Apache Iceberg Lakehouse Platform

5 reasons Dremio is the ideal Apache Iceberg Lakehouse Platform

Comments
5 min read
When Metrics Go Awry: Analyzing KPIs using machine learning, regression analysis, and Shapley values

When Metrics Go Awry: Analyzing KPIs using machine learning, regression analysis, and Shapley values

Comments
5 min read
How to manage tags for objects in Snowflake

How to manage tags for objects in Snowflake

Comments
6 min read
Can You Become a Data Analyst Without a Background in Computer Science?

Can You Become a Data Analyst Without a Background in Computer Science?

Comments
2 min read
“Data has a Dream” — A Short comic about data mesh and how it can transform your company

“Data has a Dream” — A Short comic about data mesh and how it can transform your company

1
Comments 1
2 min read
How moving from Pandas to Polars made me write better code without writing better code

How moving from Pandas to Polars made me write better code without writing better code

40
Comments 4
14 min read
The Apache Iceberg Lakehouse: The Great Data Equalizer (disrupting the Snowflake/Databricks status quo)

The Apache Iceberg Lakehouse: The Great Data Equalizer (disrupting the Snowflake/Databricks status quo)

1
Comments
7 min read
📢 About job offers, innovation & data strategy 🔭

📢 About job offers, innovation & data strategy 🔭

Comments 3
3 min read
loading...