DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs

🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs

1
Comments 9
6 min read
10 Future Apache Iceberg Developments to Look forward to in 2025

10 Future Apache Iceberg Developments to Look forward to in 2025

1
Comments
13 min read
Data Architecture Best Practices

Data Architecture Best Practices

1
Comments
6 min read
🚀 Unlock the Power of ORC File Format 📊

🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
Setting up memory for Flink - Configuration

Setting up memory for Flink - Configuration

Comments
3 min read
Designing robust and scalable relational databases: A series of best practices.

Designing robust and scalable relational databases: A series of best practices.

17
Comments 5
17 min read
From Data to Decisions: How Machine Learning Works in 2025

From Data to Decisions: How Machine Learning Works in 2025

3
Comments
3 min read
Why Data Security is Broken and How to Fix it?

Why Data Security is Broken and How to Fix it?

1
Comments
5 min read
Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Comments
3 min read
OLAP (Online Analytical Processing)

OLAP (Online Analytical Processing)

5
Comments
3 min read
Understanding Star Schema vs. Snowflake Schema

Understanding Star Schema vs. Snowflake Schema

8
Comments
1 min read
The Future of Agentic Systems Podcast 1:42:26

The Future of Agentic Systems Podcast

7
Comments 1
1 min read
Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

1
Comments
13 min read
What is Data Engineering?

What is Data Engineering?

Comments
1 min read
One Off to One Data Platform: The Unscalable Data Platform [Part 1]

One Off to One Data Platform: The Unscalable Data Platform [Part 1]

1
Comments
3 min read
What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?

What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?

Comments 1
5 min read
Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?

Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?

3
Comments 1
7 min read
Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?

Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?

3
Comments
6 min read
LightningChart Python 1.0

LightningChart Python 1.0

Comments
1 min read
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric

End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric

Comments
7 min read
The Ultimate Data Engineering Roadmap: From Beginner to Pro

The Ultimate Data Engineering Roadmap: From Beginner to Pro

6
Comments 1
8 min read
Intro to SQL using Apache Iceberg and Dremio

Intro to SQL using Apache Iceberg and Dremio

4
Comments
22 min read
5 Best ETL Tools: A Comprehensive Comparison Guide

5 Best ETL Tools: A Comprehensive Comparison Guide

1
Comments
3 min read
Dremio, Apache Iceberg and their role in AI-Ready Data

Dremio, Apache Iceberg and their role in AI-Ready Data

Comments
7 min read
SAP S/4HANA Cloud

SAP S/4HANA Cloud

Comments 2
2 min read
loading...