DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Jupyter Notebooks in Docker

Jupyter Notebooks in Docker

9
Comments 1
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

4
Comments 1
3 min read
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.

SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.

Comments
1 min read
ACID Properties in Databases: What Happens Without Them?

ACID Properties in Databases: What Happens Without Them?

5
Comments
6 min read
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs

🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs

1
Comments 8
6 min read
10 Future Apache Iceberg Developments to Look forward to in 2025

10 Future Apache Iceberg Developments to Look forward to in 2025

Comments
13 min read
Data Architecture Best Practices

Data Architecture Best Practices

1
Comments
6 min read
My Journey into Data AI and Machine Learning

My Journey into Data AI and Machine Learning

Comments
1 min read
🚀 Unlock the Power of ORC File Format 📊

🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
Setting up memory for Flink - Configuration

Setting up memory for Flink - Configuration

Comments
3 min read
Designing robust and scalable relational databases: A series of best practices.

Designing robust and scalable relational databases: A series of best practices.

16
Comments 5
17 min read
From Data to Decisions: How Machine Learning Works in 2025

From Data to Decisions: How Machine Learning Works in 2025

3
Comments
3 min read
Why Data Security is Broken and How to Fix it?

Why Data Security is Broken and How to Fix it?

1
Comments
5 min read
Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Comments
3 min read
OLAP (Online Analytical Processing)

OLAP (Online Analytical Processing)

5
Comments
3 min read
Understanding Star Schema vs. Snowflake Schema

Understanding Star Schema vs. Snowflake Schema

6
Comments
1 min read
The Future of Agentic Systems Podcast 1:42:26

The Future of Agentic Systems Podcast

7
Comments 1
1 min read
Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

1
Comments
13 min read
What is Data Engineering?

What is Data Engineering?

Comments
1 min read
One Off to One Data Platform: The Unscalable Data Platform [Part 1]

One Off to One Data Platform: The Unscalable Data Platform [Part 1]

1
Comments
3 min read
What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?

What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?

Comments 1
5 min read
Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?

Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?

3
Comments 1
7 min read
Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?

Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?

Comments
6 min read
LightningChart Python 1.0

LightningChart Python 1.0

Comments
1 min read
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric

End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric

Comments
7 min read
loading...