DEV Community

# dataengineering

Posts

Optimizing Large-Scale Data Processing in Python: A Guide to Parallelizing CSV Operations

1
Comments
3 min read
Jupyter Notebooks in Docker

8
Comments 1
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

4
Comments 1
3 min read
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.

Comments
1 min read
ACID Properties in Databases: What Happens Without Them?

5
Comments
6 min read
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs

1
Comments 8
6 min read
Data Architecture Best Practices

1
Comments
6 min read
My Journey into Data AI and Machine Learning

Comments
1 min read
🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
Designing robust and scalable relational databases: A series of best practices.

10
Comments 5
17 min read
From Data to Decisions: How Machine Learning Works in 2025

3
Comments
3 min read
Why Data Security is Broken and How to Fix it?

1
Comments
5 min read
From ETL and ELT to Reverse ETL

Comments
4 min read
OLAP (Online Analytical Processing)

5
Comments
3 min read
The Future of Agentic Systems Podcast 1:42:26

6
Comments 1
1 min read
What is Data Engineering?

Comments
1 min read
Deep Dive into Dremio's File-based Auto Ingestion into Apache Iceberg Tables

1
Comments
13 min read
One Off to One Data Platform: The Unscalable Data Platform [Part 1]

2
Comments
3 min read
What are the major advantages of a cloud warehouse solution over an on-premises data warehouse solution?

Comments 1
5 min read
Databricks vs. Hadoop: Which Platform is Best for Predictive Analytics?

2
Comments 1
7 min read
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric

Comments
7 min read
The Ultimate Data Engineering Roadmap: From Beginner to Pro

6
Comments 1
8 min read
Data Analysis: The Unsung Hero of Modern Business

Comments
2 min read
Intro to SQL using Apache Iceberg and Dremio

4
Comments
22 min read
5 Best ETL Tools: A Comprehensive Comparison Guide

1
Comments
3 min read