DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Data ingestion using AWS Services, Part 1

Data ingestion using AWS Services, Part 1

Comments
9 min read
Data Warehousing Architectures

Data Warehousing Architectures

Comments
5 min read
Can AI finally generate best practice code? I think so.

Can AI finally generate best practice code? I think so.

2
Comments
6 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

1
Comments 8
5 min read
📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

5
Comments
2 min read
Introduction to Data lakes: The future of big data storage

Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

7
Comments
12 min read
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches

Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches

Comments
1 min read
The Apache Iceberg™ Small File Problem

The Apache Iceberg™ Small File Problem

13
Comments
3 min read
Ensuring Data Quality: Best Practices and Automation

Ensuring Data Quality: Best Practices and Automation

Comments
6 min read
Data Science Simplified: Tips for Aspiring Data Scientists in 2025

Data Science Simplified: Tips for Aspiring Data Scientists in 2025

1
Comments
4 min read
2025 Guide to Architecting an Iceberg Lakehouse

2025 Guide to Architecting an Iceberg Lakehouse

5
Comments
14 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

2
Comments
13 min read
One Off to One Data Platform: Design with Intent [Part 2]

One Off to One Data Platform: Design with Intent [Part 2]

1
Comments
5 min read
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem

Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem

Comments
2 min read
Choosing the right, real-time, Postgres CDC platform

Choosing the right, real-time, Postgres CDC platform

Comments
8 min read
ChatGPT Launches Pro: What's it Mean for Data Professionals?

ChatGPT Launches Pro: What's it Mean for Data Professionals?

2
Comments
4 min read
Introduction to Apache Kafka

Introduction to Apache Kafka

3
Comments 1
3 min read
Mastering Workflow Automation with Apache Airflow for Data Engineering

Mastering Workflow Automation with Apache Airflow for Data Engineering

Comments
6 min read
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions

Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions

Comments
3 min read
Seaborn Cheat Sheet

Seaborn Cheat Sheet

1
Comments
2 min read
Jupyter Notebooks in Docker

Jupyter Notebooks in Docker

9
Comments 1
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

4
Comments 1
3 min read
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.

SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.

Comments
1 min read
ACID Properties in Databases: What Happens Without Them?

ACID Properties in Databases: What Happens Without Them?

5
Comments
6 min read
loading...