DEV Community

# dataengineering

Posts

10 Future Apache Iceberg Developments to Look forward to in 2025

Comments
13 min read
Creating Stripe Test Data in Python

2
Comments
4 min read
📊 AI Dashboard Builder: Create Insightful Dashboards by Just Dropping Your Data

Comments
2 min read
Setting up memory for Flink - Configuration

Comments
3 min read
Are AWS Certifications Worth It in 2025?

3
Comments
2 min read
Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Comments
3 min read
Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?

Comments
6 min read
LightningChart Python 1.0

Comments
1 min read
Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Exploring the 360Learning API: From the Agility of Power Query to the Robustness of the Modern Data Stack

7
Comments
12 min read
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches

Comments
1 min read
The Apache Iceberg™ Small File Problem

6
Comments
3 min read
Ensuring Data Quality: Best Practices and Automation

Comments
6 min read
Data Science Simplified: Tips for Aspiring Data Scientists in 2025

1
Comments
4 min read
2025 Guide to Architecting an Iceberg Lakehouse

7
Comments
14 min read
Dremio, Apache Iceberg and their role in AI-Ready Data

Comments
7 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

1
Comments
13 min read
Leveraging Python's Pattern Matching and Comprehensions for Data Analytics

Comments
12 min read
One Off to One Data Platform: Design with Intent [Part 2]

1
Comments
5 min read
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem

Comments
2 min read
Understanding Star Schema vs. Snowflake Schema

Comments
1 min read
ChatGPT Launches Pro: What's it Mean for Data Professionals?

2
Comments
4 min read
Introduction to Apache Kafka

3
Comments 1
3 min read
Mastering Workflow Automation with Apache Airflow for Data Engineering

Comments
6 min read
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions

Comments
3 min read