DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Understanding Apache Iceberg Delete Files

Understanding Apache Iceberg Delete Files

5
Comments
4 min read
Top 5 Things You Should Know About Spark

Top 5 Things You Should Know About Spark

1
Comments
3 min read
PySpark optimization techniques

PySpark optimization techniques

1
Comments
4 min read
Avoid These Top 10 Mistakes When Using Apache Spark

Avoid These Top 10 Mistakes When Using Apache Spark

4
Comments
8 min read
Getting Started with Apache Kafka: A Beginner's Guide to Distributed Event Streaming

Getting Started with Apache Kafka: A Beginner's Guide to Distributed Event Streaming

1
Comments
5 min read
Understanding the Apache Iceberg Manifest File

Understanding the Apache Iceberg Manifest File

3
Comments
7 min read
RoadMap to Data-Analytics 2024!

RoadMap to Data-Analytics 2024!

3
Comments
2 min read
DBT and Software Engineering

DBT and Software Engineering

4
Comments
7 min read
Effective Techniques for Handling Imbalanced Datasets: My Proven Approach

Effective Techniques for Handling Imbalanced Datasets: My Proven Approach

Comments
3 min read
The Developer’s Guide to Real-Time Data Platforms!

The Developer’s Guide to Real-Time Data Platforms!

9
Comments
6 min read
Understanding Apache Iceberg's metadata.json file

Understanding Apache Iceberg's metadata.json file

6
Comments
7 min read
🌐 Get started: What is MongoDB operational data layer? (Part 2) 🌐

🌐 Get started: What is MongoDB operational data layer? (Part 2) 🌐

5
Comments
2 min read
🌐 开始使用: MongoDB Operational Data Layer 是什么? (第1部分)

🌐 开始使用: MongoDB Operational Data Layer 是什么? (第1部分)

5
Comments
1 min read
Mastering SQL Joins and Unions: Integrate Data for Incredible Insights

Mastering SQL Joins and Unions: Integrate Data for Incredible Insights

Comments
6 min read
Feature Engineering: The Ultimate Guide

Feature Engineering: The Ultimate Guide

1
Comments
2 min read
🦆 💏 🐘 Let PostgreSQL & duckdb "sql" together

🦆 💏 🐘 Let PostgreSQL & duckdb "sql" together

2
Comments 2
3 min read
What Apache Iceberg REST Catalog is and isn't

What Apache Iceberg REST Catalog is and isn't

11
Comments
3 min read
ETL Real Estate Data Engineering with Redfin: From Extraction to Visualization

ETL Real Estate Data Engineering with Redfin: From Extraction to Visualization

1
Comments
3 min read
Transforming Data Engineering: A Business Domain Approach with Data Mesh

Transforming Data Engineering: A Business Domain Approach with Data Mesh

Comments
5 min read
Speeding Up Data on AWS: From Ingestion to Insights

Speeding Up Data on AWS: From Ingestion to Insights

4
Comments
11 min read
การนำเข้าข้อมูลจากไฟล์ CSV เข้ามาใน Posstgres : ทักษะเบื้องต้นของ Data Engineer

การนำเข้าข้อมูลจากไฟล์ CSV เข้ามาใน Posstgres : ทักษะเบื้องต้นของ Data Engineer

Comments
1 min read
The Ultimate Guide to Data Analytics: Techniques and Tools.

The Ultimate Guide to Data Analytics: Techniques and Tools.

Comments
3 min read
Building an Agnostic Data Pipeline: Pros and Cons

Building an Agnostic Data Pipeline: Pros and Cons

1
Comments
4 min read
Building and Managing Production-Ready Apache Airflow: From Setup to Troubleshooting

Building and Managing Production-Ready Apache Airflow: From Setup to Troubleshooting

Comments
2 min read
🐚 My Pacific Dataviz Challenge 2024 submission : violence & graphdatascience

🐚 My Pacific Dataviz Challenge 2024 submission : violence & graphdatascience

3
Comments 10
2 min read
loading...