DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Breaking Into Data Science: A Comprehensive Guide for Aspiring Data Scientists

Breaking Into Data Science: A Comprehensive Guide for Aspiring Data Scientists

Comments
5 min read
"Data Engineering 101: A Beginner's Guide"

"Data Engineering 101: A Beginner's Guide"

3
Comments
3 min read
Understanding the Polaris Iceberg Catalog and Its Architecture

Understanding the Polaris Iceberg Catalog and Its Architecture

2
Comments
8 min read
Automatically Update BigQuery View Schema Changes

Automatically Update BigQuery View Schema Changes

3
Comments
5 min read
How I contributed my first data pipeline to the open source.

How I contributed my first data pipeline to the open source.

1
Comments
3 min read
On Orchestrators: You Are All Right, But You Are All Wrong Too

On Orchestrators: You Are All Right, But You Are All Wrong Too

1
Comments
10 min read
From Messy Data to Super Mario Pipeline: My First Adventure in Data Engineering

From Messy Data to Super Mario Pipeline: My First Adventure in Data Engineering

Comments
12 min read
Data Engineer and Databricks

Data Engineer and Databricks

1
Comments
3 min read
What is the REST API Source toolkit?

What is the REST API Source toolkit?

1
Comments
7 min read
Working with Parquet files in Java using Carpet

Working with Parquet files in Java using Carpet

1
Comments
6 min read
HNG STAGE ZERO: ANALYZING RETAIL SALES DATA AT FIRST GLANCE

HNG STAGE ZERO: ANALYZING RETAIL SALES DATA AT FIRST GLANCE

Comments
3 min read
🪄 Debezium: the magic behind data capture & async replication (for free)

🪄 Debezium: the magic behind data capture & async replication (for free)

Comments 2
2 min read
Ways to load data in DW from External Data Source

Ways to load data in DW from External Data Source

1
Comments
6 min read
Apache Doris Job Scheduler for Task Automation

Apache Doris Job Scheduler for Task Automation

1
Comments
6 min read
Tracking Health with Data Engineering - Chapter 1: Meal Optimization

Tracking Health with Data Engineering - Chapter 1: Meal Optimization

Comments
6 min read
Software OR Hardware Raid: What's Better In 2024?

Software OR Hardware Raid: What's Better In 2024?

4
Comments
7 min read
Aggregation in GROUP BY vs. Window Functions Using OVER()

Aggregation in GROUP BY vs. Window Functions Using OVER()

3
Comments
3 min read
Azure Synapse Analytics Security: Access Control

Azure Synapse Analytics Security: Access Control

2
Comments
7 min read
Databases Deconstructed: The Value of Data Lakehouses and Table Formats

Databases Deconstructed: The Value of Data Lakehouses and Table Formats

4
Comments
8 min read
Understanding RAID Levels: A Comprehensive Guide to RAID 0, 1, 5, 6, 10, and Beyond

Understanding RAID Levels: A Comprehensive Guide to RAID 0, 1, 5, 6, 10, and Beyond

7
Comments
9 min read
BigQuery Schema Generation Made Easier with PyPI’s bigquery-schema-generator

BigQuery Schema Generation Made Easier with PyPI’s bigquery-schema-generator

5
Comments 2
2 min read
Embrace simple tech stacks and code generation in DevOps and data engineering

Embrace simple tech stacks and code generation in DevOps and data engineering

2
Comments
6 min read
Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?

Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?

1
Comments
9 min read
MapReduce Vs Tez

MapReduce Vs Tez

6
Comments
2 min read
Azure Synapse Analytics Security: Data Protection

Azure Synapse Analytics Security: Data Protection

2
Comments
6 min read
loading...