DEV Community

# etl

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Fivetran vs Airbyte vs Estuary: Data Integration Tools Showdown

Fivetran vs Airbyte vs Estuary: Data Integration Tools Showdown

Comments
3 min read
Mastering Database Merging: Comparing Different Approaches

Mastering Database Merging: Comparing Different Approaches

1
Comments
14 min read
Optimizing ETL Processes for Efficient Data Loading in EDWs

Optimizing ETL Processes for Efficient Data Loading in EDWs

Comments
4 min read
Unlock the Power of C# in Polyglot Notebooks

Unlock the Power of C# in Polyglot Notebooks

1
Comments
7 min read
How I contributed my first data pipeline to the open source.

How I contributed my first data pipeline to the open source.

1
Comments
3 min read
On Orchestrators: You Are All Right, But You Are All Wrong Too

On Orchestrators: You Are All Right, But You Are All Wrong Too

1
Comments
10 min read
What is the REST API Source toolkit?

What is the REST API Source toolkit?

1
Comments
7 min read
Practical Way to Use AWS Glue with Postgresql

Practical Way to Use AWS Glue with Postgresql

1
Comments
2 min read
How Data Integration Is Evolving Beyond ETL

How Data Integration Is Evolving Beyond ETL

Comments
16 min read
Simplifying SDMX Data Integration with Python

Simplifying SDMX Data Integration with Python

2
Comments
3 min read
From ETL to Modern Integration Platforms

From ETL to Modern Integration Platforms

Comments
4 min read
Reverse ETL in Healthcare: Enhancing Patient Data Management

Reverse ETL in Healthcare: Enhancing Patient Data Management

Comments 1
4 min read
A Comprehensive Guide to Extracting Data from MySQL Using Singer ETL

A Comprehensive Guide to Extracting Data from MySQL Using Singer ETL

Comments
2 min read
4 Types of ETL tools: Description, Pros & Cons, and Use Cases

4 Types of ETL tools: Description, Pros & Cons, and Use Cases

Comments
7 min read
Demystifying:Azure Data Factory

Demystifying:Azure Data Factory

Comments
1 min read
Open Source High-Scale Data Pipeline Platform for Enterprise Data, Analytics, and Machine Learning Applications

Open Source High-Scale Data Pipeline Platform for Enterprise Data, Analytics, and Machine Learning Applications

Comments
2 min read
Best Practices for Designing an Efficient ETL Pipeline

Best Practices for Designing an Efficient ETL Pipeline

1
Comments
4 min read
Supercharge Data Insights: Harnessing AWS Glue for Advanced ETL in Healthcare and Life Sciences

Supercharge Data Insights: Harnessing AWS Glue for Advanced ETL in Healthcare and Life Sciences

3
Comments
3 min read
Top 5 Data Integration Tools for Modern Data Pipelines

Top 5 Data Integration Tools for Modern Data Pipelines

1
Comments
3 min read
CĂłmo Crear tu Primer Data Warehouse: Una GuĂ­a para Principiantes

CĂłmo Crear tu Primer Data Warehouse: Una GuĂ­a para Principiantes

2
Comments
3 min read
Cost-Effective GPT API Usage with Datapipe

Cost-Effective GPT API Usage with Datapipe

Comments
3 min read
Embracing Zero ETL: Unveiling the Benefits

Embracing Zero ETL: Unveiling the Benefits

Comments
6 min read
The easiest way to navigate through MongoDB, PySpark, and Jupyter Notebook

The easiest way to navigate through MongoDB, PySpark, and Jupyter Notebook

7
Comments
3 min read
Building a Data Warehouse with ETLBox: A .NET Developer's Guide

Building a Data Warehouse with ETLBox: A .NET Developer's Guide

Comments
18 min read
Redefining ETL: Data Flows Powered by C# (Part III)

Redefining ETL: Data Flows Powered by C# (Part III)

Comments
12 min read
Redefining ETL: Data Flows Powered by C# (Part I)

Redefining ETL: Data Flows Powered by C# (Part I)

4
Comments
11 min read
Data Processing with Elixir (Part 2)

Data Processing with Elixir (Part 2)

5
Comments 3
3 min read
Data processing with Elixir (Part 1)

Data processing with Elixir (Part 1)

8
Comments 5
4 min read
A mage on the Hero’s Journey: a fantasy epic on how a startup rose from the ashes

A mage on the Hero’s Journey: a fantasy epic on how a startup rose from the ashes

6
Comments
9 min read
How to check for quality? Evaluate data with AWS Glue Data Quality

How to check for quality? Evaluate data with AWS Glue Data Quality

4
Comments 1
10 min read
Data Engineering (Part 02)

Data Engineering (Part 02)

5
Comments
3 min read
Improving ETL jobs on AWS with sparksnake

Improving ETL jobs on AWS with sparksnake

4
Comments 1
4 min read
How I Decreased ETL Cost by Leveraging the Apache Arrow Ecosystem

How I Decreased ETL Cost by Leveraging the Apache Arrow Ecosystem

Comments
6 min read
Moving data from MongoDB to PostgreSQL using AWS Glue: A Guide

Moving data from MongoDB to PostgreSQL using AWS Glue: A Guide

2
Comments
2 min read
Data Masking

Data Masking

5
Comments
1 min read
SSL For RDS With Glue Python Job and AWS SDK For Pandas

SSL For RDS With Glue Python Job and AWS SDK For Pandas

4
Comments 1
4 min read
The Changing Face Of ETL

The Changing Face Of ETL

3
Comments 1
12 min read
Quick tip: Using Pentaho Data Integration (PDI) with SingleStoreDB

Quick tip: Using Pentaho Data Integration (PDI) with SingleStoreDB

Comments
3 min read
Solving AttributeError: 'float' object has no attribute 'rint'

Solving AttributeError: 'float' object has no attribute 'rint'

4
Comments
2 min read
How to import JSON file into SQL Server Database

How to import JSON file into SQL Server Database

5
Comments 1
3 min read
ETL vs Interactive Queries: The Case for Both

ETL vs Interactive Queries: The Case for Both

6
Comments
8 min read
Dynamic way doing ETL through Pyspark

Dynamic way doing ETL through Pyspark

16
Comments 2
4 min read
A No-code workflow (DAG) executor

A No-code workflow (DAG) executor

19
Comments
6 min read
ELT Data Pipeline with Kubernetes CronJob, Azure Data Lake, Azure Databricks (Part 1)

ELT Data Pipeline with Kubernetes CronJob, Azure Data Lake, Azure Databricks (Part 1)

7
Comments
5 min read
Debezium Change Data Capture without Kafka Connect

Debezium Change Data Capture without Kafka Connect

10
Comments 1
8 min read
Diving into ETL and CQRS — developing a secret message encoder with Serialized

Diving into ETL and CQRS — developing a secret message encoder with Serialized

18
Comments 1
18 min read
QuĂŠ es y como crear ETL en AWS Glue Parte 2

QuĂŠ es y como crear ETL en AWS Glue Parte 2

37
Comments
9 min read
QuĂŠ es y como crear ETL en AWS Glue Parte 1

QuĂŠ es y como crear ETL en AWS Glue Parte 1

34
Comments
3 min read
Considerations when performing ETL

Considerations when performing ETL

4
Comments
3 min read
How to Use Apache Airflow

How to Use Apache Airflow

7
Comments
8 min read
Using AWS Glue Studio for your ETL Jobs

Using AWS Glue Studio for your ETL Jobs

6
Comments
7 min read
Data architecture models

Data architecture models

3
Comments
6 min read
Apache Airflow. How to make the complex workflow as an easy job

Apache Airflow. How to make the complex workflow as an easy job

9
Comments 1
7 min read
Using Athena Views As A Source In Glue

Using Athena Views As A Source In Glue

15
Comments 3
4 min read
A simple serverless architecture on AWS ecosystem for data ETL and visualization

A simple serverless architecture on AWS ecosystem for data ETL and visualization

5
Comments
1 min read
Kestra, infinitely scalable open source orchestration and scheduling platform.

Kestra, infinitely scalable open source orchestration and scheduling platform.

3
Comments
6 min read
Modern data warehouse patterns: ELT with Snowflake variants

Modern data warehouse patterns: ELT with Snowflake variants

9
Comments
6 min read
How to Migrate from Segment to RudderStack

How to Migrate from Segment to RudderStack

6
Comments
8 min read
How To Build An ETL Using Python, Docker, PostgreSQL And Airflow

How To Build An ETL Using Python, Docker, PostgreSQL And Airflow

42
Comments
26 min read
Why It’s Hard for Engineering to Support Marketing

Why It’s Hard for Engineering to Support Marketing

2
Comments
3 min read
loading...