DEV Community

# dataengineering

Posts

Setting up memory for Flink 2 - What to think about...

Comments
4 min read
Pandas vs Polars: Is It Time to Rethink Python’s Trusted DataFrame Library?

3
Comments 2
3 min read
From API to Dashboard: Tracking Kenya’s Debt with Python,PostgreSQL &Grafana

Comments
3 min read
Architecting your GenAI data pipeline with AWS native services

6
Comments
10 min read
Introduction to Data Analytics Platform with Databricks

1
Comments
3 min read
The Death of Traditional ETL: Why AI Agents Are Taking Over Data Pipelines

1
Comments
4 min read
learnings from optimizing pandas code

1
Comments
3 min read
Tracking Kenya’s External Debt Using Python, PostgreSQL, and Grafana

3
Comments
3 min read
Trust Over Throttle: Leveraging o3-Pro for Accurate, Impactful AI

Comments
3 min read
🏗️ Building data pipelines in 2025?

Comments
1 min read
Data Engineering in 30 Days: Day 1

3
Comments 1
3 min read
Simplify Private Data Warehouse Ops: Visualized, Secure, and Fast with BendDeploy on Kubernetes

Comments
11 min read
Announcing Collate 1.7

Comments
4 min read
Scaling Apache SeaTunnel for Enterprise: Billion-Level Data Processing and Intelligent Fault Tolerance in Real-World Use Cases

5
Comments
9 min read
Inside Databases: BSTs, B-Trees, & LSM Trees

1
Comments
7 min read
PostgreSQL Maximalism

Comments
20 min read
Contributor Spotlight: How I Brought Apache SeaTunnel from First PR to Production

Comments
4 min read
Personal Picks: Data Product News (May 14, 2025)

Comments
3 min read
My First Data Pipeline Project Using Airflow, Docker & Postgres (COVID API Edition)

4
Comments
2 min read
Why java don't have her own Pandas and numPy Libraries??

Comments
1 min read
Stop Hiring Data Analysts

Comments
3 min read
What Is DBT? A No-Fluff Guide for Data Engineers and Analysts

Comments
3 min read
How Excel is Used in Real-World Data Analysis

2
Comments
3 min read
[Snowflake's New Feature]Cortex AISQL: Multimodal Data Analysis with SQL Commands

1
Comments
8 min read
DatAasee - A Metadata-Lake

Comments
1 min read