DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Building a Test Data Platform After Watching Teams Secretly Use Production for Years

Building a Test Data Platform After Watching Teams Secretly Use Production for Years

Comments
3 min read
Chinese DBA's Story: Sui Haifeng - Grasp the two most important five-year periods of your career

Chinese DBA's Story: Sui Haifeng - Grasp the two most important five-year periods of your career

Comments
5 min read
Kafka

Kafka

3
Comments
10 min read
A Modern Data Governance Framework for Google Cloud: Implementing Just-Enough and Just-in-Time Access

A Modern Data Governance Framework for Google Cloud: Implementing Just-Enough and Just-in-Time Access

3
Comments
8 min read
Scaling Customer Analytics: Designing ML Pipelines for Millions of Users

Scaling Customer Analytics: Designing ML Pipelines for Millions of Users

Comments
7 min read
Temperature, Tokens, and Context Windows: The Three Pillars of LLM Control

Temperature, Tokens, and Context Windows: The Three Pillars of LLM Control

3
Comments
13 min read
Building Intelligent, Metadata-Driven Pipelines with Azure Data Factory

Building Intelligent, Metadata-Driven Pipelines with Azure Data Factory

2
Comments
6 min read
Why Your Enterprise Data Platform Is No Longer Just for Analytics

Why Your Enterprise Data Platform Is No Longer Just for Analytics

2
Comments 1
11 min read
The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

The State of Apache Iceberg, Polaris, and Arrow: October–November 2025

2
Comments
7 min read
Realtime Data Streaming Platform: Building a Unified Monitoring Stack

Realtime Data Streaming Platform: Building a Unified Monitoring Stack

3
Comments
8 min read
Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

Real-Time Streaming Platform with Pulsar, Flink & ClickHouse

1
Comments
6 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?

Why Parquet Is Everywhere - And What Makes It Actually Fast?

2
Comments
3 min read
🎓 Building a Smart LMS Assistant: RAG System with Pinecone for Multi-Source Learning Data

🎓 Building a Smart LMS Assistant: RAG System with Pinecone for Multi-Source Learning Data

Comments
3 min read
Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

Interoperating Open Table Formats on AWS Using Apache XTable (Delta Iceberg)

4
Comments
4 min read
Big Data Processing (Hadoop, Spark)

Big Data Processing (Hadoop, Spark)

2
Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.