DEV Community

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
50 Scala Interview Questions for Spark Developers with Answers

50 Scala Interview Questions for Spark Developers with Answers

Comments
10 min read
How Linux is Used in Real-World Data Engineering

How Linux is Used in Real-World Data Engineering

Comments
2 min read
Quantify Your Life: Building a High-Performance Health Data Lake with InfluxDB, Grafana, and Python 🚀

Quantify Your Life: Building a High-Performance Health Data Lake with InfluxDB, Grafana, and Python 🚀

Comments
4 min read
Building a Flight Data Pipeline Without Trusting AI

Building a Flight Data Pipeline Without Trusting AI

Comments
5 min read
Autonomous Systems and Decision Intelligence

Autonomous Systems and Decision Intelligence

1
Comments 1
2 min read
TOON File Format Anatomy: Schema-Once, Data-Many for LLM Pipelines 🎯📄

TOON File Format Anatomy: Schema-Once, Data-Many for LLM Pipelines 🎯📄

Comments
6 min read
What Is Apache Polaris? Why Open Data Catalogs Matter and How to Use Them with AWS

What Is Apache Polaris? Why Open Data Catalogs Matter and How to Use Them with AWS

7
Comments
11 min read
Build Your Own AI Medical Brain: Transforming PDF Health Reports into a Graph-RAG Powerhouse with Neo4j and LangChain

Build Your Own AI Medical Brain: Transforming PDF Health Reports into a Graph-RAG Powerhouse with Neo4j and LangChain

1
Comments
4 min read
10 Years of Blood Reports into One Graph: Building a Personal Medical Knowledge Base with Unstructured.io, Neo4j, and LlamaIndex

10 Years of Blood Reports into One Graph: Building a Personal Medical Knowledge Base with Unstructured.io, Neo4j, and LlamaIndex

1
Comments
3 min read
How Linux is Used in Real-World Data Engineering

How Linux is Used in Real-World Data Engineering

3
Comments
4 min read
PeachBot Medical KG: A Framework for Structured Clinical Knowledge Engineering

PeachBot Medical KG: A Framework for Structured Clinical Knowledge Engineering

Comments
3 min read
Does ClickHouse Support UPDATEs? A 2026 Data Analysis

Does ClickHouse Support UPDATEs? A 2026 Data Analysis

6
Comments 1
25 min read
Direct Dive into the Data: A Beginner's Guide to Getting Started

Direct Dive into the Data: A Beginner's Guide to Getting Started

1
Comments
5 min read
The Backyard Quarry, Part 7: Systems Beyond the Backyard

The Backyard Quarry, Part 7: Systems Beyond the Backyard

2
Comments
4 min read
Apache Airflow for Beginners: DAGs, Tasks, Operators, and Scheduling Explained

Apache Airflow for Beginners: DAGs, Tasks, Operators, and Scheduling Explained

2
Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.