yuer
Why Distributed Query Engines Always Accumulate Complexity in the Execution Layer

If you’ve ever worked on a distributed query engine in production, you’ll eventually notice something uncomfortable:

No matter how clean your SQL parser is, how elegant your optimizer looks, or how solid your storage layer feels,
the real complexity always ends up in the Execution Layer.

Not slowly.
Not theoretically.
But painfully, through refactors, rewrites, and late-night debugging.

This post is a practical summary of lessons learned from building execution layers, working with Rust, fighting Shuffle, and paying for early design shortcuts.

The Execution Layer Isn’t “Just Execution”

Architectural diagrams often split a query engine into neat boxes:

SQL parsing

optimization

plan generation

execution

storage

This decomposition hides an important truth:

Everything before execution is mostly static

The Execution Layer is where reality shows up

Reality means:

skewed data

slow or failing nodes

network jitter

memory pressure that only appears at scale

Every assumption that turns out to be wrong eventually lands in the Execution Layer.

That’s why optimizers tend to age well, while execution layers are constantly reworked.

Shuffle: The Silent System-Wide Cost Multiplier

Shuffle is usually introduced as “just repartitioning data.”

In practice, Shuffle simultaneously stresses:

network bandwidth

memory (often with sharp spikes)

disk I/O (as a fallback)

CPU (hashing, sorting, serialization)

More importantly:

Shuffle amplifies uncertainty.

One slow node becomes a global bottleneck

Slight data skew turns into OOM

Minor network jitter becomes query-wide latency

Many production incidents trace back to Shuffle — even when it wasn’t obvious at first.
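To make the skew point concrete, here's a minimal sketch (illustrative names, not any real engine's API) of hash partitioning with one hot key. Every copy of the hot key hashes to the same partition, so one reducer ends up with ~90% of the rows and ~90% of the memory:

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Standard hash partitioning: same key -> same partition, always.
fn partition_of(key: &str, partitions: usize) -> usize {
    let mut h = DefaultHasher::new();
    key.hash(&mut h);
    (h.finish() as usize) % partitions
}

fn main() {
    // A skewed workload: one hot key dominates the input.
    let rows: Vec<&str> = std::iter::repeat("hot_key")
        .take(9_000)
        .chain(["a", "b", "c"].into_iter().cycle().take(1_000))
        .collect();

    let mut sizes = vec![0usize; 8];
    for key in &rows {
        sizes[partition_of(key, 8)] += 1;
    }

    // All 9_000 hot rows land in a single partition; that one node
    // becomes the straggler (or the OOM) for the whole query.
    println!("partition sizes: {:?}", sizes);
    assert!(sizes.iter().max().copied().unwrap() >= 9_000);
}
```

No partition count fixes this: adding partitions spreads the cold keys thinner while the hot partition stays exactly as large.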

Concurrency: Easy to Add, Hard to Control

A common early belief is:

“More parallelism equals better performance.”

Execution layers repeatedly prove this wrong.

Typical mistakes:

async everywhere for CPU-bound operators

mixing async runtimes with manual thread pools

aggressive task spawning without backpressure

The results:

unpredictable scheduling

inflated tail latency

behavior that’s hard to reason about

Once concurrency leaks into operator design, fixing it later usually means rewriting core abstractions.
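One way to keep parallelism under control from the start is to make backpressure structural rather than bolted on. This sketch (plain `std` threads and a bounded channel; a hypothetical pipeline, not any particular engine) shows the idea: `send` blocks once the channel is full, so a fast producer physically cannot outrun a slow consumer:

```rust
use std::sync::mpsc::sync_channel;
use std::thread;

// A bounded channel as operator backpressure: `send` blocks once
// `capacity` batches are in flight, so a fast upstream scan cannot
// pile unbounded intermediate batches in memory.
fn run_pipeline(capacity: usize, batches: u64) -> usize {
    let (tx, rx) = sync_channel::<Vec<u64>>(capacity);

    let producer = thread::spawn(move || {
        for batch_id in 0..batches {
            // Pretend this is a record batch read from storage.
            tx.send(vec![batch_id; 1024]).unwrap(); // blocks when consumer lags
        }
    });

    let mut total_rows = 0;
    for batch in rx {
        total_rows += batch.len(); // the slow downstream operator
    }
    producer.join().unwrap();
    total_rows
}

fn main() {
    // 100 batches of 1024 rows flow through with at most 4 in flight.
    assert_eq!(run_pipeline(4, 100), 100 * 1024);
}
```

The capacity constant is the whole design: it turns "how much memory can this pipeline use?" from an emergent property into a number you chose.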

Rust Prevents Memory Corruption — Not Memory Surprises

Rust is excellent at memory safety.

What it does not guarantee:

predictable memory usage

bounded lifetimes for intermediate data

stable memory peaks under Shuffle or Join

Most execution-layer memory failures are not leaks, but:

buffers retained longer than expected

lifetimes unintentionally extended across stages

memory spikes that only appear under real workloads

These issues are hard to detect early and expensive to fix late.
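A tiny illustration of the "retained longer than expected" pattern, using a hypothetical hash join (the function and names are illustrative, not a real API). Rust guarantees the build side is *safe*; it does not guarantee it is *small* or short-lived. Without the explicit `drop`, the build-side hash set stays alive through every later stage in the function:

```rust
use std::collections::HashSet;

// Sketch: the join's build side is the memory peak. Dropping it
// explicitly once probing is done releases it before later stages;
// otherwise it lives, safely but expensively, to end of scope.
fn hash_join_matches(build: Vec<u64>, probe: Vec<u64>) -> usize {
    let build_side: HashSet<u64> = build.into_iter().collect(); // peak memory
    let matches = probe.iter().filter(|k| build_side.contains(k)).count();
    drop(build_side); // release the build side now, not at end of scope
    // ... imagine long-running downstream stages here ...
    matches
}

fn main() {
    assert_eq!(hash_join_matches(vec![1, 2, 3], vec![2, 3, 4, 3]), 3);
}
```

In a real operator the same discipline applies at a larger scale: intermediate buffers should have owners whose lifetimes match the stage that needs them, not the query that contains them.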

Why Execution Layer Designs So Often Get Rewritten

Across systems, the same failure patterns keep showing up.

❌ Vague Execution Models

Early designs often treat a query as:

a SQL string

or a loosely defined sequence of steps

Later, teams try to “add” execution plans, operator graphs, and schedulers.

In strongly typed systems (especially Rust), this is rarely salvageable.
Execution semantics must be explicit from day one.
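"Explicit from day one" can be as small as this: a typed operator tree instead of a SQL string or a list of steps. The operator names below (Scan, Filter, Limit) are illustrative, not a real engine's API, but the shape is the point: execution semantics live in the type, so they can't be retrofitted wrongly later.

```rust
// A minimal explicit execution model: the plan is a typed tree.
enum PhysicalPlan {
    Scan { rows: Vec<i64> },
    Filter { input: Box<PhysicalPlan>, min: i64 },
    Limit { input: Box<PhysicalPlan>, n: usize },
}

// Execution is a total function over the plan: every operator's
// semantics are written down, not implied by a string.
fn execute(plan: &PhysicalPlan) -> Vec<i64> {
    match plan {
        PhysicalPlan::Scan { rows } => rows.clone(),
        PhysicalPlan::Filter { input, min } => {
            execute(input).into_iter().filter(|v| v >= min).collect()
        }
        PhysicalPlan::Limit { input, n } => {
            let mut out = execute(input);
            out.truncate(*n);
            out
        }
    }
}

fn main() {
    let plan = PhysicalPlan::Limit {
        n: 2,
        input: Box::new(PhysicalPlan::Filter {
            min: 10,
            input: Box::new(PhysicalPlan::Scan { rows: vec![5, 12, 40, 7, 30] }),
        }),
    };
    assert_eq!(execute(&plan), vec![12, 40]);
}
```

A real engine replaces the recursive `execute` with batched, pull- or push-based operators, but the enum-shaped plan is the part that's nearly impossible to add after the fact.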

❌ Shared Mutable State

`Arc<Mutex<T>>` makes early progress easy.

At scale, it introduces:

lock contention

latency jitter

deadlock risks in async contexts

Execution layers work better when data flows, not when state is shared.
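Here's what "data flows" can look like in miniature (a hypothetical parallel aggregation, standard library only): instead of every worker mutating one `Arc<Mutex<Vec<_>>>`, each worker owns its chunk, computes a partial result, and sends it downstream. There are no locks on the hot path:

```rust
use std::sync::mpsc;
use std::thread;

// Sketch: workers own their data and send results downstream;
// a single consumer merges. No shared mutable state, no contention.
fn parallel_sum(chunks: Vec<Vec<u64>>) -> u64 {
    let (tx, rx) = mpsc::channel();
    let mut handles = Vec::new();
    for chunk in chunks {
        let tx = tx.clone();
        handles.push(thread::spawn(move || {
            let partial: u64 = chunk.iter().sum(); // worker owns its chunk
            tx.send(partial).unwrap();             // result flows downstream
        }));
    }
    drop(tx); // close the channel once all sender clones are gone
    let total = rx.iter().sum();
    for h in handles {
        h.join().unwrap();
    }
    total
}

fn main() {
    assert_eq!(parallel_sum(vec![vec![1, 2], vec![3, 4], vec![5]]), 15);
}
```

The `drop(tx)` line matters: the consumer's loop ends only when every sender is gone, which is exactly the "stage complete" signal a shared-state design has to reinvent with counters and condvars.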

❌ Treating Shuffle as an Optimization Problem

Many teams assume Shuffle can be tuned away:

more partitions

smarter hashing

better caching

In reality, Shuffle has physical lower bounds.
The most effective optimization is often to avoid Shuffle entirely.

❌ Blurred Error Boundaries

Without clear separation between:

task-level failures

stage-level failures

query-level failures

systems become fragile.

Panics and global retries don’t scale in distributed execution.
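The alternative is to make the failure scopes explicit in the type system, so each class carries its own recovery policy. The names below are hypothetical, but the structure is the common one: task failures retry locally, stage failures re-run a stage, query failures abort; nothing escalates by accident.

```rust
// Sketch: explicit failure scopes, each with its own recovery policy.
#[derive(Debug, PartialEq)]
enum ExecError {
    Task { attempt: u32 }, // transient: retry this task, perhaps elsewhere
    Stage,                 // e.g. lost shuffle output: re-run the stage
    Query,                 // semantic or resource failure: no retry helps
}

fn recovery_action(err: &ExecError, max_task_retries: u32) -> &'static str {
    match err {
        ExecError::Task { attempt } if *attempt < max_task_retries => "retry task",
        ExecError::Task { .. } => "escalate to stage failure",
        ExecError::Stage => "re-run stage",
        ExecError::Query => "abort query",
    }
}

fn main() {
    assert_eq!(recovery_action(&ExecError::Task { attempt: 1 }, 3), "retry task");
    assert_eq!(recovery_action(&ExecError::Task { attempt: 3 }, 3), "escalate to stage failure");
    assert_eq!(recovery_action(&ExecError::Stage, 3), "re-run stage");
    assert_eq!(recovery_action(&ExecError::Query, 3), "abort query");
}
```

With a global retry, a deterministic query-level error gets retried forever; with a panic, a transient task-level error kills the query. The enum makes both mistakes hard to write.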

Hard-Won Engineering Consensus

Teams that survive multiple execution-layer rewrites tend to agree on a few things:

Avoid Shuffle whenever possible

Prefer predictability over peak throughput

Treat execution stability as a first-class concern

Accept that many “optimizations” are really damage control

The Execution Layer isn’t about running operators fast.
It’s about managing uncertainty.

Final Thoughts

If your distributed query engine keeps getting more complex in the Execution Layer — if you’re refactoring, redesigning, and questioning earlier decisions — that’s usually not a failure.

It means the system has left the idealized world and entered reality:

real data

real networks

real machines

Execution-layer complexity is the cost of operating in the real world.

If you’ve built or operated query engines, I’d love to hear how these issues showed up in your system.
