DEV Community: Phạm Hồng Phúc

React Rendering Pipeline

Phạm Hồng Phúc — Wed, 17 Jun 2026 06:42:25 +0000

Overview

This article will analyze the rendering pipeline of React from version 16 onward, when React Fiber was introduced, focused on React 18 Concurrent Mode. Instead of describing the old lifecycle model (mounting → updating → unmounting) tied to Class components, this article will follow the new model: Render phase, Commit Phase, and Effect Phase (illustrated in below image).

When React 16 introduced Fiber in 2017, the library was fully rewritten to support an interruptible rendering model, which was impossible in previous stack-based architecture. Before React Fiber, reconciliation was synchronous and could not be interrupted midway. This causes “jank”, missing frames when the component's tree grows too large. React Fiber solved the problem by breaking down the work into small units (fiber nodes), and allowing React to pause, resume or cancel the work in progress.

React 18 continually expanded the capacity with Concurrent Mode, a rendering mode that allows multiple UI versions to exist at the same time, and React actively prioritizes tasks based on their importance.

Background

Virtual DOM and Fiber tree

Virtual DOM refers to the React Element tree (component tree), plain JavaScript objects produced by JSX. When you write , React creates {type: Button, props: {color: "blue"},...}. These objects are cheap to create compared to interacting with the real DOM.

The Fiber tree is React’s internal representation, a separate, richer data structure that wraps the Virtual DOM and adds everything React needs to manage work over time. Each component in the tree maps to a fiber node, a JavaScript object that stores:

The component type and current props/state
A linked list of hooks attached to this component
A list effects that need to run (DOM mutations, layout effects. Passive effects)
Pointers to parent, first child, and next sibling fibers
Work-in-progress flags and priority metadata

The relationship between virtual DOM and fiber node is illustrated in below diagram

JSX
↓
React.createElement()
↓
React Element (Virtual DOM)
↓

React Fiber (internal work unit) wraps the element, adds metadata

Double buffering: always two trees

React maintains two fiber trees simultaneously

Current: the fiber tree currently rendered on screen
Work-in-progress: the fiber tree being built during the current render

This pattern is called double buffering, borrowed from graphics rendering. During the Render Phase, React builds the work-in-progress tree by diffing it against current. After the Commit Phase completes, the trees swap, work-in-progress becomes the new current, and the old current is recycled for the next render.

BEFORE COMMIT:
current → [what's on screen]
work-in-progress → [what React computed]

AFTER COMMIT:
current → [formerly work-in-progress, now on screen]
work-in-progress → [recycled, ready for next render]

This swap is what makes the Commit Phase atomic, the user always sees either the old tree or the new tree, never a mix.

Work loop and time slicing

React Fiber uses two separate loops:

Work loop (Render Phase), interruptible. React processes one fiber node at a time and checks frequently whether the current deadline has passed. If it has, React yields control back to the browser via the MessageChannel API and schedules resumption as a macrotask.
Commit loop (Commit Phase), non-interruptible. Runs synchronously to completion.

Scheduler and Lane Model

React 18 uses a lane modal internally, a system where each update is assigned to one or more lanes. This allows React to batch updates with the same lane and process them together, while separating updates with different lanes to handle them independently.

Lane/Priority	Timeout	Typical trigger	Note
Immediate/Sync	Synchronous	Error, emergency	Block all other work
UserBlocking	250ms	Click, keyboard input	Highest interactive priority
Normal	500ms	Data fetch, state update	Default for most updates
Transition/Low	10000ms	useTransition, useDeferredValue	Can render off-screen in parallel with higher-priority work
Idle	unlimited	Prefetch, background prepare data	Only runs when nothing else is queued

Key point: startTranstion does not just lower priority, it also signals that the update is safe to discard and restart if a higher-priority update arrives. This has direct implications idempotency.

Render Phase

The Render Phase is where React calls component functions, builds the work-in-progress fiber tree, and runs the Reconciliation algorithm to compare it against the current tree. The output is an effect list, a linked list of fiber nodes that require action in the Commit Phase. No DOM mutation occurs here.

Purity and Idempotency

Render Phase must be pure: given the same props and state, a component function must return the same React Element tree. The rule exists because the Render Phase can execute multiple times before the final result is applied (React may discard a work-in-progress tree and restart from scratch). In Concurrent Mode, when React discards and restarts a render:

Update queues on fiber nodes are preserved: pending state updates are not lost
Local variables inside the component function are lost: they belong to the discarded execution context
Side effects triggered during render cannot be cancelled: network requests continue running in the background and may return stale or unrelated data.

Note: In the development, React StrictMode intentionally calls component functions twice to detect purity violations.

Reconciliation and Diffing algorithm

Reconciliation is the process by which React compares the old (current) Fiber tree with the new Fiber tree being built (work-in-progress). React uses two main heuristics to reduce complexity from O(n³) to O(n):

Element type assumption: If the type of a node changes (e.g., from to ), React destroys the entire old subtree and rebuilds from scratch.
The role of keys: In a list, the key allows React to exactly identify which element has been moved, added, or removed without comparing the entire list.

Note: Common key error:

Don’t use index as key in a list can be re-order (items.map((item, i) => ))
Instead, use stable identifier (items.map(item => ))

After diffing, React annotates each fiber node that requires action with a tag

Placement: this node needs to be inserted into the DOM
Update: this node’s attributes or text need to change
Deletion: this node needs to be removed

These annotated nodes are linked together into the effect list, which is the direct input to the Commit Phase. The Render Phase produces this list; the Commit Phase consumes it.

How do hooks work inside the Render Phase?

Hooks are stored as a linked list on the fiber's memoizedState field. This is documented directly in ReactFiberHooks.js. Every time a hook is called during the first render (mount), React runs mountWorkInProgressHook(), which creates a new node and appends it to the list:

function mountWorkInProgressHook(): Hook {
  const hook = {
    memoizedState: null,  // stores the hook's value
    baseState: null,
    queue: null,          // update queue (useState/useReducer)
    next: null,           // pointer to the next hook node
  };

  if (workInProgressHook === null) {
    // first hook — becomes the head of the list
    currentlyRenderingFiber.memoizedState = workInProgressHook = hook;
  } else {
    // subsequent hooks — appended to the tail
    workInProgressHook = workInProgressHook.next = hook;
  }
  return workInProgressHook;
}

On every subsequent render (update), React switches to updateWorkInProgressHook(), which walks this list from the head — advancing one node per hook call, in strict call order. There is no name lookup, no key, no identifier — just sequential pointer traversal. This is the mechanical reason hooks cannot be called inside conditionals or loops: if a hook call is skipped, every node from that point onward is read by the wrong hook. Node 3's memoizedState gets read as if it belongs to Node 2's useEffect, and so on — silently producing wrong values with no error until the node count itself mismatches.

useEffect works differently from other hooks in one important way, it participates in two separate phases at two different points in time:

During the Render Phase, mountWorkInProgressHook() creates Node 2 and stores the dependency array in memoizedState. On re-renders, React reads Node 2, compares the new deps against the stored ones using Object.is, and, if anything changed, marks the fiber with a PassiveEffect flag. The effect function is not touched here. React is only deciding whether it needs to run later.
During the Effect Phase, after the browser has painted, React finds every fiber carrying the PassiveEffect flag, runs the cleanup function stored from the previous render, then runs the new effect function. This is the only point where () => { fetch(...) } actually executes.

The split exists because the Render Phase must remain pure and interruptible, calling an effect function there would violate both properties.

More detail, behavior of each hook during the Render Phase is illustrated in below table

Hook	What happens during Render Phase
useState	Returns the current value from the linked list node. The setter does not change state immediately; it enqueues an update into the fiber's update queue. The Scheduler decides when to re-render.
useReducer	Similar to useState but with a reducer function.
useEffect	Compares the dependency array via Object.is. If changed, marks the fiber with PassiveEffect. The effect function is not called here—it is scheduled to run in the Effect Phase after paint.
useLayoutEffect	Same as useEffect. If changed, marks the fiber with HookLayout. The effect function is not called here; it runs in the Commit Phase Layout sub-phase, before paint.
useMemo	Compares the dependency array using Object.is shallow equality. If any dependency has changed, recomputes the value and stores it in the node. Otherwise, returns the cached value. If dependencies are objects or arrays recreated every render, the memo is always invalidated.
useCallback	Similar to useMemo; memoizes the function reference.
useRef	Returns the same object `{ current: ... }` on every render. Mutating `.current` does not enqueue a re-render and is invisible to React's reconciliation.
useContext	Subscribes the component to a context. When the context value changes, React schedules a re-render of this component regardless of React.memo.

Batching and Concurrent Mode

In React 18, calling multiple setters within the same synchronous event handler, setTimeout, Promise.then, or native event listener results in a single re-render, all updates are batched. React collects them all into the fiber's update queue, then processes them in one pass during the next Render Phase. This is automatic batching, expanded from React 17 which only batched inside React event handlers.

With Concurrent Mode, the Render Phase is interruptible. React processes fibers one at a time in the work loop and checks after each unit of work whether a higher-priority update has arrived. If it has, React:

(1) discards the current work-in-progress tree
(2) processes the higher-priority update
(3) restarts the lower-priority render from scratch

Commit Phase

After the Render Phase completes and the effect list is finalized, React enters the Commit Phase, the phase where it actually interacts with the real DOM. Unlike the Render Phase, the Commit Phase cannot be interrupted and runs completely synchronously. The reason is that if interrupted midway, the user will see an inconsistent interface, part of it updated, part still old.

Three stages of Commit Phase

The Commit Phase is divided into three sequential steps, each step traversing the entire Fiber tree in order bottom-up (children first, parents second)

(1) Before Mutation: react reads DOM state before any changes are made. Why is this necessary? Some DOM properties, scroll position, text selection state, change unpredictably once the DOM is mutated. Reading them here, before Mutation, gives components a reliable snapshot. The snapshot is then passed to componentDidUpdate or stored in a ref for use in useLayoutEffect.
(2) Mutation: react applies the effect list to the real DOM. This is the only step where react actually interacts with the real DOM. It does not re-render the entire DOM tree, it applies the minimum set of changes computed by reconciliation. After the Mutation sub-phase completes, React swaps the two fiber trees: work-in-progress becomes the new current. For each tagged fiber node:
- Placement → parentNode.appendChild(node) or parentNode.insertBefore(node, anchor)
- Update → updates specific attributes, className, style properties, or text content
- Deletion → parentNode.removeChild(node), runs componentWillUnmount / cleanup for useLayoutEffect
(3) Layout: React runs useLayoutEffect (and componentDidMount / componentDidUpdate for Class Components) in this sub-phase. The DOM reflects the new state, but the browser has not yet painted.

After step 3, React relinquishes control of the main flow to the browser. The browser then actually paints, redraws the pixels onto the screen. This is the boundary between the Commit Phase and the Effect Phase.

The architecture helps React prevent Layout Thrashing, which occurs when code alternates between reading and writing to the DOM within the same frame. Based on the above architecture, all writing tasks are done in Mutation step; reading tasks are executed via useLayoutEffect, after writing.

useLayoutEffect

useLayoutEffect runs at the step 3, after React has updated the DOM but before the browser paints. This is the only time the DOM can be read and written synchronously without flickering, as the user hasn't seen any changes yet.

useLayoutEffect common use cases

Measuring element size/position: getBoundingClientRect(), offsetHeight, scrollWidth…
Setting focus: ref.current.focus() immediately after an element appears in the DOM
Synchronizing external animation libraries
Calculating tooltip/popover position based on the actual DOM size

When useLayoutEffect calls setState, React flushes synchronously, implementing the new Render Phase and the new Commit Phase immediately, before handing control to the browser. Users only see the final result, not the intermediate state. This is the core difference compared to useEffect.

Warning about useLayoutEffect

Because useLayoutEffect blocks browser paint, it can cause stuttering if heavy calculations are performed.
useLayoutEffect should not be used for operations that do not need to synchronize with DOM paint.
Server-Side Rendering (SSR): useLayoutEffect does not run on the server, use useEffect or check the typeof window !== "undefined".

Effect Phase

The Effect Phase is the final stage, occurring after the browser has finished painting. Effects are executed asynchronously and do not block the main thread, ensuring the UI remains responsive to the user.

useEffect

React schedules useEffect via the MessageChannel API (not a microtask like Promise, and doesn’t like setTimeout). This ensures the effect runs after the browser paints, but still much earlier than setTimeout.

Microtask (Promise.then): runs before the browser has a chance to paint. If effects ran as microtasks, they would execute before the user sees the new UI — blocking paint.
setTimeout(fn, 0): runs after paint, but browsers throttle it (minimum ~4ms; more when the tab is in the background). Chained setTimeout calls accumulate significant delay.
MessageChannel: runs after paint, not throttled, classified as a macrotask. React uses it to run effects as early as possible after paint without blocking it.

This is why useEffect "feels fast" despite being asynchronous, it runs in the first available macrotask after the browser paints, typically within a single frame. The useEffect render lifecycle is:

React runs cleanup function (return callback) of effect from previous render
React runs the new effect function

Both cleanup function and effect function run bottom-up (Children → Parent → App). React ensures that the child component's cleanup runs before the parent component's cleanup. This helps the parent component remain "alive" while the child component is cleaning up, preventing the child from needing to access the parent's resources but the parent has already cleaned up.

Question: Why doesn’t useEffect run in Render Phase?

Effects usually interact with external services (API, WebSocket, browser APIs), these are not idempotent. If useEffect ran during render and React cancelled that render midway, the network request would still be in-flight. React cannot cancel it. The app might receive a response from a zombie request and update state with stale or unrelated data.

Suspense and the Pipeline

Suspense is worth understanding as a concrete example of "multiple UI versions in memory at the same time" — the core promise of Concurrent Mode.
When a component suspends (throws a Promise during render), React does not commit that subtree. Instead:

React renders the nearest Suspense boundary's fallback in place of the suspended subtree
The suspended work-in-progress tree is kept in memory — not discarded
When the Promise resolves, React retries rendering the suspended subtree from the beginning
If the retry succeeds, React replaces the fallback with the real content in a single atomic commit

This is why component functions used inside Suspense must be pure and idempotent — React will call them more than once before committing, and the results must be consistent.

Conclusion

The three-phase model, Render → Commit → Effect, is not just a description of what the Class Component lifecycle does. It's a new way of thinking that accurately reflects the internal architecture of React Fiber and forms the foundation for understanding advanced features like Concurrent Mode, Suspense, and Server Components. Three core principles to remember:

The Render Phase must be pure: No side effects, no interaction with the DOM or external services.
The Commit Phase is the boundary of the DOM: All operations with the actual DOM occur here, synchronously and uninterruptible.
The Effect Phase is where side effects occur: Asynchronous, after the UI has rendered, with a clear cleanup.

Understanding these three phases allows developers to make the right decisions about where to place logic, choose the right hooks, and design components compatible with Concurrent Mode — an increasingly important requirement as React continues to evolve towards interruptible, prioritized rendering.

Reference

Redis's Event-Driven Architecture and the ae Event Loop

Phạm Hồng Phúc — Fri, 12 Jun 2026 08:38:01 +0000

One of the most common questions about Redis is: "Redis is single-threaded, so how can it handle thousands of concurrent connections?". But the more interesting question is: “Why would we need thousands of threads to handle thousands of connections in the first place?”
The answer lies in understanding the difference between doing work and waiting for work. A connection spends most of its lifetime waiting for data to arrive from the network. Waiting is not computation. If a server creates one thread for every connection, many of those threads spend most of their time blocked on I/O operations. Although blocked threads consume little CPU time, they still require memory for their stacks and introduce scheduling overhead.
Redis avoids this problem by using an event-driven architecture built on I/O multiplexing. Instead of dedicating one thread to each connection, a single thread asks the operating system: “Which connections are actually ready for work right now?”. The thread then processes only those connections.

Blocking I/O - The traditional model

The simplest server implementation uses blocking I/O:

while (1) {
    int fd = accept(server_fd, ...);   // wait for new connection
    handle_client(fd);                  // read, process, reply
    close(fd);                          // only then accept the next client
}

This design has an obvious limitation. While handle_client() is waiting for a client to send data, the entire server is blocked. No other connections can be accepted or processed. A traditional solution is to create one thread per connection:

while (1) {
    int fd = accept(server_fd, ...);
    pthread_create(&tid, NULL, handle_client, (void*)fd);
}

This model can work well at small scales. However, with thousands of concurrent connections, the overhead becomes significant. On many Linux systems, each thread reserves several megabytes of stack space by default. In addition, the OS must continually schedule and switch between threads, causing context-switch overhead.

Non-blocking I/O

Instead of allowing read() to block until data becomes available, a file descriptor can be configured non-blocking mode:

fcntl(fd, F_SETFL, O_NONBLOCK); 
ssize_t n = read(fd, buf, sizeof(buf)); 
if (n == -1 && errno == EAGAIN) { 
    // No data available yet 
}

The problem now becomes determining when to try again. Continuously checking every connection would waste CPU resources. This approach, known as busy waiting, keeps the CPU fully occupied even when no useful work is being performed. What is needed is a mechanism that allows the operating system to notify the application only when a file descriptor becomes ready.

I/O multiplexing

I/O multiplexing enables a single thread to monitor many file descriptors simultaneously.

select(): The first generation

fd_set read_fds;
FD_ZERO(&read_fds);
FD_SET(fd1, &read_fds);
FD_SET(fd2, &read_fds);
FD_SET(fd3, &read_fds);

// Block until at least one fd is ready
select(max_fd + 1, &read_fds, NULL, NULL, NULL);

// Then scan everything to find which ones are ready
for (int i = 0; i <= max_fd; i++) {
    if (FD_ISSET(i, &read_fds)) {
        read(i, buf, ...);
    }
}

select() has two major limitations

It is typically limited to FD_SETSIZE file descriptors (often 1024).
Each invocation requires scanning the entire set of descriptors, resulting in O(n) complexity.

pool() removes the fixed descriptor limit, but it still requires scanning all registered descriptors after each call.
epoll (in Linux) or kqueue (on maxos/bsd with an equivalent design) solves both problems by inverting the design: instead of handing the kernel a list to check on every call, you register once, and the kernel only notifies you about fds that actually have events.

int epfd = epoll_create1(0); 
struct epoll_event ev; 
ev.events = EPOLLIN; 
ev.data.fd = client_fd; 
epoll_ctl(epfd, EPOLL_CTL_ADD, client_fd, &ev); 
struct epoll_event events[MAX_EVENTS]; 
while (1) { 
    int n = epoll_wait(epfd, events, MAX_EVENTS, -1); 
    for (int i = 0; i < n; i++) { 
        handle(events[i].data.fd); 
    } 
}

The key advantage is that epoll_wait() returns only the file descriptors that are ready. If 10,000 connections are registered but only three receive data, Redis processes only those three connections instead of scanning all 10,000.

The ae event library

Redis does not use libraries such as libevent or libuv. Instead, Redis implements its own lightweight event library called ae (A simple Event Library, https://github.com/redis/redis/blob/unstable/src/ae.c). The central data structure is aeEventLoop:

typedef struct aeEventLoop { 
    int maxfd; 
    int setsize; 
    aeFileEvent *events; 
    aeFiredEvent *fired; 
    aeTimeEvent *timeEventHead; 
    aeApiState *apidata; 
} aeEventLoop;

Conceptually, the event loop consists of three parts:

aeEventLoop
├── File events
│ ├── acceptTcpHandler
│ ├── readQueryFromClient
│ └── sendReplyToClient
│
├── Time events
│ └── serverCron
│
└── Backend API
└── epoll / kqueue / select

File events handle socket activity. Time events execute periodic tasks that Redis must perform regardless of network activity. The backend API abstracts platform-specific multiplexing mechanisms.

The main event loop

Redis spends most of its lifetime executing the following loop:

void aeMain(aeEventLoop *eventLoop) { 
    eventLoop->stop = 0; 
    while (!eventLoop->stop) { 
        aeProcessEvents(eventLoop, 
        AE_ALL_EVENTS | 
        AE_CALL_BEFORE_SLEEP | 
        AE_CALL_AFTER_SLEEP); 
    } 
}

Conceptually, each iteration follows this sequence:

aeMain()
↓
aeProcessEvents()
↓
epoll_wait() / kqueue()
↓
Process file events
↓
Process time events
↓
Repeat forever

This design allows Redis to react efficiently to both incoming network requests and scheduled maintenance tasks.

File events and time events

Redis supports two categories of events.

File events

File events are triggered by socket activity. Examples include: acceptTcpHandler, readQueryFromClient, sendReplyToClient. These handlers manage client connections and network communication.

Time events

Time events execute periodically. The most important example is serverCron(). By default, Redis executes serverCron() approximately every 100 milliseconds (determined by the hz configuration parameter). serverCron() performs tasks such as:

Running the active expiration cycle
Collecting statistics
Managing client timeouts
Maintaining replication state,
Performing persistence-related housekeeping,
Executing cluster maintenance tasks.

Without time events, Redis would respond only to network activity and could not perform background maintenance.

Full lifecycle of a request inside ae event loop

To make it concrete, trace a SET key value from start to finish:

Step 1 (server start): During initialization, Redis registers the listening socket: server socket → acceptTcpHandler. Whenever a new connection arrives, the event loop invokes acceptTcpHandler().
Step 2 (client connects): When epoll_wait() reports that the server socket is ready: accept() → new client fd → aeCreateFileEvent(..., readQueryFromClient). Redis registers a readable file event for the client socket.
Step 3 (receive the command): The client sends 3\r\n$3\r\nSET\r\n.... The event loop detects that the client socket is readable and invokes: readQueryFromClient(). The command is copied into: client→querybuf.
Step 4 (parse and execute): Redis parses the RESP protocol, identifies the SET command, locates the appropriate command implementation, and executes it. The reply is generated in memory and appended to the client's output buffer. No I/O happens at this step, it is purely a memory operation.
Step 5 (send the reply): In the same loop iteration (after processing all read events), Redis calls the write handler to write() the reply buffer to the socket. If the reply buffer is large and cannot be flushed in one call, Redis registers a write event handler to continue flushing on the next iteration.

Why Single-Threaded Execution Works Well

A common misconception is that more threads always improve performance. Additional threads are beneficial primarily when the workload is limited by available CPU resources. Many Redis operations, such as GET and SET, perform relatively little computation: lookup key → retrieve value from memory → generate response. For these workloads, using multiple execution threads can increase overhead due to:

synchronization mechanisms protecting shared data,
context switching performed by the operating system,
cache coherence traffic between CPU cores.

By executing commands in a single thread, Redis avoids these costs entirely. This design simplifies the implementation and enables very high throughput for typical in-memory workloads.

The limits of this model

Single-threading has one clear weakness: one blocking command blocks the entire server.

KEYS * on a database with 10 million keys → O(n) scan → every other client waits for the entire duration. This is why KEYS is banned in production and replaced by SCAN (cursor-based, scanning a small portion per call).

Similarly: LRANGE mylist 0 -1 on a list with a million elements, SORT without LIMIT, SMEMBERS on a huge set — all commands that can stall the event loop.

Redis 6.0 partially addressed this with threaded I/O: still single-threaded for command execution, but uses multiple threads for reading and writing sockets. The reason: at high connection rates, read() and write() syscalls start consuming a meaningful share of time relative to command execution. Threaded I/O lets Redis exploit multiple cores without breaking the data model.

Redis 7.0 goes further with Redis Cluster sharding, distributing both data and load across multiple processes — each process still single-threaded — scaling out rather than up.

What actually happens when a Redis client connects?

Phạm Hồng Phúc — Thu, 11 Jun 2026 13:53:24 +0000

Nowadays, almost all Redis deployments use TCP as the primary connection protocol. Redis also supports Unix Domain Socket (UDS) when the client and server run on the same machine. UDS bypasses the TCP/IP stack and network interface entirely, typically reducing latency by 30–40% compared to TCP localhost — though the exact gain depends on workload. Because UDS requires co-location, TCP remains the universal default.

Why Redis uses TCP?

TCP fits Redis for the following reasons

Persistent, stateful connections: Redis processes one command at a time per client, in strict order: the client sends a command, waits for the reply, then sends the next. This request-response model requires a persistent, stateful connection, which TCP provides. Each TCP connection is uniquely identified by a 4-tuple (src_ip, src_port, dst_ip, dst_port), letting Redis maintain per-client state: current database index, transaction state (MULTI/EXEC), subscription lists, and so on. UDP is connectionless and stateless; a server would have to re-identify the client on every single datagram, pushing all that state management into application code.
In-order delivery: Redis uses RESP (Redis Serialization Protocol), which is a stream-based protocol. Commands arrive as a continuous byte stream, and Redis parses them sequentially. If bytes arrived out of order, the parser would break. TCP guarantees the stream is always ordered and complete.
Reliability: TCP automatically retransmits lost packets. Redis does not need to write any retry logic itself; the OS handles it transparently.
Flow control: TCP's sliding window prevents a fast client from overwhelming Redis's read buffer. Every TCP packet Redis sends back to the client carries an rwnd (receive window) value, a number the OS stamps automatically, representing free space remaining in the buffer. The client treats this as a hard cap on how much data it can have in flight. As Redis reads and drains its buffer, rwnd grows, and the client can send more. If Redis falls behind, rwnd shrinks toward zero, and the client throttles itself automatically. Redis never writes a line of code for this — the kernel manages it entirely.
Pipelining: Because TCP preserves byte order across the entire stream, clients can send many commands in one batch without waiting for individual replies. The server reads commands back-to-back from the stream and sends replies in the same order. This is pipelining, and it would be impossible without a reliable, ordered byte stream.

The above diagram describes three phases in Redis server when a Redis client connects for the first time

Accept phase
- The OS and Redis work asynchronously. When a client calls connect(), the OS kernel handles the entire TCP handshake (SYN → SYN-ACK → ACK) on its own, Redis is not involved. Once the handshake completes, the connection is pushed into the accept queue, sitting there until Redis is ready to pick it up.
- Meanwhile, Redis may be busy executing a command for another client. When it finishes, the event loop calls epoll_wait(). If a connection is waiting in the queue, epoll reports it, and only then does Redis call accept(). If Redis is already idle, it calls accept() almost immediately.
Register phase: accept() returns a new file descriptor, an integer that uniquely identifies the connection (for example, fd = 7). Redis registers this fd with epoll (Linux) or kqueue (macOS/BSD). From this point, the OS automatically notifies Redis when data arrives on that fd, so Redis never has to poll in a loop. (To be more understandable, you should read about AE Event Loop).
Allocate phase: Redis allocates a client struct in memory for the connection. This includes a 16 KB read buffer to hold incoming command bytes streaming in over TCP, and a write buffer to hold responses waiting to be sent back. The buffer exists because TCP can split a single command like SET key value across multiple small segments — Redis collects the bytes until it has a complete command before parsing.

The cost of a connection

Opening a connection requires:

A file descriptor (an integer in the kernel)
A client struct in Redis memory, including the 16 KB read buffer and write buffer — roughly ~20 KB of RAM per connection in total
A slot in epoll's interest list
- epoll is a Linux kernel mechanism for monitoring multiple file descriptors simultaneously. Redis uses it to know when a client sends data — without constantly polling ("anything yet? anything yet?").
- epoll maintains an interest list inside the kernel — essentially a table of file descriptors that Redis has registered and wants to be notified about. Each entry in that table is a slot, containing: the file descriptor to watch (e.g. fd = 7), the event type to listen for (e.g. EPOLLIN — data is ready to read), and a pointer back to the corresponding client struct in Redis memory.
- When Redis calls epoll_ctl(ADD, fd) during the Register phase, it is essentially telling the kernel: "add this fd to the interest list, and notify me when it has data."
- Each slot occupies a small amount of kernel memory (a few dozen bytes). More importantly, epoll has a limit on how many fds it can watch simultaneously — a limit typically bounded by ulimit -n at the OS level. So every new connection doesn't just cost RAM on the Redis side; it also consumes a finite slot in the kernel's interest list. In systems with thousands of clients, microservices, workers, cron jobs, if every service opens its own dedicated connection, it is easy to hit Redis's maxclients limit (default: 10000) or the OS-level ulimit -n. The solution is connection pooling.

Connection Pooling

A connection pool is a group of TCP connections that are pre-created and reused. Instead of an application running connect() → use → close() on every request, the pool keeps connections alive and lends them out as needed. Two key configuration values to understand:

minIdle — the minimum number of connections kept ready at all times. This reduces latency spikes when traffic ramps up suddenly, since connections are already warm.
maxTotal — the upper limit on total connections in the pool. This prevents a single application from exhausting Redis's connection slots. Notice: connection pooling works best for long-running processes. In serverless or ephemeral environments where instances spin up and down frequently, persistent pooled connections can actually cause more churn, you may need a different strategy such as a sidecar proxy.

Pipelining

Normally, each command waits for the previous reply before the next command is sent. If the round-trip time (RTT) between client and server is 1 ms, 100 sequential commands take 100 ms of network wait time, even though each command executes in under 1 µs on the server.

Pipelining solves this by sending multiple commands in a single write, without waiting for each response. The client batches commands at the application layer; TCP delivers them in order; Redis reads and executes them sequentially and sends back all replies in one go. The result: 100 commands might complete in just over 1 ms instead of 100 ms.

One common misconception: pipelining is not a transaction. Commands are executed in order, but if command A fails, command B still executes. There is no atomicity. If you need all-or-nothing semantics, use MULTI/EXEC or a Lua script instead.

You can read the full details in the official Redis pipelining documentation.