DEV Community

Cover image for Understanding Node.js Cluster: The Core Concepts
Leapcell
Leapcell

Posted on

3 1 1 1 1

Understanding Node.js Cluster: The Core Concepts

Cover

Preface

If you've used PM2 to manage Node.js processes, you may have noticed it supports a cluster mode. This mode allows Node.js to create multiple processes. When you set the number of instances in cluster mode to max, PM2 will automatically create a number of Node processes corresponding to the CPU cores available on the server.

PM2 achieves this by leveraging Node.js’s Cluster module. The module addresses Node.js's single-threaded nature, which traditionally limits its ability to utilize multiple CPU cores. But how does the Cluster module work internally? How do the processes communicate with each other? How can multiple processes listen on the same port? And how does Node.js distribute requests to these processes? If you’re curious about these questions, read on.

Core Principles

Node.js worker processes are created using the child_process.fork() method. This means there is one parent process and multiple child processes. The code typically looks like this:

const cluster = require('cluster');
const os = require('os');

if (cluster.isMaster) {
  for (let i = 0, n = os.cpus().length; i < n; i++) {
    cluster.fork();
  }
} else {
  // Start the application
}
Enter fullscreen mode Exit fullscreen mode

If you’ve studied operating systems, you’re probably familiar with the fork() system call. The calling process is the parent, while the newly created processes are the children. These child processes share the same data segment and stack as the parent, but their physical memory spaces are not necessarily shared. In a Node.js Cluster, the master process listens on the port and distributes incoming requests to the worker processes. This involves addressing three core topics: inter-process communication (IPC), load balancing strategies, and multi-process port listening.

Inter-Process Communication (IPC)

The master process creates child processes using process.fork(). Communication between these processes is handled via an IPC channel. Operating systems provide several mechanisms for inter-process communication, such as:

  1. Shared Memory Multiple processes share a single memory space, often managed with semaphores for synchronization and mutual exclusion.
  2. Message Passing

    Processes exchange data by sending and receiving messages.

  3. Semaphores

    A semaphore is a system-assigned status value. Processes lacking control will be forced to halt at specific checkpoints, waiting for a signal to proceed. When limited to binary values (0 or 1), this mechanism is known as a "mutex" (mutual exclusion lock).

  4. Pipes

    Pipes connect two processes, allowing the output of one process to serve as the input for another. This can be created using the pipe system call. The | command in shell scripting is a common example of this mechanism.

Node.js uses an event-based mechanism for communication between the parent and child processes. Here’s an example of a parent process sending a TCP server handle to a child process:

const subprocess = require('child_process').fork('subprocess.js');

// Create a server and send its handle.
const server = require('net').createServer();
server.on('connection', (socket) => {
  socket.end('Handled by the parent process');
});
server.listen(1337, () => {
  subprocess.send('server', server);
});
process.on('message', (m, server) => {
  if (m === 'server') {
    server.on('connection', (socket) => {
      socket.end('Handled by the child process');
    });
  }
});
Enter fullscreen mode Exit fullscreen mode

Load Balancing Strategy

As mentioned earlier, all requests are distributed by the master process. Ensuring the server load is evenly distributed among worker processes requires a load balancing strategy. Node.js uses a round-robin algorithm by default.

Round-Robin

The round-robin method is a common load balancing algorithm also employed by Nginx. It works by distributing incoming requests to each process sequentially, starting from the first process and looping back after reaching the last. However, this method assumes equal processing capacity across all processes. In scenarios where request handling time varies significantly, load imbalance may occur.

To address this, Nginx often uses Weighted Round-Robin (WRR), where servers are assigned different weights. The server with the highest weight is selected until its weight is reduced to zero, at which point the cycle starts over based on the new weight sequence.

You can adjust the load balancing strategy in Node.js by setting the NODE_CLUSTER_SCHED_POLICY environment variable or configuring it via cluster.setupMaster(options). Combining Nginx for multi-machine clusters with Node.js Cluster for single-machine multi-process balancing is a common approach.

Multi-Process Port Listening

In early versions of Node.js, multiple processes listening on the same port competed for incoming connections, leading to uneven load distribution. This was later resolved with the round-robin strategy. The current approach works as follows:

  1. The master process creates a socket, binds it to an address, and starts listening.
  2. The socket’s file descriptor (fd) is not passed to the worker processes.
  3. When the master process accepts a new connection, it determines which worker process should handle the connection and forwards it accordingly.

In essence, the master process listens on the port and distributes connections to worker processes using a defined strategy (e.g., round-robin). This design eliminates competition between workers but requires the master process to be highly stable.

Conclusion

Using PM2’s Cluster Mode as an entry point, this article explored the core principles behind Node.js’s Cluster module for implementing multi-process applications. We focused on three key aspects: inter-process communication, load balancing, and multi-process port listening.

By studying the Cluster module, we can see that many fundamental principles and algorithms are universal. For instance, the round-robin algorithm is used in both operating system process scheduling and server load balancing. The master-worker architecture resembles the multi-process design in Nginx. Similarly, mechanisms like semaphores and pipes are ubiquitous in various programming paradigms.

While new technologies continuously emerge, their foundations remain consistent. Understanding these core concepts enables us to extrapolate and adapt to new challenges with confidence.


We are Leapcell, your top choice for deploying Node.js projects to the cloud.

Leapcell

Leapcell is the Next-Gen Serverless Platform for Web Hosting, Async Tasks, and Redis:

Multi-Language Support

  • Develop with Node.js, Python, Go, or Rust.

Deploy unlimited projects for free

  • pay only for usage — no requests, no charges.

Unbeatable Cost Efficiency

  • Pay-as-you-go with no idle charges.
  • Example: $25 supports 6.94M requests at a 60ms average response time.

Streamlined Developer Experience

  • Intuitive UI for effortless setup.
  • Fully automated CI/CD pipelines and GitOps integration.
  • Real-time metrics and logging for actionable insights.

Effortless Scalability and High Performance

  • Auto-scaling to handle high concurrency with ease.
  • Zero operational overhead — just focus on building.

Explore more in the Documentation!

Try Leapcell

Follow us on X: @LeapcellHQ


Read on our blog

Sentry blog image

How I fixed 20 seconds of lag for every user in just 20 minutes.

Our AI agent was running 10-20 seconds slower than it should, impacting both our own developers and our early adopters. See how I used Sentry Profiling to fix it in record time.

Read more

Top comments (1)

Collapse
 
devtostd profile image
Dev Studio

The Most Contextual AI Development Assistant

Pieces.app image

Our centralized storage agent works on-device, unifying various developer tools to proactively capture and enrich useful materials, streamline collaboration, and solve complex problems through a contextual understanding of your unique workflow.

👥 Ideal for solo developers, teams, and cross-company projects

Learn more

👋 Kindness is contagious

Dive into an ocean of knowledge with this thought-provoking post, revered deeply within the supportive DEV Community. Developers of all levels are welcome to join and enhance our collective intelligence.

Saying a simple "thank you" can brighten someone's day. Share your gratitude in the comments below!

On DEV, sharing ideas eases our path and fortifies our community connections. Found this helpful? Sending a quick thanks to the author can be profoundly valued.

Okay