Hrishikesh Dalal

Posted on Jan 1

EP 6.4: What is SHARDING?

#database #resources

In the world of system design, there is a common saying: "Don't shard until you absolutely have to." While sharding offers virtually unlimited scaling potential, it introduces a level of operational complexity that can cripple a small engineering team.

This article I explore what sharding is, why it differs from simple partitioning, the strategies for implementing it. So let us get intoo it.

What is Sharding? (We know Partitioning, so why need Sharding)

To understand sharding, you must first understand Horizontal Partitioning. In partitioning, you take a massive table and split it into smaller "chunks" stored on the same server. The database engine (like PostgreSQL) handles the logic of finding which chunk holds your data.

Sharding takes this a step further. Instead of keeping those chunks on one machine, you place them on entirely different database servers, known as Shards.

The Core Difference:

Partitioning -> database manages the complexity.
Sharding -> you (the application developer) manage the complexity.

Partitioning -> Tables
Sharding -> Databases

The Shard Key:)

When you split a table across multiple servers, you need a rule to decide where each row goes. This is determined by the Sharding Key.

If you shard based on user_id:

Server 1 (Shard 1): Stores users with IDs 1–1000.

Server 2 (Shard 2): Stores users with IDs 1001–2000.

The Sharding Key is the most critical decision in this architecture. A poor key leads to "Hotspots", where one server is at 99% CPU while the others are idling at 5%.

3. Sharding Strategies

There is no "one size fits all" way to distribute data. Here are the four primary strategies used in the industry:

A. Range-Based Sharding

Data is divided into continuous ranges of the shard key.

Ex: Shard A (A-M), Shard B (N-Z).
Pros: Very easy to reason about and implement.
Cons: Leads to Data Skew. If you have 1 million users whose names start with 'S' and only 10 users starting with 'X', Shard B will be overloaded while Shard A sits empty.

B. Hash-Based Sharding

A hash function is applied to the shard key (), where is the number of shards.

Pros: Provides a very even distribution of data across all servers.
Cons: Resharding is a nightmare. If you grow from 3 shards to 4, the result of the modulo operation changes for almost every key, requiring you to move nearly all your data to new servers.

C. Geographic/Entity-Based Sharding

Data is grouped by a logical attribute like region or country.

Ex: European users on a Dublin server; Asian users on a Singapore server.
Pros: Reduces latency for local users and helps with data residency laws (GDPR).
Cons: If your app suddenly goes viral in one specific country, that shard becomes a bottleneck.

D. Directory-Based Sharding

A separate "Lookup Service" or "Mapping Table" keeps track of which shard holds which data.

Pros: Maximum flexibility. You can move a single user from Shard 1 to Shard 5 without changing any hashing logic.
Cons: The directory itself becomes a Single Point of Failure and a performance bottleneck. Every query now requires two hits: one to the directory and one to the shard.

4. Why Sharding is Difficult :(

If sharding allows you to scale to billions of users, why do engineers avoid it?

I. Application Complexity

Since the data is on different servers, the database can't help you find it. Your application code must become "Shard Aware." You have to write logic that says: "If the user is trying to log in with ID 505, connect to Database Server 2." This makes your codebase significantly harder to maintain.

II. The "Join" Problem

In a single database, joining two tables is easy. In a sharded environment, if the Users table is on Shard A and the Orders table is on Shard B, you cannot perform a SQL Join. You must pull the data from both servers into your application memory and "join" them manually—an operation that is slow and memory-intensive.

III. Loss of Transactional Consistency

Classic databases offer ACID properties (Atomicity, Consistency, Isolation, Durability). In sharding, a "transaction" that spans two shards is nearly impossible to guarantee. If Shard A updates successfully but Shard B fails, your data is now in a corrupted, inconsistent state.

5. When should you actually Shard?

Before you choose sharding, you should have already exhausted these three steps:

Vertical Scaling: Buy a bigger server with more RAM and CPU.
Read Replicas (Master-Slave): Use the architecture we discussed earlier to offload all "Read" traffic to Slaves.
Caching: Use Redis to stop 80% of requests from even hitting your database.

DEV Community