Back to list
Lv.2

Sharding

Sharding

A technique that splits massive datasets into smaller chunks distributed across multiple servers to speed up processing.

In Simple Terms

Sharding is a system that splits overgrown data into smaller chunks and manages them on separate computers. When data concentrates in one place, processing slows down—but by distributing it, multiple tasks can be handled simultaneously. For example, a membership list might be split so that entries A through M are stored on one server and N through Z on another, making searches faster. Even as a service grows and users surge, this approach lets you scale the system to keep up.

Behind the Name

The name comes from the English word "shard," meaning a small fragment or broken piece. Just as a large object can shatter into shards, a massive database is broken into smaller fragments—each managed independently. Because data is divided along horizontal lines (splitting rows across servers), the technique is also commonly known as "horizontal partitioning."

Take a Closer Look!

Sharding is a technique that splits a single massive database into multiple smaller units and distributes them across separate servers.
When data volume becomes enormous, a single server can no longer keep up with the load—but this approach distributes the burden and maintains overall performance.

Specifically, data is divided horizontally by rows and stored in different locations.
For example, records 1 through 100 go to Server A and records 101 through 200 go to Server B—data is assigned according to a defined routing rule.
With this setup, even under heavy traffic, multiple servers can share the workload in parallel, reducing wait times.

That said, because data is scattered across different locations, aggregating the full dataset requires additional complexity.
You also need a mechanism that always tracks which data lives on which server.
Sharding plays a critical role in keeping services running smoothly—think social media platforms or online games that handle simultaneous access from users around the world.