Back to list
Lv.2

Scalability

Scalability

The ability of a system to flexibly expand its capacity in response to increased demand.

In Simple Terms

Scalability is a system's ability to grow so it keeps running smoothly even when the number of users suddenly spikes. For example, when an online store kicks off a sale and traffic surges, being able to spin up more servers on the fly to handle the load is called 'high scalability.' On the flip side, a system that slows down or crashes the moment traffic increases is said to have low scalability.

Behind the Name

The word 'scalability' combines 'scale' (size or capacity) and 'ability' (the power to do something). It describes how well a system can adjust its own size and capacity as needs grow or shrink — increasing or decreasing its power to match changing workloads.

Take a Closer Look!

Scalability refers to a computer system's or network's ability to improve its performance flexibly as workload increases — in other words, how well a system can expand its capacity to meet growing demand.

There are two main approaches to boosting performance.
The first is 'scale up' (vertical scaling), which means upgrading the power of an existing server. The second is 'scale out' (horizontal scaling), which means adding more servers to share the load.

To put it simply, scaling up is like installing better parts inside a single computer to make it more powerful, while scaling out is like lining up multiple computers and having them split the work.
With cloud services now mainstream, these adjustments can be made quickly and on demand, which is why scalability has become a key consideration in modern system design.

When scalability is low, a sudden surge in users — such as during a viral trend or flash sale — can overwhelm the system and bring the service down.
That's why designing with scalability in mind from the start, with future growth already factored in, is one of the most important factors in running a web service reliably.