Back to list
Lv.1

Auto Scaling

Auto Scaling

A mechanism that automatically scales the number of servers up or down based on traffic volume.

In Simple Terms

Auto scaling is a system that automatically adds or removes servers when web traffic increases or decreases. For example, even if a site gets a sudden rush of visitors only during lunch hour, the computer adjusts its capacity on its own. When traffic is low, servers are scaled down to save resources; when things get busy, they scale back up.

Behind the Name

"Auto Scaling" — "Auto" means automatic, and "Scaling" means adjusting size or capacity. The word "scale" refers to size or proportion, and the name comes from the idea of stretching or shrinking a system's capacity to match demand.

Take a Closer Look!

Auto scaling is a feature in cloud services and similar platforms that automatically adjusts server capacity based on the amount of incoming traffic.
When traffic spikes and servers are at risk of being overwhelmed, new servers are added automatically to prevent the system from going down.

Conversely, when traffic drops and resources sit idle, those extra servers are automatically shut down to bring things back to baseline.
This means websites and apps can stay fast and responsive around the clock — without anyone having to monitor screens 24/7.

In the past, you had to anticipate traffic spikes in advance and provision large servers ahead of time.
The downside was that during off-peak hours, all that computing power sat unused — leading to wasted costs.

To put it simply, think of it like a store that automatically opens more checkout lanes when it gets busy and closes them again when things quiet down.
The ability to cut unnecessary costs while adapting flexibly to sudden traffic changes is one of the key advantages when running a system.