Auto Scaling

An automated, low-cost, highly efficient and highly fault-tolerant computing resource management policy


Auto Scaling (AS) provides you with a highly efficient management policy for computing resources. You can set the time to execute the management policies regularly or create a real-time monitoring policy to manage the number of CVM instances and deploy the environment for the instances, ensuring that your business runs smoothly. AS automatically increases the number of CVM instances during periods of peak demand to maintain optimal instance performance and decreases the number of CVM instances during periods of low demand to reduce costs.



AS automatically creates and removes CVM instances in real time based on the business load, ensuring that you are running the optimal number of instances and eliminating the need for manual deployment.


AS helps you maintain an optimal number of instances for variable business demand. When the demand increases, AS will automatically and quickly add new CVM instances. Conversely, when the demand decreases, AS will automatically remove unnecessary instances, improving device utilization and reducing deployment and instance costs.


AS allows you to schedule scaling to respond to regular changes in business load (e.g. scaling up at 9 p.m. every day).


AS automatically checks the health of your instances. Once AS detects a faulty instance, it will automatically create a healthy instance to replace the faulty one. This ensures that your application obtains the expected computing capacity so that your business can run normally and smoothly.

Easy to Audit

When using AS, you can record the trigger conditions, time, involved instances and the reasons for success/failure related to each scaling event. AS offers visualized tracking, interface querying and SMS notifications to help you promptly locate and solve the underlying problems.


Tencent Cloud AS can create a scaling policy based on your business needs to automatically manage your CVM computing resources for high efficiency, low costs and prompt fault tolerance.
Alarm-triggered Scaling

You can customize an alarm policy to adjust the business deployment based on the CVM metrics. When a metric (e.g. CPU utilization, memory utilization, outbound and inbound bandwidth on the private network, outbound and inbound bandwidth on the public network) reaches the threshold due to the business load, the policy can automatically increase or decrease the number of CVM instances to flexibly deal with changes in business load, improving device utilization and reducing deployment and instance costs. The monitoring cycle is 1 minute.


Logic-layer CVM Scaling for Flexible Web Services

Business type

E-commerce websites, video websites, online education, etc.


The requests from the clients reach the application CVMs through CLB. When the access volume changes rapidly, AS can flexibly scale up or down the instances based on the amount of requests.

High Performance Computing Cluster Deployment

Business type

Backend computing clusters such as the computing nodes of distributed big data and data indexing servers.


The number of CVMs within the cluster is scaled up or down in real time according to the amount of computation.

Request CVM Deployment

Business type

CVM clusters that send requests or collect data.


As this type of business has high requirements for timeliness, you can use AS to rapidly create and remove request CVMs.


Auto Scaling (AS) is free of charge. However, while using AS, fees for other related products such as pay-as-you-go Cloud Virtual Machine (CVM) may be incurred.