Auto Scaling

Automated, low-cost computing resource management policy with planning capability, efficiency and high fault tolerance


Auto Scaling (AS) provides an efficient management policy for computing resources. You can schedule a scaling policy to run at set times, or create a real-time monitoring policy that manages the number of CVM instances and deploys the environment on new instances, so that your business runs smoothly. AS automatically increases the number of CVM instances during demand surges to maintain performance and decreases it during lulls to reduce costs.



AS creates and removes CVM instances dynamically and in real time based on the business load, so that you always run the optimal number of instances without manual intervention.

Cost Saving

AS helps you maintain an optimal number of instances as business demand varies. When demand rises, AS automatically adds CVM instances; when demand drops, it removes the instances that are no longer needed. This improves device utilization and reduces deployment and instance costs.


AS lets you schedule scaling activities to handle regular changes in business load (e.g., scaling up at 21:00 every day).
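The scheduling idea can be illustrated with a minimal sketch. It assumes a daily schedule in which each action sets a target instance count at a fixed time; `ScheduledAction` and `capacity_at` are hypothetical names chosen for illustration and are not part of the AS API.

```python
from dataclasses import dataclass
from datetime import datetime, time


@dataclass
class ScheduledAction:
    """Hypothetical daily action: at `start`, set the group to `desired` instances."""
    start: time
    desired: int


def capacity_at(now, actions, default):
    """Return the instance count that a daily schedule implies at `now`."""
    if not actions:
        return default
    ordered = sorted(actions, key=lambda a: a.start)
    # Before today's first action, yesterday's last action still applies.
    current = ordered[-1]
    for action in ordered:
        if action.start <= now.time():
            current = action
    return current.desired


# Scale up at 21:00 every day, scale back down at 02:00.
schedule = [ScheduledAction(time(21, 0), 10), ScheduledAction(time(2, 0), 4)]
print(capacity_at(datetime(2024, 1, 1, 22, 30), schedule, default=4))  # prints 10
```

A real schedule would also need time zones, one-off actions and recurrence rules, which this sketch leaves out.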

Fault Tolerance

AS automatically checks the health of your instances. When it detects a faulty instance, it creates a healthy instance to replace it. This ensures that your application keeps the computing capacity you expect and that your business runs smoothly.
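The replacement logic described above can be sketched as a small reconciliation step. This is an illustration under stated assumptions, not the actual AS implementation: `reconcile`, the instance IDs and the boolean health map are all hypothetical.

```python
def reconcile(desired, health):
    """Plan replacements for faulty instances.

    `health` maps a (hypothetical) instance ID to True if the last health
    check passed. Returns the faulty IDs to remove and the number of new
    instances to create so that `desired` healthy instances remain.
    """
    healthy = [iid for iid, ok in health.items() if ok]
    faulty = [iid for iid, ok in health.items() if not ok]
    to_launch = max(0, desired - len(healthy))
    return faulty, to_launch


faulty, to_launch = reconcile(3, {"ins-a": True, "ins-b": False, "ins-c": True})
print(faulty, to_launch)  # prints ['ins-b'] 1
```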

Easy to Audit

AS records the trigger conditions, time, instances involved, and success or failure reasons for each scaling event. Visualized tracking, query interfaces and SMS notifications help you locate root causes quickly and resolve issues promptly.


Tencent Cloud AS can create a scaling policy based on your business needs to manage the CVM computing resources automatically for high efficiency, low costs and timely fault tolerance.

Alarm-triggered Scaling

To adjust your business deployment based on CVM metrics, you can define a custom alarm policy. When a metric (e.g., CPU utilization, memory utilization, private network inbound or outbound bandwidth, public network inbound or outbound bandwidth) reaches its threshold because of business load, the policy automatically increases or decreases the number of CVM instances to match the load, improving device utilization and reducing deployment and instance costs. Metrics are monitored on a 1-minute cycle.
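The threshold logic of an alarm policy can be sketched roughly as follows. This is a minimal illustration, assuming one evaluation per monitoring cycle and a group with minimum and maximum sizes; `AlarmPolicy`, `desired_instance_count` and the metric key `cpu_utilization` are hypothetical names, not the AS API.

```python
from dataclasses import dataclass


@dataclass
class AlarmPolicy:
    """Hypothetical alarm policy; field names are illustrative only."""
    metric: str        # e.g. "cpu_utilization" (assumed metric key)
    comparison: str    # ">" triggers above the threshold, "<" below it
    threshold: float   # percentage or bandwidth value, depending on the metric
    adjustment: int    # instances to add (positive) or remove (negative)


def is_triggered(policy, sample):
    """Check one policy against the metric values from the latest cycle."""
    value = sample.get(policy.metric)
    if value is None:
        return False
    if policy.comparison == ">":
        return value > policy.threshold
    return value < policy.threshold


def desired_instance_count(current, sample, policies, min_size, max_size):
    """Apply every triggered policy, then clamp to the group's size limits."""
    target = current
    for policy in policies:
        if is_triggered(policy, sample):
            target += policy.adjustment
    return max(min_size, min(max_size, target))


policies = [
    AlarmPolicy("cpu_utilization", ">", 80.0, +2),  # add two instances under load
    AlarmPolicy("cpu_utilization", "<", 20.0, -1),  # remove one when idle
]
print(desired_instance_count(4, {"cpu_utilization": 91.5}, policies, 2, 10))  # prints 6
```

Clamping to the minimum and maximum sizes keeps a noisy metric from shrinking the group to zero or growing it without bound.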


Logic-Layer CVM Scaling for Flexible Web Service

Business type

E-commerce websites, video websites, online education, etc.


Client requests reach the application CVMs through CLB. When traffic changes rapidly, AS scales the number of instances up or down based on the number of requests.

High Performance Computing Cluster Deployment

Business type

Backend computing clusters, such as distributed big data computing nodes and data indexing servers.


The number of CVMs within the cluster is scaled up or down in real time according to the amount of computation.

Requesting CVM Deployment

Business type

CVM clusters that send requests or collect data


Because this kind of business has stricter timeliness requirements, AS can be used to create and remove requesting CVMs promptly.


Auto Scaling (AS) is free of charge. However, fees for related products, such as pay-as-you-go Cloud Virtual Machine (CVM) instances, may apply.