As you start adapting more and more to your cloud journey and understanding the capability of Auto Scaling, dynamic scaling is what enables you to respond to the dynamic changes in the load that is being placed on your application. In the diagram below, we have four EC2 instances running and there is a metric threshold defined for CPU utilization to Scale-Out the fleet based on the average load on the four EC2 instances. For example, you might want to add another EC2 instance to the fleet if average CPU utilization goes beyond 80%, or remove an instance when average CPU utilization is below 50%. The fleet size will vary depending on the load on your application and you will be charged only for the duration when you add additional instances.
