Author: Tianle Yuan

Elastic Load Balancing and Auto Scaling

What is the demand?

How can we make sure that our applications have enough capacity and enough EC2 instances available?
How can we distribute incoming connections to those EC2 instances?

How?

Auto Scaling helps to make sure that you have the right number of EC2 instances to service the demand of your application.
Then put Elastic Load Balancing (ELB) in front of your application. ELB distributes incoming connections across the pool of instances managed by the Auto Scaling group.

Together, these two services let you build elastic, fault-tolerant applications.
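To make the idea of distributing connections across a pool concrete, here is a minimal sketch in plain Python. It assumes a simple round-robin policy; the routing algorithm a real load balancer uses depends on its type and configuration. The instance IDs, the connection names, and the `distribute` helper are hypothetical and exist only for illustration.

```python
from collections import Counter
from itertools import cycle


def distribute(connections, instances):
    """Assign each connection to an instance in round-robin order."""
    if not instances:
        raise ValueError("the instance pool is empty")
    pool = cycle(instances)
    return {conn: next(pool) for conn in connections}


# Hypothetical instance IDs and connection names, for illustration only.
instances = ["i-aaa", "i-bbb", "i-ccc"]
connections = [f"conn-{n}" for n in range(7)]

assignment = distribute(connections, instances)
per_instance = Counter(assignment.values())
print(per_instance)  # i-aaa: 3, i-bbb: 2, i-ccc: 2
```

If the Auto Scaling group adds or removes an instance, calling `distribute` again with the updated list spreads new connections over the new pool.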

Elasticity: Scaling Up vs. Scaling Out

Assume that we have an EC2 instance for our application now:

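Scaling Up (Vertical Scaling) means moving the application to a larger instance with more CPU and memory:

Scaling Out (Horizontal Scaling) means adding more instances of the same kind to share the load: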
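The difference between the two approaches can also be put in numbers. The sketch below is a toy model: the instance sizes and their requests-per-second capacities are made-up values, not AWS specifications, and `scale_up` and `scale_out` are hypothetical helper names.

```python
import math

# Hypothetical instance sizes and the requests/second each can serve.
# These numbers are invented for illustration, not AWS specifications.
SIZES = {"small": 100, "medium": 200, "large": 400, "xlarge": 800}


def scale_up(demand):
    """Vertical scaling: pick the smallest single instance that meets demand."""
    for name, capacity in sorted(SIZES.items(), key=lambda kv: kv[1]):
        if capacity >= demand:
            return name
    return None  # no single instance is big enough


def scale_out(demand, size="small"):
    """Horizontal scaling: count how many instances of one size meet demand."""
    return math.ceil(demand / SIZES[size])


print(scale_up(350))    # large
print(scale_out(350))   # 4
print(scale_up(1000))   # None
print(scale_out(1000))  # 10
```

In this model, scaling up stops working once demand exceeds the largest available size, while scaling out keeps working by adding instances.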

Amazon EC2 Auto Scaling

Amazon EC2 Auto Scaling scales horizontally (scales out): it dynamically launches and terminates instances. It responds to the information reported by CloudWatch metrics and EC2 status checks. The pipeline that provides this elasticity and scalability is shown below:
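As a rough illustration of how metrics and status checks can drive launch and terminate decisions, here is a simplified simulation in plain Python. It is not the algorithm AWS uses: the proportional rule in `desired_capacity` is an assumption chosen for clarity, and the `Instance` class, the function names, and the instance IDs are all hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Instance:
    instance_id: str
    healthy: bool       # stands in for an EC2 status check result
    cpu_percent: float  # stands in for a CloudWatch CPU metric


def desired_capacity(instances, target_cpu, min_size, max_size):
    """Estimate the instance count that brings average CPU near target_cpu."""
    healthy = [i for i in instances if i.healthy]
    if not healthy:
        return min_size
    avg_cpu = sum(i.cpu_percent for i in healthy) / len(healthy)
    # Assumed proportional rule: scale the fleet by (observed / target).
    wanted = math.ceil(len(healthy) * avg_cpu / target_cpu)
    return max(min_size, min(max_size, wanted))


def reconcile(instances, desired):
    """Return (number to launch, IDs to terminate) to reach the desired count."""
    healthy = [i for i in instances if i.healthy]
    unhealthy = [i for i in instances if not i.healthy]
    surplus = healthy[desired:]  # empty when there is no excess capacity
    launch = max(0, desired - len(healthy))
    terminate = [i.instance_id for i in unhealthy + surplus]
    return launch, terminate


# Scale out: two busy healthy instances and one failed instance.
busy = [
    Instance("i-aaa", True, 80.0),
    Instance("i-bbb", True, 90.0),
    Instance("i-ccc", False, 0.0),
]
want = desired_capacity(busy, target_cpu=50.0, min_size=1, max_size=6)
print(want, reconcile(busy, want))  # 4 (2, ['i-ccc'])

# Scale in: four idle healthy instances.
idle = [Instance(f"i-{n}", True, 20.0) for n in range(4)]
want_idle = desired_capacity(idle, target_cpu=50.0, min_size=1, max_size=6)
print(want_idle, reconcile(idle, want_idle))  # 2 (0, ['i-2', 'i-3'])
```

The `min_size` and `max_size` bounds mirror the idea that a group never shrinks below or grows beyond configured limits, whatever the metric says.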