Auto Scaling Groups

The goal of an Auto Scaling Group (ASG) is to:
- Scale out (add EC2 instances) to match an increasing load
- Scale in (remove EC2 instances) to mach a decreasing load
- Ensure we have a minimum and a maximum number of machines running
- Automatically register new instances to LB
ASG attributes:
- A launch configuration:
  - AMI + Instance Type
  - EC2 User Data
  - EBS Volumes
  - Security Groups
  - SSH Key Pair
- Min size/max size/initial capacity
- Network + Subnet information
- Load balancer information
- Scaling Policies

Launch Templates vs Launch Configurations

Both allow to specify the AMI, the instance type, a key par, security groups and the other parameters that we use to launch EC2 instances (tags, user-data, etc.)
Launch Configurations are considered to be legacy:
- They must be recreated every time there are updated
Launch Templates:
- They can have multiple versions
- They allow parameter subsets used for partial configuration for re-use and inheritance
- We can provision both On-Demand and Spot instances (or a mix of two)
- Optional: we can set an instance type, a key pair and a security group
- We can use the T2 unlimited burst feature
- Hierarchy: templates can have parents (source templates)
- Recommended by AWS
For ASG we can select between Launch Configurations and Launch Templates

ASG - Types of Scaling

Scheduled scaling:
- Scaling based on a schedule allows us to scale the application ahead of know load changes
Dynamic scaling:
- ASG enables us to follow the demand curve for our application closely, reducing the need to manually provision instances
- ASG can automatically adjust the number of EC2 instances as needed to maintain a target
Predictive scaling:
- ASG uses machine learning to schedule the right number of EC2 instances in anticipation of traffic changes

Scheduled Actions

Can be used if we can anticipate scaling based on known usage patterns
Example: increase the min capacity to 10 at 5 PM on Fridays
Occurrence can be once, every 5, 30, 60 minutes or a cron expression

Scaling Policies

Default cooldown: number of seconds after a scaling activity completes before another can begin (cooldown period). Default value is 300 seconds
Warm up period: number of seconds ASG has to wait until the metric of a new instance can be taken in consideration for further ASG action
Target Tracking Scaling
- Most simple and easy to setup
- Example: we want the average ASG CPU to stay around 40%
Simple/Step Scaling
- Requires the presence of a CloudWatch alarm
- Example:
  - When a CloudWatch alarm is triggered (example average CPU > 70%), then add 2 units
  - When a CloudWatch alarm is triggered (example average CPU < 30%), then remove 1 unit
Scheduled Actions
- Can be used if we can anticipate scaling based on known usage patterns
- Example: increase the min capacity to 10 at 5 PM on Fridays

ALB Integration

Slow start duration: in target group we can set a duration period during which the number of requests will be gradually increased to the new instance

ALB Troubleshooting - Suspend Processes

Reference https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-suspend-resume-processes.html
Administrative suspension: most commonly applies to Auto Scaling groups that have been trying to launch instances for over 24 hours but have not succeeded in launching any instances
Detach an instance:
- The instance will be removed from the ASG
- The ASG will replace the instance with a new one and will register it to the ELB
Standby mode:
- The instance will be removed from the ELB
- The load will be increased on other instances, no new instance will be created
Scale in protection: this can be set one a specific instance. If a scale in action happens, instances protected by scale in protection wont be terminated

ASG Lifecycle Hooks

Reference: https://docs.aws.amazon.com/autoscaling/ec2/userguide/lifecycle-hooks.html
We can add lifecycle hooks to an ASG
These hooks enable ASG to be aware of events during scaling and perform custom actions when events happens
Use cases:
- We can run a script to download and install software when a scale-out event occurs
- When a scale-in event happens, we can send a notification to EventBridge to execute a Lambda in order to download logs from the instance
Transitions between ASG states:

Complete lifecycle action:

  aws autoscaling complete-lifecycle-action --lifecycle-action-result CONTINUE --lifecycle-hook-name LaunchHook --auto-scaling-group-name demo-asg --instance-id i-xxxx -region <region> --profile <profile>

ASG Termination Policies

Reference: https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-instance-termination.html
With termination policies we can control which instances are terminated first in case of a scale-in event
Default termination policy: detects which AZ has the most instances with at least on instance which does not have termination protection. Within an AZ the default termination policy behavior is the following:
1. Determine which instances to terminate first based on allocation strategy in case of mixed instance types (on-demand, spot)
2. Determine whether any of the instances use the oldest launch template/launch configuration
3. If there are multiple instances with the latest launch configuration, terminate the one with is the closes to the next billing hour. If there are multiple of this, terminate on at random
Custom termination policies:
- Default: (presented above)
- AllocationStrategy: terminate instances to align the remaining ones to the allocation strategy
- OldestLaunchTemplate: terminate instances which use an older launch configuration
- ClosestToNextInstanceHour
- NewestInstance
- OldestInstance

ASG Integration with SQS

Reference: https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-using-sqs-queue.html
ApproximateNumberOfMessagesVisible: the number of messages in the queue
We can create a custom metric which takes in consideration the number of messages in the queue, the number of currently running instances in the ASG and the processing time for a message from the queue. We can compute the number of instances required based on the accepted latency, example:
```
  ApproximateNumberOfMessages = 1500
  running capacity = 10
  processing time per message = 0.1 seconds
  acceptable latency = 10 seconds

  1500 / 10 = 150 * 0.1 = 15 - we need 15 instances to process the messages
```
Scale In protection: we should protect our instances from scale in, in case there are processing a message, we would not want the instance to be terminated

ASG ASG CloudFormation Creation Policy

CreationPolicy: we can assign a creation policy in order to notify CloudFormation if the instances from an ASG were created successfully
We can attach a timeout to the creation policy

ASG CloudFormation Update Policy

UpdatePolicy attribute: specified how CloudFormation handles updates:
- AutoScalingReplacingUpdate: specify whether CloudFormation replaces an Auto Scaling group with a new one or replaces only the instances in the Auto Scaling group
  - WillReplace: specifies whether an Auto Scaling group and the instances it contains are replaced during an update. During replacement, CloudFormation retains the old group until it finishes creating the new one. If the update fails, CloudFormation can roll back to the old Auto Scaling group and delete the new Auto Scaling group
- AutoScalingRollingUpdate: rolling updates enable us to specify whether AWS CloudFormation updates instances that are in an Auto Scaling group in batches or all at once
- If both the AutoScalingReplacingUpdate and AutoScalingRollingUpdate policies are specified, setting the WillReplace property to true gives AutoScalingReplacingUpdate precedence
- AutoScalingScheduledAction: prevent scheduled actions from modifying min/max/desired capacity for CloudFormation

ASG CodeDeploy Integration

CodeDeploy deployment to ASG: Whenever a new instance from an ASG is coming up, CodeDeploy will automatically deploy the application to it
Scale-up events during deployment: if a scaling event happens during a deployment, the created instances will have the most recently deployed revision, not the currently deploying revision of the application
Solution for this issue:
- Redeploy the application
- Suspend Launch process during deployment Reference: https://docs.aws.amazon.com/codedeploy/latest/userguide/integrations-aws-auto-scaling.html

ASG Deployment Strategies

In place (one LB, one target group one ASG): instance state is mutate
Rolling (one LB, one TG, one ASG, new instances): new instances are created with the newer version
Replace (one LB, one TG, two ASG, new instances): new ASG is created with a new target group and new instances
Blue/Green (tow LB, two TG, two ASG, new instances, R53)

AWS Solutions Architect - Associate

Auto Scaling Groups

If you are studying for AWS Devops Engineer Professional Exam, this guide will help you with quick revision before the exam. it can use as study notes for your preparation.

Auto Scaling Groups

Launch Templates vs Launch Configurations

ASG - Types of Scaling

Scheduled Actions

Scaling Policies

ALB Integration

ALB Troubleshooting - Suspend Processes

ASG Lifecycle Hooks

ASG Termination Policies

ASG Integration with SQS

ASG ASG CloudFormation Creation Policy

ASG CloudFormation Update Policy

ASG CodeDeploy Integration

ASG Deployment Strategies