Understanding Scaling
Scaling is the process of adding or removing resources to match the changing demands of your application. As your application grows, you will need to add more resources to handle the increased load. And as the load decreases, you can remove the extra resources to save costs.
Terraform makes it easy to scale your infrastructure by providing a declarative way to define your resources. You can define the number of resources you need and Terraform will automatically create or destroy the resources as needed.
Pre-requisites
Create a terraform.tf file declaring the providers to be installed in the system according to the resources required.
Create a provider.tf file to declare the region.
Create a vpc.tf file to create VPC for the instance.
Create two public subnets that need to be further initialised to the auto-scaling group as a multi-subnet initialization. Therefore create a subnet.tf terraform file.
Create internetgateway.tf file and declare the configurations.
Create routetable.tf file to define all the configurations for routing.
Create a security group to define and allow SSH, HTTP and egress traffic to the instance. Therefore create a securitygroup.tf file
Task 1: Create an Auto Scaling Group
Auto Scaling Groups are used to automatically add or remove EC2 instances based on the current demand. Follow these steps to create an Auto Scaling Group:
COPY
resource "aws_launch_configuration" "web_server_as" {
image_id = "ami-005f9685cb30f234b"
instance_type = "t2.micro"
security_groups = [aws_security_group.web_server.name]
user_data = <<-EOF
#!/bin/bash
echo "<html><body><h1>You're doing really Great</h1></body></html>" > index.html
nohup python -m SimpleHTTPServer 80 &
EOF
}
resource "aws_autoscaling_group" "web_server_asg" {
name = "web-server-asg"
launch_configuration = aws_launch_configuration.web_server_lc.name
min_size = 1
max_size = 3
desired_capacity = 2
health_check_type = "EC2"
load_balancers = [aws_elb.web_server_lb.name]
vpc_zone_identifier = [aws_subnet.public_subnet_1a.id, aws_subnet.public_subnet_1b.id]
}
Create a ec2_autoscalling.tf file while contains EC2 configuration along with auto-scaling configurations.
Run terraform apply to create the Auto Scaling Group and all of its dependent resources through terraform as mentioned in the pre-requisites.
Let's navigate to AWS management console to check the created auto-scaling group.
In the auto-scaling terraform configuration we set the desired capacity to 2. Therefore there must be two instances created.
Task 2: Test Scaling
Go to the AWS Management Console and select the Auto Scaling Groups service.
Select the Auto Scaling Group you just created and click on the "Edit" button.
Increase the "Desired Capacity" to 3 and click on the "Save" button.
Wait a few minutes for the new instances to be launched.
Go to the EC2 Instances service and verify that the new instances have been launched.
Decrease the "Desired Capacity" to 1 and wait a few minutes for the extra instances to be terminated.
Go to the EC2 Instances service and verify that the extra instances have been terminated.