App Scaling
Learn about scaling Apps' CPU, RAM, and containers, manually or automatically
Overview
Aptible Apps are scaled at the Service level, meaning each App Service is scaled independently.
App Services can be scaled by adding more CPU/RAM (vertical scaling) or by adding more containers (horizontal scaling). App Services can be scaled manually via the CLI or UI, automatically with Autoscaling, or programmatically with Terraform.
Apps scaled to two or more containers are deployed in a high-availability configuration, ensuring redundancy across different Availability Zones.
When Apps are scaled, a new set of containers will be launched to replace the existing ones for each of your App’s Services.
High-availability Apps
Apps scaled to 2 or more Containers are automatically deployed in a high-availability configuration, with Containers deployed in separate AWS Availability Zones.
Horizontal Scaling
Scale Apps horizontally by adding more Containers to a given Service. Each App Service can scale up to 32 Containers.
Manual Horizontal Scaling
App Services can be manually scaled via the Dashboard or the aptible apps:scale CLI command. Example:
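A minimal sketch, assuming an App named `my-app` with a Service named `web` (the App and Service names are illustrative):

```shell
# Scale the "web" Service of "my-app" to 3 containers
aptible apps:scale web --app my-app --container-count 3
```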
Horizontal Autoscaling
When Horizontal Autoscaling is enabled on a Service, the autoscaler evaluates the Service every 5 minutes and generates scaling adjustments based on CPU usage (as a percentage of total cores). Data is analyzed over a 30-minute lookback period, with post-scaling cooldowns of 5 minutes for scaling down and 1 minute for scaling up. After any release, an additional 5-minute cooldown applies. Metrics are evaluated at the 99th percentile, aggregated across all of the Service's containers over the past 30 minutes.
This feature can also be configured via Terraform or the aptible services:autoscaling_policy:set CLI command.
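A sketch of what a CLI invocation might look like; the flag names here are illustrative assumptions, so run `aptible services:autoscaling_policy:set --help` for the authoritative list:

```shell
# Hypothetical flags shown for illustration; verify against --help
aptible services:autoscaling_policy:set web --app my-app \
  --autoscaling-type horizontal \
  --min-containers 2 \
  --max-containers 6
```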
By default, a Horizontal Autoscaling Operation follows the regular Container Lifecycle and Releases pattern of restarting all current containers when modifying the number of running containers. However, this behavior can be disabled by enabling the Restart Free Scaling setting (use_horizontal_scale in Terraform) when configuring autoscaling for the service. With restart free scaling enabled, containers are added and removed without restarting the existing ones. When removing containers in this configuration, the service's stop timeout is still respected. Note that if the service has a TCP, ELB, or gRPC endpoint, the regular full restart will still occur even with restart free scaling enabled.
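A sketch of how this might be expressed in Terraform; apart from `use_horizontal_scale`, which the text names, the resource and attribute names below are assumptions, so consult the Aptible Terraform provider documentation for the exact schema:

```hcl
resource "aptible_app" "my_app" {
  env_id = 123        # placeholder environment ID
  handle = "my-app"

  service {
    process_type    = "web"
    container_count = 2

    # Hypothetical autoscaling block; attribute names are assumptions
    autoscaling_policy {
      autoscaling_type     = "horizontal"
      min_containers       = 2
      max_containers       = 6
      use_horizontal_scale = true # restart-free scaling
    }
  }
}
```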
Guide for Configuring Horizontal Autoscaling
Configuration Options
Container & CPU Threshold Settings
The following container & CPU threshold settings are available for configuration:
- Percentile: Determines the percentile for evaluating RAM and CPU usage.
- Minimum Container Count: Sets the lowest container count to which the service can be scaled down by Autoscaler.
- Maximum Container Count: Sets the highest container count to which the service can be scaled up by Autoscaler.
- Scale Up Steps: Sets the amount of containers to add when autoscaling (ex: a value of 2 will go from 1->3->5). Container count will never exceed the configured maximum.
- Scale Down Steps: Sets the amount of containers to remove when autoscaling (ex: a value of 2 will go from 4->2->1). Container count will never exceed the configured minimum.
- Scale Down Threshold (CPU Usage): Specifies the CPU usage percentage at or below which a down-scaling action is triggered.
- Scale Up Threshold (CPU Usage): Specifies the CPU usage percentage at or above which an up-scaling action is triggered.
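The step and bound settings above combine in a simple way; the following is a minimal sketch in Python of that arithmetic (the function name and logic are illustrative, not Aptible's implementation):

```python
def next_container_count(current, direction, scale_up_steps=1,
                         scale_down_steps=1, min_containers=1,
                         max_containers=32):
    """Return the next container count, clamped to the configured bounds."""
    if direction == "up":
        return min(current + scale_up_steps, max_containers)
    return max(current - scale_down_steps, min_containers)

# Scale Up Steps = 2: container count moves 1 -> 3 -> 5
print(next_container_count(1, "up", scale_up_steps=2))      # 3
print(next_container_count(3, "up", scale_up_steps=2))      # 5
# Scale Down Steps = 2 with a minimum of 1: 4 -> 2 -> 1
print(next_container_count(2, "down", scale_down_steps=2))  # 1
```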
Time-Based Settings
The following time-based settings are available for configuration:
- Metrics Lookback Time Interval: The duration in seconds for retrieving past performance metrics.
- Post Scale Up Cooldown: The waiting period in seconds after an automated scale-up before another scaling action can be considered. The period of time the service is on cooldown is still considered in the metrics for the next potential scale.
- Post Scale Down Cooldown: The waiting period in seconds after an automated scale-down before another scaling action can be considered. The period of time the service is on cooldown is still considered in the metrics for the next potential scale.
- Post Release Cooldown: The time in seconds ignored following any general scaling operation, allowing stabilization before additional scaling changes are considered. Unlike the other cooldowns, this period is not considered in the metrics for the next potential scale.
General Settings
The following general settings are available for configuration:
- Restart Free Scaling: When enabled, scale operations for modifying the number of running containers will not restart the other containers in the service.
Vertical Scaling
Scale Apps vertically by changing the size of Containers, i.e., changing their Memory Limits and CPU Limits. The available sizes are determined by the Container Profile.
Manual Vertical Scaling
App Services can be manually scaled via the Dashboard or the aptible apps:scale CLI command. Example:
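A minimal sketch, assuming an App named `my-app` with a Service named `web` (the App and Service names are illustrative):

```shell
# Scale the "web" Service of "my-app" to 2 GB of RAM per container
aptible apps:scale web --app my-app --container-size 2048
```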
Vertical Autoscaling
When Vertical Autoscaling is enabled on a Service, the autoscaler likewise evaluates the Service every 5 minutes and generates scaling recommendations based on:
- The ratio of RSS usage (in GB) to CPU (in cores)
- RSS usage levels
Data is analyzed over a 30-minute lookback period. Post-scaling cooldowns are 5 minutes for scaling down and 1 minute for scaling up. An additional 5-minute cooldown applies after a service release. Metrics are evaluated at the 99th percentile, aggregated across all of the Service's containers over the past 30 minutes.
This feature can also be configured via Terraform or the aptible services:autoscaling_policy:set CLI command.
Configuration Options
RAM & CPU Threshold Settings
The following RAM & CPU Threshold settings are available for configuration:
- Percentile: Determines the percentile for evaluating RAM and CPU usage.
- Minimum Memory (MB): Sets the lowest memory limit to which the service can be scaled down by Autoscaler.
- Maximum Memory (MB): Defines the upper memory threshold, capping the maximum memory allocation possible through Autoscaler. If blank, the container can scale to the largest size available.
- Memory Scale Up Percentage: Specifies the percentage of the current memory limit at which the service’s memory usage triggers an up-scaling action.
- Memory Scale Down Percentage: Determines the percentage of the next lower memory limit that, when reached or exceeded by memory usage, initiates a down-scaling action.
- Memory Optimized Memory/CPU Ratio Threshold: Sets the Memory (in GB) to CPU (in CPUs) ratio above which the service is shifted to an R (Memory Optimized) profile.
- Compute Optimized Memory/CPU Ratio Threshold: Sets the Memory-to-CPU ratio threshold, below which the service is transitioned to a C (Compute Optimized) profile.
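The scale-up and profile-ratio checks above can be sketched in Python; the function names, threshold values, and profile labels below are illustrative, not Aptible's actual defaults or implementation:

```python
def should_scale_up_memory(rss_mb, limit_mb, scale_up_pct):
    """True when RSS has reached the scale-up percentage of the memory limit."""
    return rss_mb >= limit_mb * scale_up_pct / 100

def pick_profile(memory_gb, cpus, r_threshold, c_threshold):
    """Pick a container profile from the Memory (GB) to CPU (cores) ratio."""
    ratio = memory_gb / cpus
    if ratio > r_threshold:
        return "R (Memory Optimized)"
    if ratio < c_threshold:
        return "C (Compute Optimized)"
    return "General Purpose"

# 1900 MB RSS against a 2048 MB limit with a 90% scale-up threshold
print(should_scale_up_memory(1900, 2048, 90))  # True
# 8 GB / 1 CPU with an illustrative R threshold of 6 GB per CPU
print(pick_profile(8, 1, 6, 2))  # R (Memory Optimized)
```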
Time-Based Settings
The following time-based settings are available for configuration:
- Metrics Lookback Time Interval: The duration in seconds for retrieving past performance metrics.
- Post Scale Up Cooldown: The waiting period in seconds after an automated scale-up before another scaling action can be considered. The period of time the service is on cooldown is still considered in the metrics for the next potential scale.
- Post Scale Down Cooldown: The waiting period in seconds after an automated scale-down before another scaling action can be considered. The period of time the service is on cooldown is still considered in the metrics for the next potential scale.
- Post Release Cooldown: The time in seconds ignored following any general scaling operation, allowing stabilization before additional scaling changes are considered. Unlike the other cooldowns, this period is not considered in the metrics for the next potential scale.
FAQ
How do I scale my apps and services?
See our guide: How to scale apps and services