What You Need to Know About Kubernetes Autoscaling

by Vivek Singh

This page contains highlights I saved while reading What You Need to Know About Kubernetes Autoscaling by Vivek Singh. These quotes were collected using Readwise.

Highlights

Cluster Autoscaler

CA adjusts the size of a Kubernetes cluster automatically by adding or removing compute nodes based on the demand for resources by the workloads running on the cluster. Its goal is to keep a sufficient amount of resources running and to keep idle resources to a minimum.

Permalink to this highlight

Vertical Pod Autoscaler

VPA automatically adjusts container resource requests and limits within a pod based on the containers' actual resource utilization.

Permalink to this highlight

Because HPA relies on metrics such as CPU and memory usage to determine when to scale pods, there may be a delay between the time demand increases and the time additional pods are up to meet it. This delay can potentially slow down response times, temporarily reducing performance for end users.

Permalink to this highlight

Horizontal Pod Autoscaler

HPA adjusts the number of replicas of a deployment or replica set based on pod resource utilization. HorizontalPodAutoscaler automatically changes the number of replicas in response to changes in traffic or demand.

Permalink to this highlight

If your application experiences sudden spikes in traffic, HPA may be the better choice. If it has stable traffic but varying resource requirements, you should probably go with VPA.

Permalink to this highlight

VPA can also increase overhead and complexity in a Kubernetes cluster, since it continuously monitors and adjusts pod resource allocation based on current utilization.

Permalink to this highlight

Want more like this? See all articles or get a random quote.