gke/Cluster Autoscaler

Analyses logs in the project where the cluster is running.

Product: Google Kubernetes Engine Kind: Debugging Tree

Description

If there are log entries that contain messages listed in the public documentation https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-autoscaler-visibility#messages then provide details on how each particular issue can be solved.

The following ScaleUP logs messages are covered:

  • scale.up.error.out.of.resources
  • scale.up.error.quota.exceeded
  • scale.up.error.waiting.for.instances.timeout
  • scale.up.error.ip.space.exhausted
  • scale.up.error.service.account.deleted

The following ScaleDown logs messages are covered:

  • scale.down.error.failed.to.evict.pods
  • no.scale.down.node.node.group.min.size.reached

Executing this runbook

gcpdiag runbook gke/cluster-autoscaler \
  -p project_id=value \
  -p name=value \
  -p location=value

Parameters

Name Required Default Type Help
project_id True None str The ID of the project hosting the GKE Cluster
name False None str (Optional) The name of the GKE cluster, to limit search only for this cluster
location False None str The zone or region of the GKE cluster

Get help on available commands

gcpdiag runbook --help

Potential Steps