gce/Vm Termination

GCE VM shutdowns and reboots Root Cause Analysis (RCA)

Product: Compute Engine Kind: Debugging Tree

Description

This runbook is designed to assist you in investigating and understanding the underlying reasons behind the unexpected termination or reboot of your GCE Virtual Machines (VMs) within Google Cloud Platform.

Key Investigation Areas:

System Event-Triggered Shutdowns/Reboots: Identifies terminations initiated by internal Google Cloud systems due to system maintenance events, normal hardware failures, resource constraints.

System Admin Activities-Triggered Shutdowns/Reboots: Investigates terminations caused by direct actions, such as API calls made by users or service accounts. These actions may include manual shutdowns, restarts, or automated processes impacting VM states.

RCA Text Generation: Provides a detailed Root Cause Analysis text, outlining the identified cause of termination, the involved systems or activities, and recommendations to prevent future occurrences where applicable.

Executing this runbook

gcpdiag runbook gce/vm-termination \
  -p project_id=value \
  -p name=value \
  -p id=value \
  -p zone=value \
  -p start_time_utc=value \
  -p end_time_utc=value

Parameters

Name Required Default Type Help
project_id True None str The Project ID associated with the terminated VM.For investigations covering multiple VMs, provide only the Project ID.
name True None str The name of the terminated VM. Or provide the id i.e -p id=
id False None int The instance ID of the terminated VM. Or provide name instead i.e -p name=
zone True None str The Google Cloud zone where the terminated VM is located.
start_time_utc False None datetime The start window to investigate vm termination. Format: YYYY-MM-DDTHH:MM:SSZ
end_time_utc False None datetime The end window for the investigation. Format: YYYY-MM-DDTHH:MM:SSZ

Get help on available commands

gcpdiag runbook --help

Potential Steps