gke/Unallocatable Gpu

Checks whether a node was auto-repaired because of unallocatable GPU(s).

Product: Google Kubernetes Engine
Step Type: AUTOMATED STEP

Description

None

Failure Reason

The node {NODE} was auto-repaired because it had unallocatable GPU(s) for more than 15 minutes.

Failure Remediation

The auto-repair should have fixed the detected unallocatable GPU(s). For more details, see: https://cloud.google.com/kubernetes-engine/docs/how-to/node-auto-repair
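
To confirm that the repaired node exposes its GPUs again, one option is to compare the GPU count in the node's reported capacity against its allocatable count. The sketch below is illustrative only and is not part of this runbook step. It assumes that an unallocatable GPU shows up as an allocatable `nvidia.com/gpu` count lower than the capacity count in the node's status, and the helper name `unallocatable_gpus` is hypothetical.

```python
import json


def unallocatable_gpus(node: dict, resource: str = "nvidia.com/gpu") -> int:
    """Return how many GPUs the node reports in capacity but not in allocatable.

    `node` is a Kubernetes Node object parsed from JSON. Treating
    "capacity minus allocatable" as the unallocatable GPU count is an
    assumption made for this sketch, not the documented detection logic.
    """
    status = node.get("status", {})
    # Kubernetes reports resource quantities as strings, e.g. "4".
    capacity = int(status.get("capacity", {}).get(resource, "0"))
    allocatable = int(status.get("allocatable", {}).get(resource, "0"))
    return max(capacity - allocatable, 0)


# Hypothetical node status, shaped like the JSON that kubectl returns.
sample_node = json.loads("""
{
  "metadata": {"name": "example-gpu-node"},
  "status": {
    "capacity": {"cpu": "16", "nvidia.com/gpu": "4"},
    "allocatable": {"cpu": "15", "nvidia.com/gpu": "3"}
  }
}
""")

missing = unallocatable_gpus(sample_node)
print(f"{sample_node['metadata']['name']}: {missing} unallocatable GPU(s)")
```

The same function could be fed the output of `kubectl get node {NODE} -o json`. A result of 0 after the repair would suggest that all GPUs are allocatable again, under the assumption stated above.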

Success Reason

The node {NODE} was auto-repaired for reasons other than unallocatable GPU(s).