gke/Unallocatable GPU
Checks whether the node was auto-repaired because of unallocatable GPU(s)
Product: Google Kubernetes Engine
Step Type: AUTOMATED STEP
Description
None
Failure Reason
The node {NODE} was auto-repaired because it had unallocatable GPU(s) for more than 15 minutes.
Failure Remediation
The auto-repair should have fixed the detected unallocatable GPU(s). For more details, see: https://cloud.google.com/kubernetes-engine/docs/how-to/node-auto-repair
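To confirm that the repair took effect, one option is to compare the GPU capacity and allocatable counts that the node reports. The sketch below is illustrative only: it assumes the node object was exported with `kubectl get node {NODE} -o json` and that GPUs are exposed under the `nvidia.com/gpu` resource name. The helper name `unallocatable_gpus` is hypothetical, and treating "capacity minus allocatable" as the unallocatable count is an assumption, not necessarily the exact condition that GKE auto-repair evaluates.

```python
import json

# Assumption: NVIDIA GPUs are advertised under this extended resource name.
GPU_RESOURCE = "nvidia.com/gpu"


def unallocatable_gpus(node: dict) -> int:
    """Return GPU capacity minus allocatable GPUs for a Node object given as a dict.

    Hypothetical helper: a non-zero result suggests that some GPUs on the
    node cannot currently be allocated to pods.
    """
    status = node.get("status", {})
    capacity = int(status.get("capacity", {}).get(GPU_RESOURCE, 0))
    allocatable = int(status.get("allocatable", {}).get(GPU_RESOURCE, 0))
    return max(capacity - allocatable, 0)


if __name__ == "__main__":
    # Illustrative node document, not real cluster output.
    sample = json.loads(
        '{"status": {"capacity": {"nvidia.com/gpu": "8"},'
        ' "allocatable": {"nvidia.com/gpu": "6"}}}'
    )
    print(unallocatable_gpus(sample))
```

A result of 0 after the repair is consistent with all GPUs on the node being allocatable again. A node without GPUs also yields 0, so the check is only meaningful for GPU node pools.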
Success Reason
The node {NODE} was auto-repaired for reasons other than unallocatable GPU(s).