dataproc/ERR/2023_001

Dataproc cluster initialization completed by the end of the timeout period.

Product: Cloud Dataproc
Rule class: ERR - Something that is very likely to be wrong

Description

Dataproc cluster initialization action executable or script should be completed within the timeout value. If the initialization script has not completed, dataproc cancels the initialization action and fails with “Initialization action timed out” error. The default timeout value is 10 minutes.

Remediation

  • You can check the output logs of the initialization script for more details. You can find the initialization script output file location from the log error message ( log query given below). The output file location pattern will be like ‘gs:// /google-cloud-dataproc-metainfo///dataproc-initialization-script-0_output’.

    Log query example :

    resource.type=“cloud_dataproc_cluster”
    resource.labels.cluster_name=""
    “Initialization action timed out”
    logName: “projects//logs/google.dataproc.agent”

  • You can specify initialization timeout with –initialization-action-timeout flag to change timeout period if required.

Further information

Initialization action timeout