dataproc/WARN/2024_002

HDFS can write file(s) to DataNode(s).

Product: Cloud Dataproc
Rule class: WARN - Something that is possibly wrong

Description

HDFS had issues writing file(s) to DataNode(s). The HDFS NameNode logs show the following message:

java.io.IOException: File [...] could only be [written/replicated] to x nodes instead of minReplication (=n). There are y datanode(s) running and z node(s) are excluded in this operation.

This can be caused by different reasons:

  • The HDFS filesystem has no available space.

  • The DataNodes may be unavailable or there is no communication between NameNode and Datanode(s).

  • The reserved space for non-DFS use is not sufficient (dfs.datanode.du.reserved).

Remediation

  • Check the HDFS remaining capacity: dfs/FSNamesystem/CapacityRemainingGB.

    • If low, free up some HDFS space or add primary worker nodes to the cluster.
  • Check the status of NameNode and DataNodes services and the related logs.

    • Address the issue based on the findings, e.g. network related issues related to the communication between NameNode and Datanodes.