datafusion/ERR/2022_001

Firewall rules allow Data Fusion to communicate with Dataproc VMs.

Product: Cloud Data Fusion
Rule class: ERR - Something that is very likely to be wrong

Description

Cloud Data Fusion runs in a Google-managed tenant project. For Data Fusion versions below 6.2.0, and for private Data Fusion instances, the Google-managed resources in the tenant project must be able to communicate with the Dataproc VMs that run pipeline jobs in the client project.

Remediation

  • If your Cloud Data Fusion instance is private, create the allow-ssh firewall rule with a high priority (e.g., --priority=100; lower numbers take precedence) so that it overrides any conflicting firewall rules that may be blocking communication.

  • If your Cloud Data Fusion instance is public and its version is below 6.2.0, create the Default allow ssh firewall rule with a high priority (e.g., --priority=100; lower numbers take precedence) so that it overrides any conflicting firewall rules that may be blocking communication.
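
As a rough illustration of the remediation above, the following sketch builds a gcloud command that creates an ingress rule allowing SSH (tcp:22) with priority 100. The project ID, network name, rule name, and source range are hypothetical placeholders, not values from this rule; in particular, the source range must be replaced with the range your Data Fusion instance actually connects from. By default the script only prints the command; it runs gcloud only when APPLY=1 is set.

```shell
#!/bin/sh
# Sketch only: every value below is a placeholder (assumption), not a real setting.
PROJECT_ID="${PROJECT_ID:-my-project}"        # hypothetical project that owns the VPC
NETWORK="${NETWORK:-my-vpc-network}"          # hypothetical VPC used by the Dataproc VMs
SOURCE_RANGE="${SOURCE_RANGE:-10.0.0.0/22}"   # hypothetical range Data Fusion connects from
RULE_NAME="${RULE_NAME:-${NETWORK}-allow-ssh}"

create_allow_ssh_rule() {
  # Unless APPLY=1, prefix the command with "echo" so it is printed, not executed.
  run=echo
  if [ "${APPLY:-0}" = "1" ]; then run=; fi
  $run gcloud compute firewall-rules create "$RULE_NAME" \
    "--project=$PROJECT_ID" \
    "--network=$NETWORK" \
    --direction=INGRESS \
    --allow=tcp:22 \
    "--source-ranges=$SOURCE_RANGE" \
    --priority=100
}

create_allow_ssh_rule
```

Printing first makes it easy to review the rule before it is created. A priority of 100 sits well ahead of the default priority of 1000, so the rule is evaluated before most conflicting deny rules.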

Further information

  • The firewall version disclaimer is in the "Before you begin" section of "Creating a Cloud Data Fusion instance".