I am running a sqoop job on google cloud to import data from psql database after applying joins on 3 tables. However, sqoop job is getting failed with the below mentioned error, though it imports the data from the table within 15min but gets failed after running for 2hrs. I am able to extract other tables easily.
Data Size: 13GB
Query: gcloud compute ssh $INSTANCE_NAME --project=$PROJECT_ID --service-account=$ACCOUNT --command="""$SQOOP_HOME/bin/sqoop-import -D mapreduce.output.basename='$TABLE_EXPORT' --connect jdbc:$JDBC://$HOST_NAME:$PORT/$DATABASE --username $USERNAME --password '$PASSWORD' --target-dir $BUCKET_STORAGE -m $NUM_WORKERS --split-by $SPLIT_BY --query '$QUERY \$CONDITIONS ' --map-column-java $MAPPING_COLUMNS --fields-terminated-by '|' --null-string '' """ --zone=$ZONE 2>&1
Error:
20/06/12 22:14:36 INFO mapreduce.Job: map 0% reduce 0%
20/06/12 22:14:49 INFO mapreduce.Job: map 50% reduce 0%
20/06/12 22:14:50 INFO mapreduce.Job: map 75% reduce 0%
packet_write_wait: Connection to XX.XX.XXX.XXX port XX: Broken pipe
ERROR: (gcloud.compute.ssh) [/usr/bin/ssh] exited with return code [255].
Command exited with return code 1