I am trying to run a spark scala application on Amazon EMR cluster using Amazon Data-Pipeline. The step was added as follows in EMRActivity:
command-runner.jar,spark-submit,--deploy-mode,cluster,--class,com.demo.GettingStarted,s3://myBucket/sampleApps/HelloWorld.jar
After looking into the EMR logs, the job is failing consistently with following stack trace:
Exception in thread "main" org.apache.spark.SparkException: Application
application_1517065923932_0001 finished with failed status
at org.apache.spark.deploy.yarn.Client.run(Client.scala:1034)
at org.apache.spark.deploy.yarn.Client$.main(Client.scala:1081)
at org.apache.spark.deploy.yarn.Client.main(Client.scala)
What may be the possbile cause of this error?
This is a sample app that prints Hello World to console.
The same job works on AWS EMR.