1 vote

Here are my Spark cluster details: memory 29.3 GB and 10 cores.

[screenshot of the Spark master UI showing the cluster resources]

Now I run this job:

spark-submit --master spark://hadoop-master:7077 --executor-memory 1g --executor-cores 2 /home/hduser/ratings-counter.py

but when I click on the completed application, I see 5 executors listed.

How does Spark determine that it should run 5 executors?

[screenshot of the completed application page in the Spark UI]


2 Answers

2 votes

From the Spark configuration docs:

spark.executor.cores : The number of cores to use on each executor. In standalone and Mesos coarse-grained modes, setting this parameter allows an application to run multiple executors on the same worker, provided that there are enough cores on that worker. Otherwise, only one executor per application will run on each worker.

Since your worker has 10 cores and you set --executor-cores to 2, Spark spawns 10 / 2 = 5 executors for the application.
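If you want fewer executors on a standalone master, you can cap the total number of cores the application claims with --total-executor-cores; the executor count then follows from that cap. This is only a sketch of your command, and the cap of 4 cores is an arbitrary illustration (4 / 2 = at most 2 executors):

# Same job as in the question, but capped at 4 cores in total,
# so the master can start at most 4 / 2 = 2 executors.
spark-submit --master spark://hadoop-master:7077 \
  --executor-memory 1g \
  --executor-cores 2 \
  --total-executor-cores 4 \
  /home/hduser/ratings-counter.py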

0 votes

The issue explained here has to do with fine tuning. More information can be found at: http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-part-2/

To set the number of executors explicitly with --num-executors, you will need YARN to be turned on. The number of cores is the number of concurrent tasks an executor can run (when using HDFS it is advisable to keep this below 5). So for your example you could set --executor-cores to 3, not to 2 as in the comment above by @user1050619; the number of executors would then be 10 / 3 ≈ 3. To make sure this stays controlled, you can use --num-executors, as @user1050619 said in the comment. In the UI shown in your question the limit of executors is 5, so Spark will try to reach that if there is enough memory. One way to solve this is to use dynamic allocation, which allows for finer-grained control: the maximum number of executors can be set with spark.dynamicAllocation.maxExecutors, and the initial number of executors can likewise be set to 3 with spark.dynamicAllocation.initialExecutors, as sketched below.
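A minimal sketch of that dynamic-allocation setup, assuming the external shuffle service is enabled on the workers; the executor counts of 3 are just the illustrative values from above, not something your cluster requires:

# Illustrative values only; spark.shuffle.service.enabled must be true
# on every worker for dynamic allocation to release and request executors.
spark-submit --master spark://hadoop-master:7077 \
  --executor-memory 1g \
  --executor-cores 3 \
  --conf spark.dynamicAllocation.enabled=true \
  --conf spark.shuffle.service.enabled=true \
  --conf spark.dynamicAllocation.initialExecutors=3 \
  --conf spark.dynamicAllocation.maxExecutors=3 \
  /home/hduser/ratings-counter.py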