2 votes

I am using Hadoop 1.0.3 to run MapReduce jobs on a 3-node cluster. The problem is that I have set the property mapred.map.tasks to 20 in my /conf/mapred-site.xml, but Hadoop shows only 6 map tasks running when I run the job and check the cluster information on the JobTracker web UI at :50030. I have edited the above-mentioned configuration file on all the nodes in the cluster. Please help.
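For reference, this is roughly the entry I added to mapred-site.xml on each node (a minimal sketch; only the relevant property is shown):

<property>
    <name>mapred.map.tasks</name>
    <value>20</value>
</property>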

Regards, Mohsin

3 Comments
How big is the input data? If the input data is split into n splits, then Hadoop will run only n map tasks and not more. – Praveen Sripati
The number of input splits is 764. – sp3tsnaz
@PraveenSripati I want to set the number of parallel map tasks. I can see in my web console that the job has 764 map tasks, but only 6 map tasks are running. – sp3tsnaz

3 Answers

4 votes

As mentioned by miguno, Hadoop only considers the value of mapred.map.tasks as a hint.

That being said, when I was experimenting with MapReduce I was able to increase the map count by raising the maximum number of map slots. This might not work for you, but it is worth a shot.

<property>
    <name>mapred.tasktracker.map.tasks.maximum</name>
    <value>60</value>
</property>

NOTE: mapred.tasktracker.map.tasks.maximum is a per-TaskTracker setting, so it caps how many map tasks each node runs concurrently rather than the total for the job. On top of that slot limit, you still hint at the number of maps you want for the job with mapred.map.tasks, like so:

<property>
    <name>mapred.map.tasks</name>
    <value>20</value>
</property>
3 votes

This question seems to be a duplicate of Setting the number of map tasks and reduce tasks.

Hadoop does not honor mapred.map.tasks beyond considering it a hint.

See this information on the Hadoop wiki:

Actually controlling the number of maps is subtle. The mapred.map.tasks parameter is just a hint to the InputFormat for the number of maps. The default InputFormat behavior is to split the total number of bytes into the right number of fragments. However, in the default case the DFS block size of the input files is treated as an upper bound for input splits. A lower bound on the split size can be set via mapred.min.split.size. Thus, if you expect 10TB of input data and have 128MB DFS blocks, you'll end up with 82k maps, unless your mapred.map.tasks is even larger. Ultimately the InputFormat determines the number of maps.

That said, Hadoop does accept the user-specified mapred.reduce.tasks and doesn't manipulate it.

In summary, you cannot force mapred.map.tasks for a given MapReduce job, but you can force mapred.reduce.tasks.
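For example, a minimal sketch of pinning the reduce count in a job's configuration (the value of 10 is just an illustration):

<property>
    <name>mapred.reduce.tasks</name>
    <value>10</value>
</property>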

Edit: Going slightly beyond your direct question, there is a way to indirectly force Hadoop to use more mappers. This involves setting the combination of mapred.min.split.size, dfs.block.size and mapred.max.split.size appropriately. Note that the actual sizes of the input files also play a role here. See this answer for details, which basically quotes from Tom White's Hadoop: The Definitive Guide.
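For illustration, a rough sketch of lowering the maximum split size so that each DFS block is carved into several splits, and hence several mappers. The 32 MB value (33554432 bytes) is only an assumed example to tune against your data, and it only takes effect for jobs whose InputFormat honors this property:

<property>
    <name>mapred.max.split.size</name>
    <value>33554432</value>
</property>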

0 votes

It's primarily the input format that determines the number of map tasks. http://wiki.apache.org/hadoop/HowManyMapsAndReduces

To your question: by default, a TaskTracker runs two map tasks and two reduce tasks concurrently.
To change that, set the property mapred.tasktracker.map.tasks.maximum in /conf/mapred-site.xml. A commonly advised formula for this value is (CPUS > 2) ? (CPUS * 0.75) : 1.
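As a sketch, on a node with 4 cores that formula gives 3, so the entry in mapred-site.xml might look like the following (the value of 3 is an assumption derived from that formula, not a recommendation for your hardware):

<property>
    <name>mapred.tasktracker.map.tasks.maximum</name>
    <value>3</value>
</property>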