I have a small Hadoop/Hive cluster (6 nodes in total). Using "hadoop dfsadmin -report", I can see that all datanodes are working well and connected. Additionally, when I add data to a Hive table, I can see that the data is being distributed across the nodes (easy to check, as the disk space used increases).
I am trying to create some indexes on one table. In the JobTracker HTTP interface, I see only one node available. I tried running multiple queries (I use MySQL for the metadata), but they appear to run only on the node where Hive is installed.
Basically, my question is how to make the JobTracker utilize the other nodes as well.