I am trying to use R on AWS to connect to our cluster running Cloudera hadoop. Following the steps mentioned here - http://blog.cloudera.com/blog/2013/12/how-to-do-statistical-analysis-with-impala-and-r/
So far, I could initiate the jdbc driver but not able to connect to impala.
From some investigation, I can see that the impala daemon is running in all our worker nodes. And the ports are configured like this.
Also, I logged in to one of the worker node and checked the ports which are listening. I can see port 21050 listening, Here it is,
Here in rimpala connect, I am using public IP of the worker node. Still not able to connect to that. I can use the public IP and port 25000 to see impala web UI, but cannot connect to this port listening jdbc requests. Can anyone help me in this?


