2
votes

I followed the implementation of Apache-flink via: quick_start

I am not able to perform the last task i.e. 'Analyze the Result' because there is no result file inside the kmeans folder.

enter image description here

If you look into the above screenshot of flink JobManager, there you can see Status as FAILED for KMeans Example. And may be due to this failed status there is no result file inside the kmeans folder.

Now on clicking the KMeans Example, I get the following visualization:

enter image description here

And below is the screenshot of exceptions:

enter image description here

Could you please guide me what am I doing wrong.

1

1 Answers

4
votes

The problem is that the cluster has been started with a single TaskManager which has only a single slot and that you want to execute the KMeans job at the same time with a parallelism of 4.

In order to run the job with parallelism of 4, you have to increase the number of TaskManager of your cluster or the number of slots on each TaskManager. The latter can be set in the Flink configuration flink-conf.yaml with taskmanager.numberOfTaskSlots: 4. For the former, you can modify the conf/slaves file to add new machines for the additional TaskManager.

Alternatively, you can decrease the parallelism of your job to 1. You can control the parallelism with the command line option -p. E.g. bin/flink run -p 1 -c JobClass job.tar.