The 3 diagramms (i), (ii), (iii) show training sets having 2 numerical attributes (x and y axis) and a target attribute with two classes (circle and square).
I am now wondering how good the data mining algorithms (Nearest Neighbor, Naive Bayes and Decision Tree) solve each of the classification problems.
I suppose that the Naive Bayes (with the naive assumption that the attributes are uncorrelated) solves the second problem better than (i) and (iii) because here the numerical attributes tend to be more independent from each other.