I have difficulties to understand how to measure precision and recall for multi class clustering. Here is an example with 9 elements:
considering the following ground truth:
A,B,C,D
E,F,G
H,I
and the following observed clustering:
A,B,C
D
E,F,G,H,I
how should I calculate the number of true positives (TP), false positives (FP) and false negatives (FN) ?
my naive approach has been to consider all pairs of elements:
TP = 7 (A-B, A-C, B-C, E-F, E-G, F-G, H-I)
FP = 6 (E-H, E-I, F-H, F-I, G-H, G-I)
FN = 3 (A-D, B-D, C-D)
Is it the right way of doing it ?
Thanks