I have to evaluate the performance of two classification algorithms. I obtain the False Positive Rate and the True Positive Rate using the roc_curve from sklearn (here documentation). I used the following code:
fpr = dict()
tpr = dict()
roc_auc = dict()
for i in range(2):
fpr[i], tpr[i], _ = roc_curve(true_labels, pred_labels)
roc_auc[i] = auc(fpr[i], tpr[i])
I have this information for two classifiers, and now I want to compare them. I know that it is possible to use AUC or other metrics, but I really want to understand the percentage of times a ROC curve is over the other. For this reason, I would like to compare pairwise points from the two curves.
My first idea was to compare, for each fpr element the corresponding tpr. Unfortunately, the fpr and tpr objects have different size, since they are the output of an interpolation step.
Any idea on how to do so? It seems that I cannot say to the roc_curve function how many (and which) points I want in the output, in order to make the comparison feasible. It is not mandatory to use sklearn.