create ROC from 10 different thresholds

Question

I have output from svmlight which has x=predictions (0.1,-0.6,1.2, -0.7...), y=actual class {+1,-1}. I want to create an ROC curve for 10 specific different thresholds (let t be a vector that contains 10 different threshold values). I checked ROCR package but I didn't see any option for supplying threshold vector. I need to calculate TPR and FPR for each threshold value and plot. Is there any other way to do that ? I am new to R programming.

I am also at a loss for how to set the thresholds in pred. I tried a naive approach that, not surprisingly, didn't work: pred<-prediction(x,y,alpha.values=c(0.0,0.05,0.1,0.15,0.2,0.25,0.3)) I have two prediction systems, but one produces consistently different numbers, so I need to force ROCR to apply the same thresholds to both prediction systems. Has anyone done this? — Chris Anderson

patr1ckm patr1ckm · Accepted Answer · 2013-08-10T19:05:28

ROCR creates an ROC curve by plotting the TPR and FPR for many different thresholds. This can be done with just one set of predictions and labels because if an observation is classified as positive for one threshold, it will also be classified as positive at a lower threshold. I found this paper to be helpful in explaining ROC curves in more detail.

You can create the plot as follows in ROCR where x is the vector of predictions, and y is the vector of class labels:

pred <- prediction(x,y) 
perf <- performance(pred,"tpr","fpr")
plot(perf)

If you want to access the TPR and FPR associated with all the thresholds, you can examine the performance object 'perf':

str(perf)

The following answer shows how to obtain the threshold values in more detail:

https://stackoverflow.com/a/16347508/786220

create ROC from 10 different thresholds

2 Answers