
Given an RBF SVC machine learning model called 'm', I performed a GridSearchCV on the gamma value to optimize recall. I'm looking to answer this: "The grid search should find the model that best optimizes for recall. How much better is the recall of this model than the precision?"

So I did the GridSearchCV:

grid_values = {'gamma': [0.001, 0.01, 0.05, 0.1, 1, 10, 100]}
grid_m_re = GridSearchCV(m, param_grid = grid_values, scoring = 'recall')
grid_m_re.fit(X_train, y_train)
y_decision_fn_scores_re = grid_m_re.decision_function(X_test) 

print('Grid best parameter (max. recall): ', grid_m_re.best_params_)
print('Grid best score (recall): ', grid_m_re.best_score_)

This tells me the best model is for gamma=0.001 and it has a recall score of 1.

I'm wondering how to get the precision for this model, to see its trade-off, because GridSearchCV only has attributes for the metric it was optimized for. ([Doc sklearn.GridSearchCV][1])

In newer versions of scikit-learn, you can supply multiple scoring metrics. But then best_params_ and best_score_ will only work if you set the refit param to the scorer with respect to which you want the best params. – Vivek Kumar
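To illustrate the comment above, here is a minimal sketch of multi-metric scoring with `refit`; the toy dataset is made up for the example and stands in for the question's `X_train`/`y_train`:

```python
# Multi-metric grid search: refit='recall' decides best_params_/best_estimator_,
# while cv_results_ still records every metric for every candidate.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

grid_values = {'gamma': [0.001, 0.01, 0.05, 0.1, 1, 10, 100]}
grid = GridSearchCV(SVC(kernel='rbf'), param_grid=grid_values,
                    scoring=['recall', 'precision'], refit='recall')
grid.fit(X_train, y_train)

idx = grid.best_index_
print(grid.best_params_)                               # chosen by recall
print(grid.cv_results_['mean_test_recall'][idx])       # CV recall of that model
print(grid.cv_results_['mean_test_precision'][idx])    # CV precision of the same model
```

This way one search answers both halves of the question directly from `cv_results_`.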

2 Answers


Not sure if there's an easier/more direct way to get this, but this approach also allows you to capture the 'best' model to play around with later:

First, do your CV fit on the training data:

grid_m_re = GridSearchCV(m, param_grid = grid_values, scoring = 'recall')
grid_m_re.fit(X_train, y_train)

Once you're done, you can pull out the 'best' model (as determined by your scoring criteria during CV), and then use it however you want:

m_best = grid_m_re.best_estimator_

and in your specific case:

from sklearn.metrics import precision_score

y_pred = m_best.predict(X_test)
precision_score(y_test, y_pred)
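Putting the whole thing together, a self-contained sketch of the answer's approach (the dataset here is synthetic, standing in for the question's train/test split):

```python
# Fit the recall-optimizing grid search, pull out the refit best estimator,
# then compute the recall/precision gap on the held-out test set.
from sklearn.datasets import make_classification
from sklearn.metrics import precision_score, recall_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

grid = GridSearchCV(SVC(kernel='rbf'),
                    param_grid={'gamma': [0.001, 0.01, 0.1, 1, 10]},
                    scoring='recall')
grid.fit(X_train, y_train)
m_best = grid.best_estimator_   # already refit on the full training set

y_pred = m_best.predict(X_test)
gap = recall_score(y_test, y_pred) - precision_score(y_test, y_pred)
print(gap)                      # "how much better is recall than precision"
```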

You can easily overfit if you don't optimize both C and gamma at the same time.

If you plot the SVC with C on the x axis, gamma on the y axis, and recall as color, you get some kind of V-shape, see here.

So if you do a grid search, better to optimize for both C and gamma at the same time.

The problem is that you usually get the best results for small C values, and in that area the V-shape has its pointy end: it is not very big and is difficult to hit.

I recently used:

make a random grid of 10 points
      every point contains C, gamma, direction, speed
split the dataset with StratifiedShuffleSplit
fit & estimate score with cross validation

repeat until the break criterion is hit:
  kill the worst two points
  the best two points spawn a kid
  move every point in its direction with just a little bit of random
  fit & estimate score with cross validation
     (if a point notices it is going downhill, turn around and halve its speed)

Worked like a charm.

I used the maximum distance in the feature space divided by four as the initial speed; the direction had a maximum randomness of pi/4.
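The loop described above can be sketched roughly as follows. The search space bounds, kill/spawn rule, and step sizes are my guesses at the scheme, not code from this answer; it searches C and gamma in log space, which is the usual choice for SVMs:

```python
# Evolutionary search over (log C, log gamma): each point also carries a
# direction and a speed; bad points die, good points breed, everyone drifts.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedShuffleSplit, cross_val_score
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=300, random_state=0)
cv = StratifiedShuffleSplit(n_splits=3, test_size=0.3, random_state=0)

def score(log_c, log_g):
    m = SVC(C=10**log_c, gamma=10**log_g)
    return cross_val_score(m, X, y, cv=cv, scoring='recall').mean()

# each point: [log C, log gamma, direction (radians), speed]
pts = [[rng.uniform(-2, 3), rng.uniform(-4, 1), rng.uniform(0, 2*np.pi), 0.5]
       for _ in range(10)]
scores = [score(p[0], p[1]) for p in pts]

for _ in range(10):
    order = np.argsort(scores)
    for i in order[:2]:                       # kill the worst two points:
        a, b = pts[order[-1]], pts[order[-2]]  # the best two spawn a kid
        pts[i] = [(a[j] + b[j]) / 2 for j in range(4)]
        scores[i] = score(pts[i][0], pts[i][1])
    for i, p in enumerate(pts):
        p[2] += rng.uniform(-np.pi/4, np.pi/4)  # a little bit of random turn
        p[0] += p[3] * np.cos(p[2])
        p[1] += p[3] * np.sin(p[2])
        new = score(p[0], p[1])
        if new < scores[i]:                   # going downhill: turn, half speed
            p[2] += np.pi
            p[3] /= 2
        scores[i] = new

best = pts[int(np.argmax(scores))]
print(10**best[0], 10**best[1], max(scores))  # C, gamma, CV recall
```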

Well, the cross validation was a bit costly.

Cleptocreatively inspired by this paper.

... and another edit:

I used between 10 and 20 cycles of the loop to get the perfect points. If your dataset is too big to do several fits, make a representative subset for the first few trainings...