3
votes

I am implementing matrix factorization to predict a movie rating by a reviewer. The dataset is taken from MovieLens (http://grouplens.org/datasets/movielens/). This is a well-studied recommendation problem, so I am implementing this matrix factorization method purely for learning purposes.

I model the cost function as the root-mean-square error between the predicted ratings and the actual ratings in the training dataset. I use the scipy.optimize.minimize function (with the conjugate gradient method) to factor the movie rating matrix, but this optimization tool is too slow even for a dataset with only 100K ratings. I plan to scale my algorithm to a dataset with 20 million ratings.
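
For reference, here is a rough sketch of my current setup (the variable names, the rank k, and the toy data are just illustrative; I also use a sum of squared errors over observed entries, which has the same minimizer as the RMSE):

    import numpy as np
    from scipy.optimize import minimize

    # Toy stand-in for the MovieLens data: R holds ratings, M marks observed entries
    num_users, num_movies, k = 20, 30, 5
    rng = np.random.default_rng(0)
    R = rng.integers(1, 6, size=(num_users, num_movies)).astype(float)
    M = (rng.random((num_users, num_movies)) < 0.3).astype(float)

    def unpack(params):
        # Split the flat parameter vector into user factors U and movie factors V
        U = params[:num_users * k].reshape(num_users, k)
        V = params[num_users * k:].reshape(num_movies, k)
        return U, V

    def my_cost_function(params):
        # Squared error over observed ratings only
        U, V = unpack(params)
        err = M * (U @ V.T - R)
        return 0.5 * np.sum(err ** 2)

    def my_gradient_function(params):
        U, V = unpack(params)
        err = M * (U @ V.T - R)
        return np.concatenate([(err @ V).ravel(), (err.T @ U).ravel()])

    x0 = 0.1 * rng.standard_normal(num_users * k + num_movies * k)
    res = minimize(my_cost_function, x0, jac=my_gradient_function, method='CG')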

I have been searching for a Python-based solution for stochastic gradient descent, but the SGD implementation I found in scikit-learn does not allow me to use my custom cost and gradient functions.

I can implement my own stochastic gradient descent, but I am checking with you guys whether a tool for this already exists.

Basically, I am wondering if there is an API similar to this:

optimize.minimize(my_cost_function,
                  my_input_param,
                  jac=my_gradient_function,
                  ...)

Thanks! Un

2
There are two things you should look at: (1) whether or not the matrix libraries are vectorized/parallelized, and (2) the convergence of your gradient step size. Plot the cost function versus iteration to see whether the step size can make it go faster. You might be taking too many small steps to converge to a solution. – duffymo
Thanks for the reply. Good point on the step size. – suthee

2 Answers

1
votes

This is so simple to implement (at least the vanilla method) that I don't think there is a "framework" around it. The update is just

    my_input_param -= alpha * my_gradient_function(my_input_param)

Maybe you want to have a look at Theano, which will do the differentiation for you. Depending on what you want to do, it might be a bit overkill, though.
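
For what it's worth, here is a minimal sketch of that loop for the matrix-factorization case in the question (the names, learning rate, and toy data are just illustrative):

    import random
    import numpy as np

    # Toy observed ratings as (user, movie, rating) triples
    ratings = [(0, 1, 4.0), (0, 2, 3.0), (1, 0, 5.0), (2, 2, 2.0)]
    num_users, num_movies, k = 3, 3, 2
    rng = np.random.default_rng(0)
    U = 0.1 * rng.standard_normal((num_users, k))   # user factors
    V = 0.1 * rng.standard_normal((num_movies, k))  # movie factors

    alpha = 0.01  # learning rate
    for epoch in range(200):
        random.shuffle(ratings)         # visit ratings in random order
        for u, m, r in ratings:
            err = U[u] @ V[m] - r       # prediction error for one rating
            gu = err * V[m]             # gradient of 0.5 * err**2 w.r.t. U[u]
            gv = err * U[u]             # gradient w.r.t. V[m]
            U[u] -= alpha * gu          # step against the gradient
            V[m] -= alpha * gv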

1
votes

I've been trying to do something similar in R but with a different custom cost function.

As I understand it, the key is to compute the gradient and step in the opposite direction, which takes you towards the local minimum.

With linear regression (y = mx + c) and a least-squares cost function, the cost for one example is (mx + c - y)^2. The partial derivative of this with respect to m is 2x(mx + c - y). With the more traditional machine-learning notation where m = theta, this gives the vectorized R update (the constant factor of 2 gets absorbed into the learning rate):

    theta <- theta - learning_rate * t(X) %*% (X %*% theta - y) / length(y)
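
Since the question is about Python, here is the same update in NumPy in case it helps map between the two (the toy data here is made up):

    import numpy as np

    # Toy data: a leading column of ones lets theta[0] play the role of c
    rng = np.random.default_rng(0)
    X = np.column_stack([np.ones(50), rng.standard_normal(50)])
    y = 3.0 + 2.0 * X[:, 1] + 0.1 * rng.standard_normal(50)

    theta = np.zeros(2)
    learning_rate = 0.1
    for _ in range(1000):
        # Same as the R line above: mean-squared-error gradient step
        theta -= learning_rate * X.T @ (X @ theta - y) / len(y)
    # theta should end up close to [3.0, 2.0]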

I don't know this for sure, but I would assume that for linear regression with a cost function of sqrt(mx + c - y), the gradient step again uses the partial derivative with respect to m, which I believe is x / (2 * sqrt(mx + c - y))

If any or all of this is incorrect, please (anybody) correct me. This is something I am trying to learn myself, and I would appreciate knowing if I'm heading in completely the wrong direction.
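
One way to sanity-check a derivation like this numerically (my suggestion, not something from the question) is scipy.optimize.check_grad, which compares an analytic gradient against a finite-difference estimate:

    import numpy as np
    from scipy.optimize import check_grad

    rng = np.random.default_rng(0)
    X = np.column_stack([np.ones(20), rng.standard_normal(20)])
    y = rng.standard_normal(20)

    def cost(theta):
        return 0.5 * np.mean((X @ theta - y) ** 2)

    def grad(theta):
        return X.T @ (X @ theta - y) / len(y)

    # Prints a value near zero if grad really is the gradient of cost
    print(check_grad(cost, grad, np.array([0.5, -0.3])))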