I have written a multi-class classifier using logistic regression that is trained using a one vs all approach. I want to plot the learning curve for the trained classifier.
Should the learning curve be plotted on a class by class basis or should it be a single plot for the classifier as a whole? Does it make a difference?
To clarify, the learning curve is a plot of the training & cross validation/test set error/cost vs training set size. This plot should allow you to see if increases the training set size improves performance. More generally, the learning curve allows you to identify whether your algorithm suffers from a bias (under fitting) or variance (over fitting) problem.
Some details regarding my code:
- analyzes MNIST handwritten digit images
- predicts digit (0-9) in image
- based on Andrew Ng's Coursera class on machine learning