I have a rather limited data set upon which I am performing supervised-learning, multi-class text classification using scikit-learn. To alleviate the shortage of information slightly, I wanted to do the following:
Extract ngrams from the content I want to classify, merge it with the unigrams of the content and perform classification
Implement (or use an existing implementation of) a vote-based ensemble classifier to improve classification accuracy. For example, both Multinomial Bayes and KNN seem to give good results for different classes: ideally I would combine these such that I get slightly better (and hopefully not worse) performance rather than the shoddy ~50% I am able to get using my limited data set.
While the first step is trivial, I cannot find much on how I would be able to do ensemble classification using scikit-learn. I've noted that that scikit-learn has some entries on ensemble classes such as this one, but it doesn't seem to be quite what I'm looking for.
Does anyone know of a concrete example of doing this using scikit-learn?