I have a set of trainFeatures
and a set of testFeatures
with positive, neutral and negative labels:
trainFeats = negFeats + posFeats + neutralFeats
testFeats = negFeats + posFeats + neutralFeats
For example, one entry inside the trainFeats
is
(['blue', 'yellow', 'green'], 'POSITIVE')
the same for the list of test features, so I specify the labels for each set. My question is how can I use the scikit implementation of Random Forest classifier and SVM to get the accuracy of this classifier altogether with precision and recall scores for each class? The problem is that I am currently using words as features, while from what I read these classifiers require numbers. Is there a way I can achieve my purpose without changing functionality? Many thanks!