
I am developing a naive Bayes classifier using a simple bag-of-words approach. My question: in naive Bayes, or in any other machine learning scenario, 'training' the classifier is an important matter. But how do I train a naive Bayes classifier when I already have a bag_of_words of various classes?

Have a look at this tutorial. – Tim Biegeleisen
@TimBiegeleisen I have read the tutorial, but a question still remains. Suppose I have two classes, positive and negative. In my training data set the positive class has a number of positive strings and the negative class has a number of negative strings. But in the positive strings not all the words are positive, and that is where the problem arises: when I take words from them and put them into the positive bag_of_words, some negative words get added as well, which hampers the later classification. – Pritam
@Pritam does the positive or negative slant of the words depend on the context? If so, you need to add the context as features in your X vector for each sample (word). Otherwise, how would the classifier be able to distinguish? – miraculixx

2 Answers

1 vote

how to train a naive Bayes classifier when I already have a bag_of_words of various classes

In general, what you do is this (see the sketch after the list):

  1. split your bag of words into two random subsets; call one training and the other test
  2. train the classifier on the training subset
  3. validate the classifier's accuracy by running it against the test subset
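
For concreteness, here is a minimal sketch of those three steps using scikit-learn. The toy texts, labels, and the choice of CountVectorizer with MultinomialNB are my own assumptions, not something given in the question; substitute your real bag-of-words data.

```python
# Minimal train/test sketch with scikit-learn (toy data for illustration only).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB

texts = ["great product", "awful service", "really great", "truly awful"]
labels = ["positive", "negative", "positive", "negative"]

# 1. split into random training and test subsets
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=0
)

# turn raw strings into bag-of-words count vectors
vectorizer = CountVectorizer()
X_train_counts = vectorizer.fit_transform(X_train)

# 2. train the classifier on the training subset
clf = MultinomialNB()
clf.fit(X_train_counts, y_train)

# 3. validate accuracy against the held-out test subset
print(clf.score(vectorizer.transform(X_test), y_test))
```

With a real corpus you would of course use far more than four strings; the split, fit, and score steps stay the same.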

'training' the classifier is an important matter

indeed -- that's how your classifier learns to separate words from different classes.

0 votes

The Stanford IR book gives a good explanation of how Naive Bayes classifiers work, and they use text classification as their example. The Wikipedia article also gives a detailed description of the theory and some concrete examples.

In a nutshell, you count the occurrences of each word type within each class, and then normalize the counts (by the total number of word occurrences in the class for the multinomial model, or by the number of documents for the Bernoulli model) to get the probability of a word given a class, p(w|c). You then use Bayes' rule to get the probability of each class given the document, p(c|doc) ∝ p(c) * p(doc|c), where the probability of the document given the class is the product of the probabilities of its words given the class, p(doc|c) = Π(w in doc) p(w|c). These products get very small before you normalize between the classes, so you usually take the logarithm of each probability and sum the logs to avoid underflow errors.
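
Here is a from-scratch sketch of that counting-and-normalizing procedure for the multinomial model. The toy documents and variable names are hypothetical, and I add simple add-one (Laplace) smoothing, which the explanation above does not mention, so that unseen words do not zero out the product.

```python
# Multinomial naive Bayes from scratch (toy data; add-one smoothing is an extra assumption).
import math
from collections import Counter

# hypothetical tokenized training documents per class
docs = {
    "positive": [["great", "product"], ["really", "great"]],
    "negative": [["awful", "service"], ["truly", "awful"]],
}

vocab = {w for ds in docs.values() for d in ds for w in d}
total_docs = sum(len(ds) for ds in docs.values())
priors = {c: len(ds) / total_docs for c, ds in docs.items()}  # p(c)

# count word occurrences per class, then normalize (with add-one smoothing) to get p(w|c)
word_probs = {}
for c, ds in docs.items():
    counts = Counter(w for d in ds for w in d)
    total = sum(counts.values())
    word_probs[c] = {w: (counts[w] + 1) / (total + len(vocab)) for w in vocab}

def classify(doc):
    # log p(c) + sum of log p(w|c): summing logs instead of multiplying avoids underflow
    scores = {
        c: math.log(priors[c]) + sum(math.log(word_probs[c][w]) for w in doc if w in vocab)
        for c in docs
    }
    return max(scores, key=scores.get)

print(classify(["really", "awful", "service"]))  # prints "negative"
```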