I understood the general formula:
P(i | x) = (p(i)p(x|i))/(sum(p(j)(p(x|j))
But I cannot successfully apply it to this exercise:
Consider the data sets for two classes X1 = {(0,0)} and X2 = {(1,0), (0,1)}. Which classification probabilities will a naive Bayes classifier produce for the feature vector (0,0)?
I can't understand what p(1) and p((0,0)|1) would be in this case.
(x,y)
:(0,0)
andX2
contains two training examples. I see training data with 2 classes (X1 and X2), 3 instances total, and 2 attributes. I have to agree that the question is using a different syntax than most books, but so what. I would again use a different syntax in an answer. - Has QUIT--Anony-Mousse