Given the Y as the output such as classified labels, and the X as the input feature vector consisted of a series of properties, the Bayesian classifier proposes a rule that a map from the
X
to
Y=[y1,y2,⋯,yk],X=[x1,x2,⋯,xn]
which yk represents a label and xi denotes a property of input data.
Bayesian theorem is defined as following:
P(Y∣X)=P(X∣Y)∗P(Y)P(X) =P(X∣Y)∗P(Y)∑i=ki=0P(X∣Y=yi)∗P(yi)
Naive Bayesian theorem that assumes the elements of
X
are conditional independence is defined as following:
Combined the two formulas above, the Naive Bayesian Classifier can be applied to classify task by using the big data.