朴素贝叶斯分类工作过程:
1,设D是训练元组和相关联的类标号的集合。
2,假定有m个类C1,C2,C3,...Cm。给定元组X,分类法将预测X属于具有最高后验概率(条件X下)的类,即,当P(Ci|X)>P(Cj|X),朴素贝叶斯分类法预测X属于类Cj
贝叶斯定理:P(Ci|X)=P(X|Ci)P(Ci)/P(X)
3,问题转换为根据P(X|Ci)P(Ci)/P(X)的大小判断类别,先求P(Ci)的先验概率
4,假定类条件独立,P(X|Ci)=P(x1|Ci)*P(x2|Ci).....*P(xn|Ci),比较结果确定属于哪个类别。
训练集:
<30 high no fair no
<30 high no excellent no
30-40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
30-40 low yes excellent yes
<30 medium no fair no
<30 low yes fair yes
>40 medium yes fair yes
<30 medium yes excellent yes
30-40 medium yes excellent yes
30-40 high yes fair yes
>40 medium no excellent no
测试集:
<30 medium yes fair
>40 high no excellent