Machine Learning Notes: Naive Bayes
- A family of classifiers quite similar to linear models, but faster to train. The price for this efficiency is that naive Bayes models often provide worse generalization performance.
- The reason that naive Bayes models are so efficient is that they
learn parameters by looking at each feature individually and simply
collect per-class statistics from each feature.
- There are three kinds of naive Bayes classifiers implemented in
scikit-learn: GaussianNB, BernoulliNB, and MultinomialNB.
- GaussianNB can be applied to any continuous data, while BernoulliNB assumes binary data and MultinomialNB assumes count data. (BernoulliNB and MultinomialNB are mostly used in text data classification.)
- MultinomialNB and BernoulliNB have a single parameter, alpha, which controls model complexity. A large alpha works as if many virtual data points with positive values for all the features were added to the data; this smooths the per-class statistics and makes the model less complex. GaussianNB is mostly used on very high-dimensional data.
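How alpha acts as virtual smoothing counts can be sketched with a minimal from-scratch multinomial naive Bayes. This is an illustrative sketch, not scikit-learn's implementation, and the function names here are made up for the example:

```python
import numpy as np

def fit_multinomial_nb(X, y, alpha=1.0):
    """Collect per-class count statistics, smoothed by alpha
    (as if alpha virtual points with positive values for every
    feature had been added to each class)."""
    classes = np.unique(y)
    log_prior = np.log(np.array([np.mean(y == c) for c in classes]))
    # per-class feature counts, plus alpha virtual counts per feature
    counts = np.array([X[y == c].sum(axis=0) + alpha for c in classes])
    log_prob = np.log(counts / counts.sum(axis=1, keepdims=True))
    return classes, log_prior, log_prob

def predict_multinomial_nb(model, X):
    classes, log_prior, log_prob = model
    # log P(c) + sum of count-weighted log feature probabilities
    joint = X @ log_prob.T + log_prior
    return classes[np.argmax(joint, axis=1)]

# toy count data: two word-count "documents" per class
X = np.array([[3, 0, 1],
              [2, 0, 0],
              [0, 4, 1],
              [0, 3, 2]])
y = np.array([0, 0, 1, 1])
model = fit_multinomial_nb(X, y, alpha=1.0)
print(predict_multinomial_nb(model, X))  # recovers the training labels
```

With alpha=1.0 every per-class feature count starts from one virtual observation, so no feature ever gets zero probability; a larger alpha pulls the per-class statistics closer together, giving a smoother, less complex model.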
- Naive Bayes models share many of the strengths and weaknesses of
linear models: they are fast to train and to predict, they work well
with high-dimensional sparse data, and they are relatively robust to
their parameters. They are great baseline models and are often used
on very large datasets.
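As a quick baseline sketch, assuming scikit-learn is installed, GaussianNB can be fit on a small continuous dataset in a few lines:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# continuous measurements, so GaussianNB is the appropriate variant
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# training is a single pass collecting per-class mean/variance statistics
clf = GaussianNB().fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```

No hyperparameter tuning is needed here, which is part of why naive Bayes makes a convenient baseline before trying more complex models.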