Comparing the AdaBoost.M1 and AdaBoost.M2 Ensemble Learning Algorithms

KPer_Yang

Article link: https://blog.csdn.net/KPer_Yang/article/details/124787900

Source of AdaBoost.M2

This article is based on the paper "Experiments with a New Boosting Algorithm" by Freund and Schapire.

Paper link: Experiments with a New Boosting Algorithm

The main disadvantage of AdaBoost.M1 identified in the paper:

The main disadvantage of AdaBoost.M1 is that it is unable to handle weak hypotheses with error greater than 1/2. When the number of classes is greater than 2, the requirement that the error be less than 1/2 is quite strong and may often be hard to meet.

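In other words, AdaBoost.M1 cannot handle weak hypotheses whose error rate exceeds 0.5, and in multiclass problems, requiring the error rate to stay below 0.5 is a strong condition that is hard to satisfy in practice.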
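To see why the 1/2 bound becomes demanding with more classes, note that guessing uniformly at random among k labels has expected error 1 - 1/k. The sketch below simply tabulates that value against the 1/2 threshold; the function names are illustrative and do not come from the paper.

```python
def random_guess_error(k: int) -> float:
    """Expected error of guessing uniformly at random among k labels."""
    return 1.0 - 1.0 / k


def gap_to_m1_threshold(k: int) -> float:
    """How far random guessing sits above the 1/2 error bound of AdaBoost.M1."""
    return random_guess_error(k) - 0.5


# Compare the binary case with two multiclass cases.
for k in (2, 3, 10):
    print(f"k={k}: random-guess error={random_guess_error(k):.3f}, "
          f"gap to 1/2={gap_to_m1_threshold(k):.3f}")
```

For k = 2 the gap is zero, so a weak hypothesis only has to be slightly better than chance. For k = 10 the gap is 0.4, so the weak learner must already be far better than chance before AdaBoost.M1 can use it.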

How AdaBoost.M2 addresses the disadvantage of AdaBoost.M1:

Method: extending the communication between the boosting algorithm and the weak learner.

Advantage:

The boosting algorithm can focus the weak learner not only on hard-to-classify examples, but more specifically, on the incorrect labels that are hardest to discriminate. (This is mainly due to the use of plausible label sets, soft plausibility scores, and the pseudo-loss.)

Specific methods:

1. Give the weak learning algorithm more expressive power.

a. Allow the weak learner to generate more expressive hypotheses, which, rather than identifying a single label in Y, instead choose a set of "plausible" labels. This may often be easier than choosing just one label. (That is, the hypothesis outputs a set of candidate labels; for a handwritten digit, for example, it might output both 7 and 9.)

b. Allow the weak learner to indicate a "degree of plausibility." Thus, each weak hypothesis outputs a vector in [0, 1]^k, one component per label. (The learner outputs a plausibility value in [0, 1] for each label; these values need not be probabilities.)

2. Place a more complex requirement on the performance of the weak hypotheses. Rather than using the usual prediction error, we ask that the weak hypotheses do well with respect to a more sophisticated error measure that we call the pseudo-loss. (Instead of the error rate used by M1, the pseudo-loss is used.)
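As a rough illustration of the plausibility vectors and the pseudo-loss described above, the sketch below computes the pseudo-loss of a single weak hypothesis. It follows the definition in the paper: one half of the sum, over every example i and every incorrect label y, of D(i, y) * (1 - h(x_i, y_i) + h(x_i, y)), where D is a distribution over (example, incorrect label) pairs. The function name, the list-based data layout, and the digit example are assumptions made for illustration only.

```python
def pseudo_loss(h, y, D):
    """Pseudo-loss of one weak hypothesis.

    h: list of plausibility vectors, h[i][c] in [0, 1] for example i and label c.
    y: list of correct labels (integers).
    D: mislabel distribution, D[i][c] is the weight of the incorrect pair (i, c);
       weights over all incorrect pairs are assumed to sum to 1.
    """
    total = 0.0
    for i, (scores, true_label) in enumerate(zip(h, y)):
        for label, score in enumerate(scores):
            if label == true_label:
                continue  # only incorrect labels contribute
            total += D[i][label] * (1.0 - scores[true_label] + score)
    return 0.5 * total


# Hypothetical example: one handwritten digit whose correct label is 7.
k = 10
y = [7]
D = [[0.0 if c == 7 else 1.0 / 9 for c in range(k)]]  # uniform over the 9 wrong labels

h_set = [[1.0 if c in (7, 9) else 0.0 for c in range(k)]]  # "7 or 9" as plausible labels
h_flat = [[0.0] * k]                                       # uninformative hypothesis

print(pseudo_loss(h_set, y, D))   # about 1/18
print(pseudo_loss(h_flat, y, D))  # about 1/2
```

The hypothesis that answers "7 or 9" cannot be scored cleanly with an ordinary error rate, yet its pseudo-loss is well below 1/2, while a hypothesis that gives the same score to every label lands at 1/2. This is what lets the weak learner be useful without committing to a single label.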