5.2 Boosting (Sequential) Learning
Boosting originates in the PAC (probably approximately correct) learning framework. It builds an ensemble incrementally by training each new model to emphasize the training instances that previous models misclassified.
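For a quick sense of boosting in practice, here is a minimal sketch using scikit-learn's AdaBoostClassifier on a synthetic dataset; the dataset and hyperparameters are arbitrary demonstration choices, not part of the derivation below.

```python
# A minimal boosting sketch using scikit-learn's AdaBoostClassifier.
# The synthetic dataset and hyperparameters are arbitrary demo choices.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

# Each round fits a new weak learner (a shallow decision tree by default)
# that focuses more on the examples earlier rounds got wrong.
clf = AdaBoostClassifier(n_estimators=50, random_state=0)
clf.fit(X_tr, y_tr)
print("test accuracy:", clf.score(X_te, y_te))
```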
5.21 AdaBoost
AdaBoost = Weak Algorithm + Re-Weighting + Linear Aggregation
5.211 What is AdaBoost?
AdaBoost calls a given weak or base learning algorithm repeatedly in a series of rounds $t = 1, 2, \dots, T$. It is used for classification, mostly binary classification in practice.
This algorithm takes as input a training set $(\mathbf{x}_1, y_1), (\mathbf{x}_2, y_2), \dots, (\mathbf{x}_N, y_N)$, where each $\mathbf{x}_n$ belongs to some domain or instance space $X$ and each label $y_n$ belongs to some label set $Y$.
One of the main ideas of the algorithm is to maintain a distribution, or set of weights, over the training set. The weight of this distribution on example $\mathbf{x}_n$ on round $t$ is denoted $u_n^{(t)}$. Initially, all weights are set equally, but on each round the weights of incorrectly classified examples are increased so that the weak learner is forced to focus on the hard examples in the training set.
- Ensemble learning, suited to classification problems.
- Base hypotheses (base learners, possibly weak) are trained sequentially.
- The base learner in the current round pays more attention to the samples that the previous learner misclassified.
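To make the re-weighting idea concrete, here is a minimal sketch of a single re-weighting step, assuming binary labels in $\{-1, +1\}$; the scaling factor anticipates the update rule given in Section 5.213, and the toy numbers are only for illustration.

```python
import numpy as np

# A minimal sketch of one re-weighting step, assuming binary labels in {-1, +1}
# and an already-fitted weak hypothesis g_t; names follow the notation above.
def reweight(u, y, pred, eps):
    """Scale up the weights of misclassified examples, scale down the rest."""
    factor = np.sqrt((1.0 - eps) / eps)   # > 1 when the weak learner beats random guessing
    wrong = (y != pred)                   # indicator [y_n != g_t(x_n)]
    return np.where(wrong, u * factor, u / factor)

# Toy usage: 4 examples, the weak hypothesis gets the last one wrong.
u = np.full(4, 0.25)
y = np.array([+1, -1, +1, -1])
pred = np.array([+1, -1, +1, +1])
eps = u[y != pred].sum() / u.sum()        # weighted error rate
print(reweight(u, y, pred, eps))          # the misclassified example gains weight
```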
5.212 Why AdaBoost?
AdaBoost can handle weak hypotheses that output real-valued or confidence-rated predictions. That is, for each instance the weak hypothesis $g_t$ outputs a prediction $g_t(\mathbf{x}_n) \in \mathbb{R}$, whose sign is the predicted label ($+1$ or $-1$) and whose magnitude $|g_t(\mathbf{x}_n)|$ gives a measure of "confidence" in the prediction.
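As an illustration, here is a minimal sketch of a hypothetical confidence-rated weak hypothesis: a decision stump that outputs half the log-odds of the positive class in each branch instead of a hard $\pm 1$ label, so its sign is the predicted label and its magnitude the confidence. The function names and toy data are assumptions for demonstration, not part of the original text.

```python
import numpy as np

# Hypothetical confidence-rated stump: each branch outputs half the log-odds of
# the positive class, so sign(g_t(x)) is the label and |g_t(x)| the confidence.
def make_confidence_stump(X, y, feature, threshold):
    def half_log_odds(labels):
        if labels.size == 0:          # empty branch: no evidence, zero confidence
            return 0.0
        p = np.clip(np.mean(labels == +1), 1e-3, 1 - 1e-3)  # smooth away 0 and 1
        return 0.5 * np.log(p / (1 - p))
    left_score = half_log_odds(y[X[:, feature] <= threshold])
    right_score = half_log_odds(y[X[:, feature] > threshold])
    return lambda Xq: np.where(Xq[:, feature] <= threshold, left_score, right_score)

# Toy usage: feature 0 separates the classes, so both branches are confident.
X = np.array([[0.1], [0.4], [0.6], [0.9]])
y = np.array([+1, +1, -1, -1])
g_t = make_confidence_stump(X, y, feature=0, threshold=0.5)
print(g_t(X))   # positive scores on the left branch, negative on the right
```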
5.213 How AdaBoost?
Our mission is to find a weak hypothesis $g_t$ for each round, together with the weight $\alpha_t$ it receives in the final linear aggregation.
Given the learning data set $D = \{(\mathbf{x}_1, y_1), (\mathbf{x}_2, y_2), \dots, (\mathbf{x}_N, y_N)\}$ and a base learning algorithm $A$:
- Set the initial weights $u^{(1)} = \left[\frac{1}{N}, \frac{1}{N}, \dots, \frac{1}{N}\right]$.
- $\mathbf{for}\ t = 1, 2, \dots, T$:
  - Obtain $g_t$ by running $A$ on $D$ weighted by $u^{(t)}$.
  - $\epsilon_t = \dfrac{\sum_{n=1}^{N} u_n^{(t)}\, I\,[\,y_n \neq g_t(\mathbf{x}_n)\,]}{\sum_{n=1}^{N} u_n^{(t)}}$
  - $\mathbf{if}\ \epsilon_t > 0.5$, then break.
  - $\alpha_t = \frac{1}{2}\ln\!\left(\frac{1 - \epsilon_t}{\epsilon_t}\right)$
  - Update the weights $u^{(t)} \rightarrow u^{(t+1)}$ for every example $n$:
    - if $y_n \neq g_t(\mathbf{x}_n)$: $\ u_n^{(t+1)} \leftarrow u_n^{(t)} \sqrt{\frac{1 - \epsilon_t}{\epsilon_t}}$
    - if $y_n = g_t(\mathbf{x}_n)$: $\ u_n^{(t+1)} \leftarrow u_n^{(t)} \Big/ \sqrt{\frac{1 - \epsilon_t}{\epsilon_t}}$
- $\mathbf{end\ for}$
- $\mathbf{Return}$ $G(\mathbf{x}) = \mathrm{sign}\!\left[\sum_{t=1}^{T} \alpha_t g_t(\mathbf{x})\right]$
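Below is a minimal from-scratch sketch of the loop above, assuming labels in $\{-1, +1\}$ and using exhaustive decision stumps as the base algorithm $A$; the stump search and the toy dataset are illustrative choices rather than part of the derivation.

```python
import numpy as np

def train_stump(X, y, u):
    """Base algorithm A: pick the (feature, threshold, polarity) stump with the
    lowest u-weighted error. Returns the stump and its weighted error epsilon."""
    best, best_err = None, np.inf
    for j in range(X.shape[1]):
        for thr in np.unique(X[:, j]):
            for polarity in (+1, -1):
                pred = np.where(X[:, j] <= thr, polarity, -polarity)
                err = u[pred != y].sum() / u.sum()
                if err < best_err:
                    best_err, best = err, (j, thr, polarity)
    return best, best_err

def stump_predict(stump, X):
    j, thr, polarity = stump
    return np.where(X[:, j] <= thr, polarity, -polarity)

def adaboost(X, y, T=10):
    """AdaBoost as in the loop above: re-weight examples and aggregate stumps."""
    N = X.shape[0]
    u = np.full(N, 1.0 / N)                   # u^(1) = [1/N, ..., 1/N]
    stumps, alphas = [], []
    for t in range(T):
        g_t, eps = train_stump(X, y, u)       # g_t from A on D weighted by u^(t)
        if eps > 0.5:                         # worse than random guessing: stop
            break
        eps = max(eps, 1e-10)                 # guard against division by zero
        alpha = 0.5 * np.log((1 - eps) / eps)
        pred = stump_predict(g_t, X)
        factor = np.sqrt((1 - eps) / eps)
        u = np.where(pred != y, u * factor, u / factor)   # re-weighting step
        stumps.append(g_t)
        alphas.append(alpha)
    def G(Xq):                                # G(x) = sign(sum_t alpha_t g_t(x))
        scores = sum(a * stump_predict(g, Xq) for a, g in zip(alphas, stumps))
        return np.sign(scores)
    return G

# Toy usage on a tiny 1-D dataset with labels in {-1, +1}.
X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]])
y = np.array([+1, +1, +1, -1, -1, -1])
G = adaboost(X, y, T=5)
print(G(X))   # should recover the training labels on this easy example
```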