Z.F Zhou Watermelon Book SVM. This article is supplementary material for the SVM chapter of the Watermelon Book.
Watermelon Book
6.3
Suppose the training data set is linearly separable and the minimum margin is $\delta$.
$$
\begin{cases}
w^T x_i + b \ge +\delta, & y_i = +1\\
w^T x_i + b \le -\delta, & y_i = -1
\end{cases}
\quad\Rightarrow\quad
\begin{cases}
\frac{w^T}{\delta} x_i + \frac{b}{\delta} \ge +1, & y_i = +1\\
\frac{w^T}{\delta} x_i + \frac{b}{\delta} \le -1, & y_i = -1
\end{cases}
$$
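Rescaling $(w, b)$ by $1/\delta$ changes only the normalization of the constraints, not the separating hyperplane itself. A minimal sketch of this (the toy data, $w$, and $b$ below are hypothetical, chosen only for illustration):

```python
import numpy as np

# Hypothetical linearly separable toy data.
X = np.array([[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -1.0]])
y = np.array([+1, +1, -1, -1])

w = np.array([1.0, 1.0])
b = 0.0

# Minimum functional margin delta over the training set.
delta = np.min(y * (X @ w + b))

# After rescaling, the constraints read y_i (w'^T x_i + b') >= 1.
w_s, b_s = w / delta, b / delta
assert np.all(y * (X @ w_s + b_s) >= 1 - 1e-12)

# The predictions are unchanged: same hyperplane, different scale.
assert np.array_equal(np.sign(X @ w + b), np.sign(X @ w_s + b_s))
```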
6.8
Refer to the link below (see the KKT section and References) about Lagrange multipliers and the KKT conditions.
6.9
$$
L = \frac{1}{2} w^T w + \sum_{i=1}^m a_i \big(1 - y_i (w^T x_i + b)\big)
$$

Taking the derivative with respect to $w$ and setting it to zero:

$$
0 = w - \sum_{i=1}^m a_i y_i x_i
\quad\Rightarrow\quad
w = \sum_{i=1}^m a_i y_i x_i
$$
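This identity is easy to check numerically. In scikit-learn's SVC, dual_coef_ stores $a_i y_i$ for the support vectors (all other samples have $a_i = 0$ and drop out of the sum), so $w$ can be rebuilt from it. A sketch, assuming scikit-learn is installed:

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Two well-separated blobs, so a large C approximates the hard margin.
X, y = make_blobs(n_samples=40, centers=2, random_state=0)

clf = SVC(kernel="linear", C=1e3).fit(X, y)

# dual_coef_ holds a_i * y_i for support vectors only; since a_i = 0
# elsewhere, the sum over all samples reduces to the support vectors.
w = clf.dual_coef_ @ clf.support_vectors_

assert np.allclose(w, clf.coef_)
```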
6.11
You can compute $\frac{1}{2}\|w\|^2$ and $\sum_{i=1}^m a_i\big(1-y_i(w^Tx_i+b)\big)$ separately after substituting $w=\sum_{i=1}^m a_iy_ix_i$; the algebra is routine, so I only sketch it below.
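For completeness, substituting $w=\sum_i a_iy_ix_i$ and using the other stationarity condition $\sum_i a_iy_i=0$ gives the dual objective of (6.11):

$$
\begin{aligned}
\frac{1}{2}\|w\|^2 &= \frac{1}{2}\sum_{i=1}^m\sum_{j=1}^m a_ia_jy_iy_jx_i^Tx_j,\\
\sum_{i=1}^m a_i\big(1-y_i(w^Tx_i+b)\big) &= \sum_{i=1}^m a_i-\sum_{i=1}^m\sum_{j=1}^m a_ia_jy_iy_jx_i^Tx_j-b\underbrace{\sum_{i=1}^m a_iy_i}_{=0},\\
L &= \sum_{i=1}^m a_i-\frac{1}{2}\sum_{i=1}^m\sum_{j=1}^m a_ia_jy_iy_jx_i^Tx_j.
\end{aligned}
$$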
6.18
Because the $a_i$ produced by SMO may deviate from their theoretical values due to numerical error, (6.18) averages $b$ over all support vectors instead of computing it from a single one; a sketch is below.
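A minimal sketch of (6.18), where the arrays a_y (the products $a_iy_i$), Xs (the support vectors), and ys (their labels) are hypothetical inputs; note $1/y_s = y_s$ for $y_s \in \{-1,+1\}$:

```python
import numpy as np

def robust_bias(a_y, Xs, ys):
    """Average b over all support vectors, as in eq. (6.18).

    a_y : (n_sv,) products a_i * y_i for the support vectors
    Xs  : (n_sv, d) support vectors
    ys  : (n_sv,) labels in {-1, +1}, so 1/y_s == y_s
    """
    # For each support vector s: b_s = y_s - sum_i a_i y_i x_i^T x_s.
    b_each = ys - (Xs @ Xs.T) @ a_y
    # Averaging the b_s damps the numerical error left over from SMO.
    return b_each.mean()
```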
6.41
In the discussion below (6.41), the characterization of support vectors is based on $a_i \neq 0$.
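This can be checked with scikit-learn (the data here are generated only for illustration): the support vectors returned by SVC are exactly the samples with $a_i \neq 0$, and by complementary slackness they satisfy $y_i f(x_i) = 1-\xi_i \le 1$, i.e. they lie on or inside the margin:

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Overlapping blobs, so the soft margin is active.
X, y01 = make_blobs(n_samples=100, centers=2, cluster_std=3.0, random_state=0)
y = 2 * y01 - 1  # map {0, 1} labels to {-1, +1}

clf = SVC(kernel="linear", C=1.0).fit(X, y)

# Support vectors are exactly the samples with a_i != 0.
assert np.all(clf.dual_coef_ != 0)

# Complementary slackness: y_i f(x_i) = 1 - xi_i <= 1 for support vectors.
margins = y[clf.support_] * clf.decision_function(X[clf.support_])
print(margins.max())  # <= 1, up to the solver's tolerance
```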
The following section is partly based on reference [1].
KKT
Part 1
A clear interpretation: the three KKT equations are based on
$u = 0$ or $g = 0$, where $u$ and $g$ are vectors; componentwise, $u_i = 0$ or $g_i = 0$ for each $i$ (complementary slackness: $u_i g_i = 0$).
Given these conditions, we can find an $x$ minimizing $f(x)$ by solving another formulation, e.g. $\min_x\max_u L(x,u)$; the two problems share the same optimal $x$ and $u$.
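A tiny numerical illustration of $\min_x\max_{u\ge 0}L(x,u)$ (the problem and grids below are my own, chosen for illustration): minimize $f(x)=x^2$ subject to $g(x)=1-x\le 0$, whose constrained minimum is at $x=1$. The inner maximization blows up wherever $g(x)>0$ and equals $f(x)$ on the feasible set, so the outer minimization recovers the constrained minimizer:

```python
import numpy as np

f = lambda x: x ** 2        # objective
g = lambda x: 1.0 - x       # constraint g(x) <= 0, i.e. x >= 1
L = lambda x, u: f(x) + u * g(x)

def max_over_u(x, u_grid):
    # Inner maximization over u >= 0 (a finite grid standing in for sup):
    # large when g(x) > 0, equal to f(x) when g(x) <= 0.
    return max(L(x, u) for u in u_grid)

u_grid = np.linspace(0.0, 100.0, 1001)
x_grid = np.linspace(-2.0, 3.0, 1001)

# Outer minimization over x recovers the constrained minimizer x = 1.
x_best = min(x_grid, key=lambda x: max_over_u(x, u_grid))
print(x_best)  # ~1.0, up to the grid resolution
```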
Part 2
Why is $L(\hat{x}, u)$ (with $\hat{x}$ the optimal point) equivalent to $\min_x L(x, u)$?
I think this can be shown by contradiction. Suppose instead that $L(x', u) = \min_x L(x, u)$ for some $x' \neq \hat{x}$. Then

$$
\max_u \min_x L(x, u) = \max_u L(x', u) = f(x') \neq f(\hat{x}),
$$

which contradicts $\max_u\min_x L(x,u) = f(\hat{x})$ (strong duality).
Lagrange
I think the link below [2] is very detailed.
References
[1] Lagrange and KKT conditions.
[2] Lagrange multiplier (拉格朗日乘数).