The Standard SVM Formulation
Given an implicit embedding $\Phi$ and training data $(x_i, y_i)$ drawn from two classes with labels $y_i = \pm 1$, a Support Vector Machine finds a hyperplane $w^T \Phi(x) + b = 0$ that best separates the two classes (see Fig. 1). The learnt hyperplane is optimal in the sense that it maximises the margin, the distance from the hyperplane to the closest training points, while minimising some measure of loss on the training data.
Figure 1. The SVM learns a hyperplane which best separates the two classes. Red dots have a label $y_i = +1$ while blue dots have a label $y_i = -1$.
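For concreteness, a maximum-margin hyperplane like the one in Fig. 1 can be fit in a few lines. This is a minimal sketch assuming scikit-learn and toy Gaussian data, neither of which is part of the original text, with a linear kernel so that $\Phi(x) = x$:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Two Gaussian blobs labelled +1 and -1, as in Fig. 1.
X = np.vstack([rng.normal(+2.0, 1.0, size=(20, 2)),
               rng.normal(-2.0, 1.0, size=(20, 2))])
y = np.hstack([np.ones(20), -np.ones(20)])

clf = SVC(kernel="linear", C=1.0).fit(X, y)
w, b = clf.coef_[0], clf.intercept_[0]  # the learnt hyperplane w^T x + b = 0
print("w =", w, "b =", b)
```

With a linear kernel `coef_` exposes $w$ directly; for an implicit embedding the hyperplane is only accessible through the dual expansion derived below.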
More formally, the primal formulation of the $\ell_1$ C-SVM is
$$
\min_{w,\, b,\, \xi} \;\; \frac{1}{2} w^T w + C \sum_i \xi_i
\quad \text{subject to} \quad
y_i \left( w^T \Phi(x_i) + b \right) \ge 1 - \xi_i, \quad \xi_i \ge 0 \;\; \forall i,
$$

where $C$ is a user-specified misclassification penalty and the $\xi_i$ are slack variables measuring each point's margin violation.
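To make the objective concrete, note that at the optimum each slack equals the hinge loss $\max(0,\, 1 - y_i(w^T \Phi(x_i) + b))$, so the constraints can be folded into the objective. The NumPy sketch below evaluates the primal objective under that standard rewriting, assuming a linear kernel for illustration:

```python
import numpy as np

def primal_objective(w, b, X, y, C):
    """Evaluate the l1 C-SVM primal objective 0.5*||w||^2 + C*sum(xi).

    At the optimum each slack xi_i equals the hinge loss
    max(0, 1 - y_i * (w^T x_i + b)), so the constrained problem can be
    evaluated in this unconstrained form. Assumes a linear kernel,
    i.e. Phi(x) = x.
    """
    margins = y * (X @ w + b)                # y_i * (w^T x_i + b)
    slacks = np.maximum(0.0, 1.0 - margins)  # xi_i
    return 0.5 * np.dot(w, w) + C * np.sum(slacks)
```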
The primal variables cannot be solved for directly since $w$ is often infinite dimensional and $\Phi$ is unspecified. The solution is instead obtained by moving to a dual formulation. First, the Lagrangian is formed by adding the constraints, each weighted by a Lagrange multiplier, to the objective. Next, it is shown that for this problem the order of optimisation, an outer minimisation over the primal variables around an inner maximisation over the Lagrange multipliers, can be switched. This is helpful because the minimisation over the primal variables can then be carried out first, analytically, which simplifies many terms.
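To sketch that step (the algebra is standard, though not spelled out in the original): introducing multipliers $\alpha_i \ge 0$ for the margin constraints and $\mu_i \ge 0$ for the constraints $\xi_i \ge 0$, the Lagrangian is

$$
L(w, b, \xi, \alpha, \mu) = \frac{1}{2} w^T w + C \sum_i \xi_i
- \sum_i \alpha_i \left[ y_i \left( w^T \Phi(x_i) + b \right) - 1 + \xi_i \right]
- \sum_i \mu_i \xi_i ,
$$

and setting its derivatives with respect to the primal variables $w$, $b$ and $\xi_i$ to zero yields

$$
w = \sum_i \alpha_i y_i \Phi(x_i), \qquad \sum_i \alpha_i y_i = 0, \qquad \alpha_i = C - \mu_i .
$$

Substituting these conditions back eliminates the primal variables and leads to the following simplified dual formulation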
$$
\max_{\alpha} \;\; \mathbf{1}^T \alpha - \frac{1}{2} \alpha^T Y K Y \alpha
\quad \text{subject to} \quad
\mathbf{1}^T Y \alpha = 0, \quad 0 \le \alpha_i \le C ,
$$
where $Y$ is a diagonal matrix with the labels on the diagonal and $K$ is the kernel (Gram) matrix with entries $K_{ij} = \Phi(x_i)^T \Phi(x_j)$. The dual is an instance of a convex Quadratic Programming problem, so any local optimum is also a global optimum. Having solved for $\alpha$, the normal to the separating hyperplane turns out to be $w = \sum_i y_i \alpha_i \Phi(x_i)$, and $b$ can be recovered from $w^T \Phi(x_i) + b = y_i$ for any support vector $x_i$ with $0 < \alpha_i < C$. A novel point $x$ can now be classified as $\pm 1$ by evaluating $\operatorname{sign}(w^T \Phi(x) + b)$; since $w$ may be infinite dimensional, this is computed through the kernel as $\operatorname{sign}\left(\sum_i y_i \alpha_i \Phi(x_i)^T \Phi(x) + b\right)$.
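As a final sketch (again assuming scikit-learn, which is not part of the original text; its `dual_coef_` attribute stores the products $y_i \alpha_i$ for the support vectors only), the dual solution can be read off and used to classify a novel point:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(+2.0, 1.0, size=(20, 2)),
               rng.normal(-2.0, 1.0, size=(20, 2))])
y = np.hstack([np.ones(20), -np.ones(20)])

clf = SVC(kernel="linear", C=1.0).fit(X, y)

alpha_y = clf.dual_coef_[0]        # y_i * alpha_i, support vectors only
sv = clf.support_vectors_          # the x_i with alpha_i > 0

# w = sum_i y_i alpha_i Phi(x_i); explicit here only because Phi(x) = x.
w = alpha_y @ sv
b = clf.intercept_[0]

x_new = np.array([0.5, 1.5])                 # a novel point
print(np.sign(w @ x_new + b))                # sign(w^T Phi(x) + b)
print(clf.predict(x_new[None, :])[0])        # agrees with the library
```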