Classifier Reviews - #1
SVM (Support Vector Machine)
1. Concept
SVM is a classifier: for two sets of linearly separable data, it finds a line (more generally, a hyperplane) that separates them. This line is special because it sits right in the middle of the two sets: it is the line with the largest distance (the margin) to the closest data points. Because the margin is maximal, the line still works well when out-of-sample data are added to the sets.
Removing a data point that lies on the margin (a support vector) will change the decision boundary; removing any other point will not.
Datasets with a clear classification boundary work best with SVMs.
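As a minimal sketch of these ideas (assuming scikit-learn is available; the data is made up for illustration), `SVC` with a linear kernel exposes exactly which points act as support vectors:

```python
import numpy as np
from sklearn.svm import SVC

# Two linearly separable clusters in 2-D.
rng = np.random.RandomState(0)
X = np.vstack([rng.randn(20, 2) - [3, 3],   # class 0
               rng.randn(20, 2) + [3, 3]])  # class 1
y = np.array([0] * 20 + [1] * 20)

# With separable data, a large C approximates the hard-margin SVM.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

# Only the support vectors pin down the maximal-margin line;
# removing any non-support-vector point leaves it unchanged.
print("support vectors:\n", clf.support_vectors_)
print("w =", clf.coef_[0], "b =", clf.intercept_[0])
```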
2. Parameters
1. C
C is the misclassification penalty.
The higher the C, the less tolerance for misclassification, which may result in over-fitting.
The lower the C, the more tolerance for misclassification, which may result in under-fitting.
A C that is too big or too small lowers the model's ability to generalize.
Hard margin: the SVM allows very little classification error (large C).
Soft margin: also called a noisy linear SVM; it tolerates some misclassified points (small C).
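A rough sketch of this trade-off (the C values here are arbitrary picks, not recommendations), using overlapping blobs so that some misclassification is unavoidable:

```python
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Overlapping clusters: a hard margin is impossible here.
X, y = make_blobs(n_samples=200, centers=2, cluster_std=3.0, random_state=0)

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    # Small C: soft margin, tolerates violations, wider margin.
    # Large C: approaches a hard margin, fits the training set tightly.
    print(f"C={C:>6}: train accuracy={clf.score(X, y):.3f}, "
          f"support vectors={clf.support_vectors_.shape[0]}")
```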
2. Kernel
The selection of the kernel in SVM is very important, especially for data that is not linearly separable. The goal is to project the linearly inseparable data into a high-dimensional feature space where it becomes linearly separable. We denote this projection by $\Phi(x)$.
During optimization, inner products of the form $\Phi(x_i) \cdot \Phi(x_j)$ appear, and computing them in the high-dimensional space would be very expensive. So we introduce a kernel function $k(x_i, x_j) = \Phi(x_i) \cdot \Phi(x_j)$ that evaluates the same inner product directly in the input space, which is much faster.
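As a sanity check of the kernel trick (plain NumPy; the hand-written feature map is only for this illustration): for the degree-2 polynomial kernel, evaluating $k(x, z) = ((x \cdot z) + 1)^2$ in the original 2-D space gives the same number as first projecting into the 6-D feature space and taking the inner product there:

```python
import numpy as np

x = np.array([1.0, 2.0])
z = np.array([3.0, 0.5])

def phi(v):
    """Explicit feature map whose inner product equals ((x . z) + 1)^2."""
    v1, v2 = v
    return np.array([1.0,
                     np.sqrt(2) * v1, np.sqrt(2) * v2,
                     v1 ** 2, v2 ** 2,
                     np.sqrt(2) * v1 * v2])

explicit = phi(x) @ phi(z)      # inner product in the 6-D feature space
kernel = (x @ z + 1) ** 2       # same value, computed in the 2-D input space

print(explicit, kernel)         # both are 25.0
```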
Here are some kernels that are often used in SVM:
| Name | Usage | Function |
| --- | --- | --- |
| Linear kernel | Mainly used for linearly separable data, and also when there is a large number of features | $k(x, x_i) = x \cdot x_i$ |
| Polynomial kernel | Can achieve the projection, but has many parameters; when the degree $d$ is high, entries of the kernel matrix get close to zero and the computational complexity becomes huge | $k(x, x_i) = ((x \cdot x_i) + 1)^d$ |
| RBF kernel | For linearly inseparable data; few parameters; suited to a normal number of samples with a smaller number of features; when you don't know what to use, try this one first (the most used one) | $k(x, x_i) = \exp\left(-\frac{\lVert x - x_i \rVert^2}{\sigma^2}\right)$ |
| Sigmoid kernel | Makes the SVM behave like a neural network (a multilayer perceptron) | $k(x, x_i) = \tanh(\kappa (x \cdot x_i) + c)$ |
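To make the table concrete, here is a short sketch (dataset and default settings are arbitrary choices) comparing the four kernels on data that is not linearly separable:

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two interleaving half-moons: not linearly separable.
X, y = make_moons(n_samples=400, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for kernel in ("linear", "poly", "rbf", "sigmoid"):
    clf = SVC(kernel=kernel).fit(X_train, y_train)
    print(f"{kernel:>8}: test accuracy = {clf.score(X_test, y_test):.3f}")
```

On this kind of data the RBF kernel usually comes out ahead, matching the "try this one first" advice in the table.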
3. gamma
gamma is a parameter of the RBF kernel; in the formula above it plays the role of $1/\sigma^2$, so a larger gamma means each training example's influence reaches less far.
The greater the gamma, the fewer the support vectors; the smaller the gamma, the more support vectors. The number of support vectors affects the training and prediction speed.
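One way to observe this (a sketch with arbitrary gamma values; in scikit-learn's parameterization, `gamma` corresponds to $1/\sigma^2$ in the RBF formula above) is to count support vectors as gamma varies:

```python
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.2, random_state=0)

for gamma in (0.01, 0.1, 1.0, 10.0, 100.0):
    clf = SVC(kernel="rbf", gamma=gamma).fit(X, y)
    # Prediction sums one kernel term per support vector, so the
    # support-vector count directly drives prediction speed.
    print(f"gamma={gamma:>6}: support vectors={clf.n_support_.sum()}, "
          f"train accuracy={clf.score(X, y):.3f}")
```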