Machine Learning–SVMs
Basic Introduction
linearly separable: the case where the two classes of data can be separated by a straight line
hyperplane: the decision boundary; in general it is (N-1)-dimensional
margin: the distance from the closest points to the hyperplane; the functional margin is label * (wᵀx + b), and dividing it by ‖w‖ gives the geometric distance
support vectors: the point(s) closest to the hyperplane
The goal of an SVM is to find the hyperplane with the largest margin.
First find the n points closest to the hyperplane, then compute their margins, and finally maximize the sum of those n margins.
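The margin definitions above can be checked with a tiny sketch. The hyperplane weights and the two toy points below are made up for illustration; the point is that a correctly classified point always has a positive functional margin:

```python
import numpy as np

# hypothetical hyperplane w^T x + b = 0, with w = (1, -1) and b = 0 (made up for illustration)
w = np.array([1.0, -1.0])
b = 0.0

# two toy points with labels in {+1, -1}; both lie on the correct side of the hyperplane
points = [(np.array([2.0, 0.5]), 1), (np.array([0.5, 2.0]), -1)]

for x, label in points:
    functional_margin = label * (w @ x + b)                   # label * (wT x + b)
    geometric_margin = functional_margin / np.linalg.norm(w)  # true distance to the hyperplane
    print(functional_margin, geometric_margin)
```

Both points here happen to have the same functional margin (1.5); the geometric margin is smaller because ‖w‖ = √2 > 1.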
To simplify the computation, Lagrange multipliers are used in place of the direct calculation above:
Note that this assumes all the data are linearly separable; because real data are almost never perfectly separable, slack variables are introduced, with C a constant we set.
The goal of the SVM therefore becomes finding the αs.
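For reference, the αs are the solution of the standard soft-margin dual problem (written here in generic textbook notation, with \(y_i\) the labels and \(m\) the number of samples; this form is not spelled out in these notes):

```latex
\max_{\boldsymbol{\alpha}} \; \sum_{i=1}^{m} \alpha_i
  - \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m} y_i y_j \alpha_i \alpha_j \, x_i^{\top} x_j
\quad \text{subject to} \quad 0 \le \alpha_i \le C, \qquad \sum_{i=1}^{m} \alpha_i y_i = 0
```

The box constraint \(0 \le \alpha_i \le C\) is exactly where the slack-variable constant C enters, and it is why the code below repeatedly clips the αs into a range.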
Sequential Minimal Optimization
SMO is the optimization algorithm used to find the αs.
simple SMO
# imports needed by the functions below
from numpy import *
import random

# open the file and parse each line into class labels and a data matrix
def loadDataSet(fileName):
    dataMat = []
    labelMat = []
    fr = open(fileName)
    for line in fr.readlines():
        lineArr = line.strip().split('\t')
        dataMat.append([float(lineArr[0]), float(lineArr[1])])
        labelMat.append(float(lineArr[2]))
    return dataMat, labelMat
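loadDataSet expects tab-separated lines of x1, x2, label. A self-contained sanity check using a throwaway file (the two sample rows are made up):

```python
import os
import tempfile

# loadDataSet as defined above
def loadDataSet(fileName):
    dataMat = []
    labelMat = []
    fr = open(fileName)
    for line in fr.readlines():
        lineArr = line.strip().split('\t')
        dataMat.append([float(lineArr[0]), float(lineArr[1])])
        labelMat.append(float(lineArr[2]))
    return dataMat, labelMat

# write two made-up tab-separated samples, then parse them back
with tempfile.NamedTemporaryFile('w', suffix='.txt', delete=False) as f:
    f.write("3.5\t1.2\t1\n0.4\t2.8\t-1\n")
    path = f.name
dataMat, labelMat = loadDataSet(path)
os.remove(path)
print(dataMat)   # [[3.5, 1.2], [0.4, 2.8]]
print(labelMat)  # [1.0, -1.0]
```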
# return a random alpha index that differs from the current one (i); m is the total number of alphas
def selectJrand(i, m):
    j = i
    while (j == i):
        j = int(random.uniform(0, m))
    return j
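A quick check of selectJrand's contract (the function is repeated here so the snippet runs on its own):

```python
import random

# selectJrand as defined above: draw a random index in [0, m) different from i
def selectJrand(i, m):
    j = i
    while (j == i):
        j = int(random.uniform(0, m))
    return j

# with m = 5 alphas and i = 2, every draw comes from {0, 1, 3, 4}
draws = [selectJrand(2, 5) for _ in range(100)]
print(sorted(set(draws)))
```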
# clip aj so that it stays within the range [L, H]
def clipAlpha(aj, H, L):
    if aj > H:
        aj = H
    if L > aj:
        aj = L
    return aj
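clipAlpha is just a clamp; the three cases it handles (the function is repeated so the snippet is self-contained):

```python
# clipAlpha as defined above: clamp aj into the interval [L, H]
def clipAlpha(aj, H, L):
    if aj > H:
        aj = H
    if L > aj:
        aj = L
    return aj

print(clipAlpha(1.5, 1.0, 0.0))   # above H, clipped down to 1.0
print(clipAlpha(-0.2, 1.0, 0.0))  # below L, clipped up to 0.0
print(clipAlpha(0.4, 1.0, 0.0))   # already inside [L, H], returned unchanged
```

Note the slightly unusual argument order (H before L), which the SMO loop below relies on.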
# toler: tolerance; maxIter: max number of passes through the data before quitting
def smoSimple(dataMatIn, classLabels, C, toler, maxIter):
    dataMatrix = mat(dataMatIn)
    # a column matrix
    labelMat = mat(classLabels).transpose()
    b = 0
    m, n = shape(dataMatrix)
    # a column matrix initialized to zero
    alphas = mat(zeros((m, 1)))
    # each time we go through the dataset without any alphas changing, iter increases by 1
    iter = 0
    while (iter < maxIter):
        # counts how many alpha pairs changed during this pass through the dataset
        alphaPairsChanged = 0
        for i in range(m):
            # our prediction of the class
            fXi = float(multiply(alphas, labelMat).T * (dataMatrix * dataMatrix[i, :].T)) + b
            # the error between the prediction and the real class
            Ei = fXi - float(labelMat[i])
            # if the error is large enough and alphas[i] (which will be changed later) is in the
            # valid range, alphas[i] can be optimized
            if ((labelMat[i] * Ei < -toler) and (alphas[i] < C)) or ((labelMat[i] * Ei > toler) and (alphas[i] > 0)):
                # randomly select a second alpha, alphas[j] (the first alpha is alphas[i])
                j = selectJrand(i, m)
                # calculate the prediction and error as done for alphas[i]
                fXj = float(multiply(alphas, labelMat).T * (dataMatrix * dataMatrix[j, :].T)) + b
                Ej = fXj - float(labelMat[j])
                # save the old values of alphas[i] and alphas[j]
                alphaIold = alphas[i].copy()
                alphaJold = alphas[j].copy()
                # compute L and H so that alphas[j] will stay between 0 and C
                if (labelMat[i] != labelMat[j]):
                    L = max(0, alphas[j] - alphas[i])
                    H = min(C, C + alphas[j] - alphas[i])
                else:
                    L = max(0, alphas[j] + alphas[i] - C)
                    H = min(C, alphas[i] + alphas[j])
                # if L == H, nothing can be changed
                if L == H:
                    print("L == H")
                    continue
                # eta is the optimal amount to change alphas[j]
                eta = 2.0 * dataMatrix[i, :] * dataMatrix[j, :].T - dataMatrix[i, :] * dataMatrix[i, :].T - dataMatrix[j, :] * dataMatrix[j, :].T