Differentiating the hinge loss in Fei-Fei Li's CS231n

While working through the SVM part of CS231n 2020 Assignment 1, I ran into trouble implementing the gradient (derivative) of the hinge loss, so I am writing it down here.
First, the multiclass hinge loss for a single example $(x_i, y_i)$:

$$L_i=\sum_{j\neq y_i}\max(0,\ w_j^Tx_i-w_{y_i}^Tx_i+\Delta)$$

where $\Delta=1$. Taking partial derivatives with respect to the columns of $W$, only the terms with a positive margin contribute, which gives:

$$\frac{\partial L_i}{\partial w_j}=\mathbb{1}\!\left(w_j^Tx_i-w_{y_i}^Tx_i+\Delta>0\right)x_i$$

$$\frac{\partial L_i}{\partial w_{y_i}}=-\left(\sum_{j\neq y_i}\mathbb{1}\!\left(w_j^Tx_i-w_{y_i}^Tx_i+\Delta>0\right)\right)x_i$$

where $\mathbb{1}(\cdot)$ is the indicator function (1 when the condition holds, 0 otherwise).
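To see where the indicator comes from, write $m_j=w_j^Tx_i-w_{y_i}^Tx_i+\Delta$ and differentiate a single term of the sum case by case (a restatement of the standard derivation, not taken from the course handout):

$$\frac{\partial}{\partial w_j}\max(0,\ m_j)=\begin{cases}x_i, & m_j>0\\ 0, & m_j\le 0\end{cases}\qquad\frac{\partial}{\partial w_{y_i}}\max(0,\ m_j)=\begin{cases}-x_i, & m_j>0\\ 0, & m_j\le 0\end{cases}$$

Summing over $j\neq y_i$ recovers the two formulas above. In code, this means that every time a margin comes out positive we add $x_i$ to column $j$ of the gradient and subtract $x_i$ from column $y_i$, which is exactly what the inner loop below does.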
The final loop-based (naive) implementation:

import numpy as np


def svm_loss_naive(W, X, y, reg):
    """
    Structured SVM loss and gradient, naive loop implementation.
    W: (D, C) weights; X: (N, D) data; y: (N,) labels; reg: regularization strength.
    """
    dW = np.zeros(W.shape)  # initialize the gradient as zero

    # compute the loss and the gradient
    num_classes = W.shape[1]
    num_train = X.shape[0]
    loss = 0.0
    for i in range(num_train):
        scores = X[i].dot(W)
        # scores has shape (C,), e.g. (10,) for CIFAR-10
        correct_class_score = scores[y[i]]
        for j in range(num_classes):
            if j == y[i]:
                continue
            margin = scores[j] - correct_class_score + 1 # note delta = 1
            if margin > 0:
                loss += margin
                dW[:, y[i]] -= X[i]  # a positive margin subtracts x_i from the correct class column
                dW[:, j] += X[i]     # and adds x_i to column j

    # Right now the loss is a sum over all training examples, but we want it
    # to be an average instead so we divide by num_train.
    loss /= num_train

    # Add regularization to the loss.
    loss += reg * np.sum(W * W)

    #############################################################################
    # TODO:                                                                     #
    # Compute the gradient of the loss function and store it in dW.             #
    # Rather than first computing the loss and then computing the derivative,   #
    # it may be simpler to compute the derivative at the same time that the     #
    # loss is being computed. As a result you may need to modify some of the    #
    # code above to compute the gradient.                                       #
    #############################################################################
    # *****START OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****

    dW /= num_train
    dW += 2 * reg * W  # gradient of the reg * sum(W * W) term is 2 * reg * W

    # *****END OF YOUR CODE (DO NOT DELETE/MODIFY THIS LINE)*****
    
    return loss, dW
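
As a sanity check, the analytic gradient can be compared against a numeric central difference at a few randomly chosen entries of W. This is a minimal sketch with made-up CIFAR-10-like shapes (the course notebook uses its own grad_check_sparse helper for the same purpose):

np.random.seed(0)
W = np.random.randn(3073, 10) * 0.0001   # placeholder shapes: D = 3073, C = 10
X = np.random.randn(5, 3073)             # 5 made-up training examples
y = np.random.randint(10, size=5)

loss, dW = svm_loss_naive(W, X, y, reg=0.1)
h = 1e-5
for _ in range(5):
    ix = tuple(np.random.randint(s) for s in W.shape)  # random entry of W
    W[ix] += h
    loss_plus, _ = svm_loss_naive(W, X, y, reg=0.1)
    W[ix] -= 2 * h
    loss_minus, _ = svm_loss_naive(W, X, y, reg=0.1)
    W[ix] += h                                         # restore W
    numeric = (loss_plus - loss_minus) / (2 * h)
    print(ix, numeric, dW[ix])                         # the two values should match closely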

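The double loop is slow on the full training set; the next part of the assignment asks for svm_loss_vectorized. Here is a minimal sketch of one standard way to vectorize the same computation (not the official solution; variable names are my own):

def svm_loss_vectorized(W, X, y, reg):
    num_train = X.shape[0]
    scores = X.dot(W)                                        # (N, C)
    correct = scores[np.arange(num_train), y]                # (N,)
    margins = np.maximum(0, scores - correct[:, None] + 1)   # delta = 1
    margins[np.arange(num_train), y] = 0                     # drop the j == y_i terms
    loss = margins.sum() / num_train + reg * np.sum(W * W)

    # Each positive margin contributes +x_i to column j and -x_i to column y_i.
    binary = (margins > 0).astype(X.dtype)                   # (N, C)
    binary[np.arange(num_train), y] = -binary.sum(axis=1)
    dW = X.T.dot(binary) / num_train + 2 * reg * W
    return loss, dW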