cs231n - assignment1 - softmax 梯度推导

最新推荐文章于 2025-03-17 20:33:28 发布

蜗牛一步一步往上爬

最新推荐文章于 2025-03-17 20:33:28 发布

阅读量1.6w

点赞数 13

分类专栏： machine learning 文章标签： cs231n

本文链接：https://blog.csdn.net/yc461515457/article/details/51924604

版权

这篇博客介绍了如何实现和优化Softmax分类器，包括完全向量化损失函数、梯度推导、数值梯度检查、学习率和正则化参数的调整，以及权重可视化。通过softmax的链式法则，详细解释了在网络复杂时如何求导。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Softmax exercise

Complete and hand in this completed worksheet (including its outputs and any supporting code outside of the worksheet) with your assignment submission. For more details see the assignments page on the course website.

This exercise is analogous to the SVM exercise. You will:

- implement a fully-vectorized loss function for the Softmax classifier
- implement the fully-vectorized expression for its analytic gradient
- check your implementation with numerical gradient
- use a validation set to tune the learning rate and regularization strength
- optimize the loss function with SGD
- visualize the final learned weights

和linear_svm一样，主要难点是求导操作，不过softmax的求导更简单一些。
首先还是给出 Loss 的公式：